Model Catalog
Explore and compare all AI models available on our platform

GPT-4 Turbo
OpenAI
Most capable GPT-4 model with 128K context window, optimized for chat and complex reasoning tasks. Best for production applications requiring high accuracy.
Input
$10.00/1M tokens
Output
$30.00/1M tokens
Context
128K
Speed
85 t/s
GPT-4o
OpenAI
Omni model with multimodal capabilities including text, vision, and audio understanding. Faster and more efficient than GPT-4 Turbo.
Input
$5.00/1M tokens
Output
$15.00/1M tokens
Context
128K
Speed
120 t/s
GPT-3.5 Turbo
OpenAI
Fast and cost-effective model for simple tasks. Great for chatbots, content generation, and basic reasoning.
Input
$0.50/1M tokens
Output
$1.50/1M tokens
Context
16K
Speed
150 t/s
Claude 3 Opus
Anthropic
Most powerful Claude model with exceptional performance on complex tasks, analysis, and creative writing. 200K context window.
Input
$15.00/1M tokens
Output
$75.00/1M tokens
Context
200K
Speed
65 t/s
Claude 3 Sonnet
Anthropic
Balanced model offering strong performance at a lower cost. Ideal for most enterprise workloads and customer-facing applications.
Input
$3.00/1M tokens
Output
$15.00/1M tokens
Context
200K
Speed
90 t/s
Claude 3 Haiku
Anthropic
Fastest and most compact Claude model. Perfect for high-volume, low-latency applications and real-time interactions.
Input
$0.25/1M tokens
Output
$1.25/1M tokens
Context
200K
Speed
140 t/s
Gemini 1.5 Pro
Advanced multimodal model with 1M token context window. Exceptional for long-document analysis, video understanding, and complex reasoning.
Input
$3.50/1M tokens
Output
$10.50/1M tokens
Context
1.0M
Speed
95 t/s
Gemini 1.5 Flash
Lightweight multimodal model optimized for speed and efficiency. Great for high-frequency tasks with multimodal inputs.
Input
$0.35/1M tokens
Output
$1.05/1M tokens
Context
1.0M
Speed
130 t/s
Llama 3 70B Instruct
Meta
Open-source model with strong performance on instruction following and reasoning. Cost-effective alternative to proprietary models.
Input
$0.90/1M tokens
Output
$0.90/1M tokens
Context
8K
Speed
145 t/s
Llama 3 8B Instruct
Meta
Compact open-source model perfect for edge deployment and high-throughput applications. Excellent price-performance ratio.
Input
$0.20/1M tokens
Output
$0.20/1M tokens
Context
8K
Speed
180 t/s
Mistral Large
Mistral
European flagship model with strong multilingual capabilities and reasoning. Excellent for enterprise applications requiring data sovereignty.
Input
$8.00/1M tokens
Output
$24.00/1M tokens
Context
32K
Speed
80 t/s
Mistral Medium
Mistral
Balanced model offering strong performance across various tasks. Good for general-purpose applications with multilingual support.
Input
$2.70/1M tokens
Output
$8.10/1M tokens
Context
32K
Speed
100 t/s
Mistral Small
Mistral
Cost-effective model for simple tasks and high-volume applications. Fast and efficient for basic reasoning and content generation.
Input
$1.00/1M tokens
Output
$3.00/1M tokens
Context
32K
Speed
135 t/s
DALL-E 3
OpenAI
Advanced image generation model with exceptional quality and prompt adherence. Supports 1024x1024, 1024x1792, and 1792x1024 resolutions.
Image Price
$0.04/image
Context
N/A
Stable Diffusion XL
Stability AI
Open-source image generation model with high-quality outputs. Cost-effective alternative for image generation at scale.
Image Price
$0.02/image
Context
N/A
Whisper Large V3
OpenAI
State-of-the-art speech recognition model supporting 99 languages. Exceptional accuracy for transcription and translation tasks.
Audio Price
$0.01/minute
Context
N/A
GPT-4 Vision
OpenAI
GPT-4 with vision capabilities for image understanding and analysis. Perfect for visual question answering and image description.
Input
$10.00/1M tokens
Output
$30.00/1M tokens
Context
128K
Speed
75 t/s
Command R+
Cohere
Enterprise-grade model optimized for RAG and tool use. Excellent for building AI assistants with external knowledge integration.
Input
$3.00/1M tokens
Output
$15.00/1M tokens
Context
128K
Speed
90 t/s
Mixtral 8x7B Instruct
Mistral
Mixture-of-experts model with 47B parameters. Outperforms many larger models while being more efficient and cost-effective.
Input
$0.70/1M tokens
Output
$0.70/1M tokens
Context
32K
Speed
155 t/s
PaLM 2
Legacy Google model with strong multilingual and reasoning capabilities. Being phased out in favor of Gemini models.
Input
$1.25/1M tokens
Output
$2.50/1M tokens
Context
8K
Speed
100 t/s