AI Models
Explore our full catalog of models from leading providers. Compare pricing, capabilities, and performance to find the right fit for your application.
Showing 25 of 25 models
| Model | Provider | Tier | Context | Speed | |||
|---|---|---|---|---|---|---|---|
| Claude Haiku 3.5 | Anthropic | Economy | 200K | $0.80 | $4 | fast | 4.4 |
| Claude Opus 4 | Anthropic | Frontier | 200K | $15 | $75 | standard | 4.9 |
| Claude Sonnet 4 | Anthropic | Frontier | 200K | $3 | $15 | fast | 4.7 |
| Codestral | Mistral | Balanced | 32K | $0.30 | $0.90 | fast | 4.3 |
| Command R | Cohere | Economy | 128K | $0.15 | $0.60 | fast | 4.1 |
| Command R+ | Cohere | Balanced | 128K | $3 | $10 | standard | 4.3 |
| DeepSeek R1 | DeepSeek | Balanced | 128K | $0.55 | $2 | standard | 4.7 |
| DeepSeek V3 | DeepSeek | Balanced | 128K | $0.27 | $1 | fast | 4.5 |
| Gemini 2.0 Flash | Economy | 1M | $0.075 | $0.30 | fast | 4.5 | |
| Gemini 2.5 ProNew | Frontier | 1M | $4 | $11 | standard | 4.7 | |
| Gemini Ultra | Frontier | 128K | $20 | $60 | slow | 4.6 | |
| GPT-4o | OpenAI | Frontier | 128K | $3 | $10 | fast | 4.6 |
| GPT-4o Mini | OpenAI | Economy | 128K | $0.15 | $0.60 | fast | 4.3 |
| GPT-5.2 CodexNew | OpenAI | Frontier | 256K | $12 | $48 | standard | 4.7 |
| GPT-5.4 | OpenAI | Frontier | 256K | $15 | $60 | standard | 4.8 |
| Grok 3 | xAI | Frontier | 131K | $3 | $15 | fast | 4.4 |
| Grok 4New | xAI | Frontier | 256K | $5 | $15 | standard | 4.6 |
| Llama 3.1 70B | Meta | Balanced | 128K | $0.59 | $0.79 | fast | 4.4 |
| Llama 3.1 8B | Meta | Economy | 128K | $0.050 | $0.080 | fast | 4.1 |
| Llama 3.3 70BNew | Meta | Balanced | 128K | $0.60 | $0.60 | fast | 4.5 |
| Llama 3.3 70B (Groq) | Groq | Balanced | 128K | $0.59 | $0.79 | fast | 4.4 |
| Mistral 7B | Mistral | Economy | 32K | $0.040 | $0.040 | fast | 4.0 |
| Mistral Large | Mistral | Frontier | 128K | $2 | $6 | standard | 4.5 |
| o3New | OpenAI | Frontier | 200K | $10 | $40 | slow | 4.9 |
| o3 Mini | OpenAI | Balanced | 200K | $1 | $4 | standard | 4.4 |
Claude Haiku 3.5
Fast and affordable model optimized for lightweight tasks. Excellent for customer support, content moderation, and high-volume processing.
Claude Opus 4
Anthropic's most powerful model. Exceptional at extended analysis, nuanced writing, advanced coding, and agentic tasks. Known for safety, reliability, and instruction-following.
Claude Sonnet 4
Ideal balance of intelligence and speed. Strong coding abilities, excellent instruction following, and reliable for production workloads.
Codestral
Specialized code model from Mistral supporting 80+ programming languages. Optimized for code completion, generation, and explanation.
Command R
Efficient model for retrieval-augmented generation and enterprise search. Cost-effective for high-volume document processing.
Command R+
Enterprise-focused model optimized for RAG workflows and tool use. Excellent at following complex instructions with structured outputs.
DeepSeek R1
State-of-the-art reasoning model with transparent chain-of-thought. Excels at mathematics, logic, and complex problem decomposition.
DeepSeek V3
High-performance general model offering impressive quality-to-cost ratio. Strong at code generation and multilingual tasks.
Gemini 2.0 Flash
Ultra-fast model with excellent multimodal understanding. Optimized for speed-critical applications while maintaining strong quality.
Gemini 2.5 Pro
NewGoogle's most capable model with advanced reasoning and a massive context window. Excellent for complex research, long-document analysis, and multi-step problem solving.
Gemini Ultra
Top-tier reasoning model with deep analytical capabilities. Best for research, science, and complex professional workflows.
GPT-4o
Multimodal model with strong performance at an accessible price point. Processes text, images, and audio with fast response times.
GPT-4o Mini
Cost-efficient model for lightweight tasks. Ideal for classification, extraction, summarization, and simple conversational AI applications.
GPT-5.2 Codex
NewSpecialized code generation model optimized for software engineering tasks. Superior at code completion, debugging, refactoring, and multi-file editing.
GPT-5.4
Most advanced reasoning model from OpenAI. Excels at complex multi-step tasks, code generation, and nuanced analysis with state-of-the-art performance across all benchmarks.
Grok 3
Strong general-purpose model with witty personality and broad knowledge. Fast and efficient for conversational AI and analysis.
Grok 4
NewxAI's most capable model with exceptional reasoning, real-time knowledge, and unfiltered analytical capabilities.
Llama 3.1 70B
Proven open-weight model with excellent multilingual support and strong coding abilities. Popular choice for cost-effective deployments.
Llama 3.1 8B
Lightweight open model optimized for edge deployment and high throughput. Great for simple tasks, embeddings, and cost-sensitive applications.
Llama 3.3 70B
NewLatest open-weight model from Meta with significant improvements in reasoning and instruction following. Strong general-purpose performance.
Llama 3.3 70B (Groq)
Llama 3.3 70B served on Groq's custom LPU hardware for ultra-low latency inference. Ideal for real-time applications requiring instant responses.
Mistral 7B
Compact model punching above its weight class. Efficient for moderate tasks with surprisingly good language understanding.
Mistral Large
Mistral's flagship model with strong multilingual and coding capabilities. Competitive with frontier models at a lower price point.
o3
NewAdvanced reasoning model that uses chain-of-thought to solve complex problems. Excels in mathematics, science, and logical reasoning with verifiable step-by-step explanations.
o3 Mini
Efficient reasoning model that balances chain-of-thought capabilities with speed. Good for tasks requiring structured thinking at a lower cost.