8 providers, 67 models compared side by side. Sorted by cost. Updated April 26, 2026.
| Provider | Model | Input / 1M | Output / 1M | Context | Type |
|---|---|---|---|---|---|
| Perplexity | pplx-embed-v1-0.6b | $0.0040 | $0.0040 | N/A | embedding |
| Perplexity | pplx-embed-context-v1-0.6b | $0.0080 | $0.0080 | N/A | embedding |
| Together AI | LFM2 24B A2B | $0.03 | $0.12 | N/A | chat |
| Perplexity | pplx-embed-v1-4b | $0.03 | $0.03 | N/A | embedding |
| Groq | Llama 3.1 8B Instant 128k | $0.05 | $0.08 | 128k | chat |
| Together AI | gpt-oss-20B | $0.05 | $0.20 | N/A | chat |
| Perplexity | pplx-embed-context-v1-4b | $0.05 | $0.05 | N/A | embedding |
| Together AI | Gemma 3n E4B Instruct | $0.06 | $0.12 | N/A | chat |
| Groq | GPT OSS 20B 128k | $0.07 | $0.30 | 128k | chat |
| Groq | GPT OSS Safeguard 20B | $0.07 | $0.30 | 128k | chat |
| Google Gemini | Gemini 2.5 Flash-Lite | $0.10 | $0.40 | Not available | chat |
| Together AI | Qwen3.5 9B | $0.10 | $0.15 | 128K | chat |
| Together AI | Llama 3 8B Instruct Lite | $0.10 | $0.10 | 128K | chat |
| Groq | Llama 4 Scout (17Bx16E) 128k | $0.11 | $0.34 | 128k | chat |
| Groq | GPT OSS 120B 128k | $0.15 | $0.60 | 128k | chat |
| Together AI | gpt-oss-120B | $0.15 | $0.60 | N/A | chat |
| Together AI | Rnj-1 Instruct | $0.15 | $0.15 | N/A | chat |
| Google Gemini | Gemini Embedding 2 | $0.20 | Free | Not available | embedding |
| Together AI | Gemma 4 31B | $0.20 | $0.50 | N/A | chat |
| xAI | grok-4-1-fast-reasoning | $0.20 | $0.50 | 2M | reasoning |
| xAI | grok-4-1-fast-non-reasoning | $0.20 | $0.50 | 2M | non-reasoning |
| Google Gemini | Gemini 3.1 Flash-Lite Preview | $0.25 | $1.50 | Not available | chat |
| Groq | Qwen3 32B 131k | $0.29 | $0.59 | 131k | chat |
| Google Gemini | Gemini 2.5 Flash | $0.30 | $2.50 | 1M | chat |
| Together AI | MiniMax M2.7 | $0.30 | $1.20 | N/A | chat |
| Together AI | MiniMax M2.5 | $0.30 | $1.20 | N/A | chat |
| Together AI | Qwen2.5 7B Instruct Turbo | $0.30 | $0.30 | 128K | chat |
| Cohere | Command-light | $0.30 | $0.60 | Not specified | generative |
| Together AI | Qwen3-Coder-Next | $0.50 | $1.20 | 128K | chat |
| Together AI | Kimi K2.5 | $0.50 | $2.80 | N/A | chat |
| Cohere | Command R 03-2024 | $0.50 | $1.50 | Not specified | generative |
| Cohere | Aya Expanse 8B | $0.50 | $1.50 | Not specified | generative |
| Cohere | Aya Expanse 32B | $0.50 | $1.50 | Not specified | generative |
| Groq | Llama 3.3 70B Versatile 128k | $0.59 | $0.79 | 128k | chat |
| Together AI | Qwen3.5-397B-A17B | $0.60 | $3.60 | 128K | chat |
| Together AI | DeepSeek-V3.1 | $0.60 | $1.70 | 128K | chat |
| OpenAI | GPT-5.4 mini | $0.75 | $4.50 | N/A | coding |
| Google Gemini | Gemini 3.1 Flash Live Preview | $0.75 | $4.50 | Not available | chat |
| Together AI | Llama 3.3 70B | $0.88 | $0.88 | 128K | chat |
| Anthropic | Claude Haiku 4.5 | $1.00 | $5.00 | 200K | chat |
| Together AI | GLM-5 | $1.00 | $3.20 | 128K | chat |
| Perplexity | Sonar | $1.00 | $1.00 | N/A | reasoning |
| Cohere | Command | $1.00 | $2.00 | Not specified | generative |
| Together AI | Kimi K2.6 | $1.20 | $4.50 | N/A | chat |
| Google Gemini | Gemini 2.5 Pro | $1.25 | $10.00 | 200K | chat |
| Together AI | Cogito v2.1 671B | $1.25 | $1.25 | N/A | chat |
| Together AI | GLM-5.1 | $1.40 | $4.40 | 128K | chat |
| Google Gemini | Gemini 3.1 Pro Preview | $2.00 | $12.00 | 200K | chat |
| Together AI | Qwen3-Coder-480B A35B Instruct | $2.00 | $2.00 | 128K | chat |
| xAI | grok-4.20-reasoning | $2.00 | $6.00 | 2M | reasoning |
| xAI | grok-4.20-non-reasoning | $2.00 | $6.00 | 2M | non-reasoning |
| Perplexity | Sonar Reasoning Pro | $2.00 | $8.00 | N/A | reasoning |
| Perplexity | Sonar Deep Research | $2.00 | $8.00 | N/A | reasoning |
| Together AI | DeepSeek V4 Pro | $2.10 | $4.40 | 128K | chat |
| OpenAI | GPT-5.4 | $2.50 | $15.00 | N/A | coding |
| Cohere | Command R+ 08-2024 | $2.50 | $10.00 | Not specified | generative |
| Anthropic | Claude Sonnet 4.6 | $3.00 | $15.00 | 1M | chat |
| Together AI | DeepSeek-R1-0528 | $3.00 | $7.00 | 128K | chat |
| Perplexity | Sonar Pro | $3.00 | $15.00 | N/A | reasoning |
| Cohere | Command R+ 04-2024 | $3.00 | $15.00 | Not specified | generative |
| OpenAI | GPT-realtime-1.5 (Text) | $4.00 | $16.00 | N/A | text |
| OpenAI | GPT-5.5 | $5.00 | $30.00 | N/A | coding |
| OpenAI | GPT-image-2 (Text) | $5.00 | $30.00 | N/A | text |
| Anthropic | Claude Opus 4.7 | $5.00 | $25.00 | 1M | chat |
| OpenAI | GPT-image-2 (Image) | $8.00 | $30.00 | N/A | image |
| OpenAI | GPT-realtime-1.5 (Audio) | $32.00 | $64.00 | N/A | audio |
| Google Gemini | Imagen 4 | Free | $0.04 | Not available | image |