Every model, priced for agents
This table lists every one of the 343 models Zumik routes across its five first-class providers: OpenAI, Anthropic, Google, xAI, and Fireworks. List prices hide what a coding agent really pays, so the 15 flagships we profile in depth also carry cache-read discounts, measured capture, warm time-to-first-token, and a quality index. Sort by reuse-adjusted cost, not headline price.
| Model (provider) | Quality index | Context (tokens) | Input | Output | Cache read | Reuse-adjusted | Capture | Warm TTFT |
|---|---|---|---|---|---|---|---|---|
| Claude Fable 5 Anthropic | 98 | 1M | $10.00 | $50.00 | $1.00 −90% | $16.29 | 93% | 280 ms |
| Claude Opus 4.8 Anthropic | 97 | 1M | $5.00 | $25.00 | $0.50 −90% | $8.14 | 93% | 240 ms |
| GPT-5.5 Pro OpenAI | 97 | 1.1M | $30.00 | $180.00 | $30.00 | $67.50 | 90% | 900 ms |
| GPT-5.5 OpenAI | 96 | 1.1M | $5.00 | $30.00 | $0.50 −90% | $9.39 | 88% | 240 ms |
| Claude Opus 4.7 Anthropic | 95 | 1M | $5.00 | $25.00 | $0.50 −90% | $8.14 | 92% | 320 ms |
| Gemini 3.1 Pro Preview Google | 94 | 1M | $2.00 | $12.00 | $0.20 −90% | $3.76 | 82% | 360 ms |
| Grok 4.3 xAI | 92 | 1M | $1.25 | $2.50 | $0.20 −84% | $1.13 | 76% | 280 ms |
| Claude Sonnet 4.6 Anthropic | 90 | 1M | $3.00 | $15.00 | $0.30 −90% | $4.89 | 91% | 190 ms |
| DeepSeek-V4-Pro Fireworks · open | 88 | 1M | $1.74 | $3.48 | $0.14 −92% | $1.52 | 80% | 130 ms |
| GLM 5.1 Fireworks · open | 86 | 203K | $1.40 | $4.40 | $0.26 −81% | $1.68 | 82% | 130 ms |
| Gemini 3.5 Flash Google | 86 | 1M | $1.50 | $9.00 | $0.15 −90% | $2.82 | 81% | 150 ms |
| Kimi K2.6 Fireworks · open | 85 | 262K | $0.95 | $4.00 | $0.16 −83% | $1.39 | 81% | 140 ms |
| GPT-5 Mini OpenAI | 82 | 400K | $0.25 | $2.00 | $0.03 −90% | $0.59 | 85% | 150 ms |
| Claude Haiku 4.5 Anthropic | 80 | 200K | $1.00 | $5.00 | $0.10 −90% | $1.63 | 90% | 110 ms |
| OpenAI gpt-oss-120b Fireworks · open | 79 | 131K | $0.15 | $0.60 | $0.01 −90% | $0.21 | 79% | 120 ms |
| Claude Opus 4.1 Anthropic | — | 200K | $15.00 | $75.00 | $1.50 −90% | $24.43 | — | — |
| Claude Opus 4.5 Anthropic | — | 200K | $5.00 | $25.00 | $0.50 −90% | $8.14 | — | — |
| Claude Opus 4.6 Anthropic | — | 1M | $5.00 | $25.00 | $0.50 −90% | $8.14 | — | — |
| Claude Sonnet 4.5 Anthropic | — | 200K | $3.00 | $15.00 | $0.30 −90% | $4.89 | — | — |
| Chronos Hermes 13B v2 Fireworks · open | — | 4K | — | — | — | — | — | — |
| Code Llama 13B Fireworks · open | — | 16K | — | — | — | — | — | — |
| Code Llama 13B Instruct Fireworks · open | — | 16K | — | — | — | — | — | — |
| Code Llama 13B Python Fireworks · open | — | 16K | — | — | — | — | — | — |
| Code Llama 34B Fireworks · open | — | 16K | — | — | — | — | — | — |
| Code Llama 34B Instruct Fireworks · open | — | 16K | — | — | — | — | — | — |
| Code Llama 34B Python Fireworks · open | — | 16K | — | — | — | — | — | — |
| Code Llama 70B Fireworks · open | — | 4K | — | — | — | — | — | — |
| Code Llama 70B Instruct Fireworks · open | — | 4K | — | — | — | — | — | — |
| Code Llama 70B Python Fireworks · open | — | 4K | — | — | — | — | — | — |
| Code Llama 7B Fireworks · open | — | 16K | — | — | — | — | — | — |
| Code Llama 7B Instruct Fireworks · open | — | 16K | — | — | — | — | — | — |
| CodeQwen 1.5 7B Fireworks · open | — | 66K | — | — | — | — | — | — |
| CodeGemma 2B Fireworks · open | — | 8K | — | — | — | — | — | — |
| CodeGemma 7B Fireworks · open | — | 8K | — | — | — | — | — | — |
| Cogito v1 Preview Llama 3B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Cogito v1 Preview Llama 70B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Cogito v1 Preview Llama 8B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Cogito v1 Preview Qwen 14B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Cogito v1 Preview Qwen 32B Fireworks · open | — | 131K | — | — | — | — | — | — |
| DeepSeek Coder 1.3B Base Fireworks · open | — | 16K | — | — | — | — | — | — |
| DeepSeek Coder 33B Instruct Fireworks · open | — | 16K | — | — | — | — | — | — |
| DeepSeek Coder 7B Base Fireworks · open | — | 4K | — | — | — | — | — | — |
| DeepSeek Coder 7B Base v1.5 Fireworks · open | — | 4K | — | — | — | — | — | — |
| DeepSeek Coder 7B Instruct v1.5 Fireworks · open | — | 4K | — | — | — | — | — | — |
| DeepSeek Coder V2 Lite Base Fireworks · open | — | 164K | — | — | — | — | — | — |
| DeepSeek Coder V2 Lite Instruct Fireworks · open | — | 164K | — | — | — | — | — | — |
| DeepSeek Prover V2 Fireworks · open | — | 164K | — | — | — | — | — | — |
| DeepSeek R1 (Fast) Fireworks · open | — | 164K | $3.00 | $7.00 | $3.00 | $4.00 | — | — |
| DeepSeek R1 05/28 Fireworks | — | 164K | $0.50 | $2.15 | $0.35 −30% | $0.85 | — | — |
| DeepSeek R1 0528 Distill Qwen3 8B Fireworks · open | — | 131K | — | — | — | — | — | — |
| DeepSeek R1 (Basic) Fireworks · open | — | 164K | — | — | — | — | — | — |
| DeepSeek R1 Distill Llama 70B Fireworks · open | — | 8K | $0.80 | $0.80 | $0.80 | $0.80 | — | — |
| DeepSeek R1 Distill Llama 8B Fireworks · open | — | 131K | — | — | — | — | — | — |
| DeepSeek R1 Distill Qwen 14B Fireworks · open | — | 33K | $0.15 | $0.15 | $0.15 | $0.15 | — | — |
| DeepSeek R1 Distill Qwen 1.5B Fireworks · open | — | 131K | — | — | — | — | — | — |
| DeepSeek R1 Distill Qwen 32B Fireworks · open | — | 64K | $0.30 | $0.30 | $0.30 | $0.30 | — | — |
| DeepSeek R1 Distill Qwen 7B Fireworks · open | — | 131K | — | — | — | — | — | — |
| DeepSeek V2 Lite Chat Fireworks · open | — | 164K | — | — | — | — | — | — |
| DeepSeek V2.5 Fireworks · open | — | 33K | — | — | — | — | — | — |
| DeepSeek V3 Fireworks · open | — | 131K | $1.25 | $1.25 | $1.25 | $1.25 | — | — |
| DeepSeek V3 03-24 Fireworks · open | — | 164K | $0.27 | $1.12 | $0.14 −50% | $0.43 | — | — |
| DeepSeek V3.1 Fireworks · open | — | 164K | — | — | — | — | — | — |
| DeepSeek V3.1 Terminus Fireworks · open | — | 164K | — | — | — | — | — | — |
| DeepSeek V3.2 Fireworks · open | — | 164K | — | — | — | — | — | — |
| DeepSeek-V4-Flash Fireworks · open | — | 1M | $0.14 | $0.28 | $0.03 −79% | $0.13 | — | — |
| Devstral-Small-2505 Fireworks · open | — | 128K | $0.10 | $0.30 | $0.10 | $0.15 | — | — |
| Dolphin 2.9.2 Qwen2 72B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Dolphin 2.6 Mixtral 8x7b Fireworks · open | — | 33K | — | — | — | — | — | — |
| ERNIE-4.5-21B-A3B-PT Fireworks · open | — | 131K | — | — | — | — | — | — |
| FARE-20B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Firesearch OCR V6 Fireworks · open | — | 8K | — | — | — | — | — | — |
| Gemma 2B Instruct Fireworks · open | — | 8K | — | — | — | — | — | — |
| Gemma 3 12B Instruct Fireworks · open | — | 131K | $0.05 | $0.10 | $0.05 | $0.06 | — | — |
| Gemma 3 27B Instruct Fireworks · open | — | 98K | $0.12 | $0.20 | $0.12 | $0.14 | — | — |
| Gemma 3 4B Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Gemma 4 31B IT NVFP4 Fireworks · open | — | 262K | — | — | — | — | — | — |
| Gemma 4 E4B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Gemma 7B Fireworks · open | — | 8K | — | — | — | — | — | — |
| Gemma 7B Instruct Fireworks · open | — | 8K | — | — | — | — | — | — |
| Gemma 2 9B Instruct Fireworks · open | — | 8K | — | — | — | — | — | — |
| GLM-4.5 Fireworks · open | — | 131K | — | — | — | — | — | — |
| GLM-4.5-Air Fireworks · open | — | 131K | — | — | — | — | — | — |
| GLM-4.5V Fireworks · open | — | 131K | — | — | — | — | — | — |
| GLM-4.6 Fireworks · open | — | 203K | — | — | — | — | — | — |
| GLM-4.7 Fireworks · open | — | 203K | — | — | — | — | — | — |
| GLM-4.7 Flash Fireworks · open | — | 203K | — | — | — | — | — | — |
| GLM-5 Fireworks · open | — | 205K | $1.00 | $3.20 | $0.20 −80% | $1.22 | — | — |
| OpenAI gpt-oss-20b Fireworks · open | — | 131K | $0.07 | $0.30 | $0.04 −50% | $0.11 | — | — |
| OpenAI gpt-oss-safeguard-120b Fireworks · open | — | 131K | — | — | — | — | — | — |
| OpenAI gpt-oss-safeguard-20b Fireworks · open | — | 131K | $0.07 | $0.30 | $0.04 −51% | $0.12 | — | — |
| Hermes 2 Pro Mistral 7B Fireworks · open | — | 33K | — | — | — | — | — | — |
| InternVL3 38B Fireworks · open | — | 16K | — | — | — | — | — | — |
| InternVL3 78B Fireworks · open | — | 16K | — | — | — | — | — | — |
| InternVL3 8B Fireworks · open | — | 16K | — | — | — | — | — | — |
| KAT Dev 32B Fireworks · open | — | 131K | — | — | — | — | — | — |
| KAT Dev 72B Exp Fireworks · open | — | 131K | — | — | — | — | — | — |
| Kimi K2 Instruct Fireworks · open | — | 131K | $0.57 | $2.30 | $0.57 | $1.00 | — | — |
| Kimi K2 Instruct 0905 Fireworks · open | — | 131K | $0.57 | $2.30 | $0.57 | $1.00 | — | — |
| Kimi K2 Thinking Fireworks · open | — | 262K | $0.60 | $2.50 | $0.15 −75% | $0.89 | — | — |
| Kimi K2.5 Fireworks · open | — | 262K | — | — | — | — | — | — |
| Kimi K2.7 Code Fireworks · open | — | 262K | $0.95 | $4.00 | $0.19 −80% | $1.40 | — | — |
| Llama Guard v2 8B Fireworks · open | — | 8K | — | — | — | — | — | — |
| Llama Guard v3 1B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama Guard 3 8B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 2 13B Fireworks · open | — | 4K | — | — | — | — | — | — |
| Llama 2 13B Chat Fireworks · open | — | 4K | — | — | — | — | — | — |
| Llama 2 70B Fireworks · open | — | 4K | — | — | — | — | — | — |
| Llama 2 7B Fireworks · open | — | 4K | — | — | — | — | — | — |
| Llama 2 7B Chat Fireworks · open | — | 4K | — | — | — | — | — | — |
| Llama 3 70B Instruct Fireworks · open | — | 8K | — | — | — | — | — | — |
| Llama 3 70B Instruct (HF version) Fireworks · open | — | 8K | — | — | — | — | — | — |
| Llama 3 8B Fireworks · open | — | 8K | — | — | — | — | — | — |
| Llama 3 8B Instruct Fireworks · open | — | 8K | — | — | — | — | — | — |
| Llama 3 8B Instruct (HF version) Fireworks · open | — | 8K | — | — | — | — | — | — |
| Llama 3.1 405B Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.1 70B Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.1 8B Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.1 Nemotron 70B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.2 11B Vision Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.2 1B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.2 1B Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.2 3B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.2 3B Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.2 90B Vision Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 3.3 70B Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Llama 4 Maverick Instruct (Basic) Fireworks · open | — | 1M | — | — | — | — | — | — |
| Llama 4 Scout Instruct (Basic) Fireworks · open | — | 1M | — | — | — | — | — | — |
| Llama Guard 7B Fireworks · open | — | 4K | — | — | — | — | — | — |
| MiniMax-M2 Fireworks · open | — | 205K | $0.30 | $1.20 | $0.03 −90% | $0.41 | — | — |
| MiniMax-M2.1 Fireworks · open | — | 197K | — | — | — | — | — | — |
| MiniMax-M2.5 Fireworks · open | — | 197K | — | — | — | — | — | — |
| MiniMax M2.7 Fireworks · open | — | 197K | $0.30 | $1.20 | $0.06 −80% | $0.43 | — | — |
| MiniMax M3 Fireworks · open | — | 512K | $0.30 | $1.20 | $0.06 −80% | $0.43 | — | — |
| Ministral 3 14B Instruct 2512 Fireworks · open | — | 256K | — | — | — | — | — | — |
| Ministral 3 3B Instruct 2512 Fireworks · open | — | 256K | — | — | — | — | — | — |
| Ministral 3 8B Instruct 2512 Fireworks · open | — | 256K | — | — | — | — | — | — |
| MiroThinker-1.7 Fireworks · open | — | 262K | — | — | — | — | — | — |
| Mistral 7B Fireworks · open | — | 33K | — | — | — | — | — | — |
| Mistral 7B Instruct v0.2 Fireworks · open | — | 33K | — | — | — | — | — | — |
| Mistral 7B Instruct v0.3 Fireworks · open | — | 33K | — | — | — | — | — | — |
| Mistral 7B v0.2 Fireworks · open | — | 33K | — | — | — | — | — | — |
| Mistral Large 3 675B Instruct 2512 Fireworks · open | — | 256K | — | — | — | — | — | — |
| Mistral Nemo Base 2407 Fireworks · open | — | 128K | — | — | — | — | — | — |
| Mistral Nemo Instruct 2407 Fireworks · open | — | 128K | — | — | — | — | — | — |
| Mistral Small 24B Instruct 2501 Fireworks · open | — | 33K | — | — | — | — | — | — |
| Mixtral MoE 8x22B Fireworks · open | — | 66K | — | — | — | — | — | — |
| Mixtral MoE 8x22B Instruct Fireworks · open | — | 66K | — | — | — | — | — | — |
| Mixtral 8x7B v0.1 Fireworks · open | — | 33K | — | — | — | — | — | — |
| Mixtral MoE 8x7B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Mixtral MoE 8x7B Instruct (HF version) Fireworks · open | — | 33K | — | — | — | — | — | — |
| Molmo2-4B Fireworks · open | — | 37K | — | — | — | — | — | — |
| Molmo2-8B Fireworks · open | — | 37K | — | — | — | — | — | — |
| MythoMax L2 13B Fireworks · open | — | 4K | $0.09 | $0.09 | $0.09 | $0.09 | — | — |
| NVIDIA Nemotron 3 Super 120B A12B BF16 Fireworks · open | — | 256K | $0.30 | $0.90 | $0.30 | $0.45 | — | — |
| NVIDIA Nemotron 3 Ultra BF16 Fireworks · open | — | 262K | — | — | — | — | — | — |
| NVIDIA Nemotron 3 Ultra NVFP4 Fireworks · open | — | 262K | — | — | — | — | — | — |
| NVIDIA Nemotron Nano 2 VL Fireworks · open | — | 131K | — | — | — | — | — | — |
| Nous Capybara 7B V1.9 Fireworks · open | — | 33K | — | — | — | — | — | — |
| Nous Hermes 2 Mixtral 8x7B DPO Fireworks · open | — | 33K | — | — | — | — | — | — |
| Nous Hermes Llama2 13B Fireworks · open | — | 4K | — | — | — | — | — | — |
| Nous Hermes Llama2 70B Fireworks · open | — | 4K | — | — | — | — | — | — |
| Nous Hermes Llama2 7B Fireworks · open | — | 4K | — | — | — | — | — | — |
| NVIDIA Nemotron 3 Nano Omni 30B A3B Fireworks · open | — | 262K | — | — | — | — | — | — |
| NVIDIA Nemotron 3 Super 120B A12B FP8 Fireworks · open | — | 256K | $0.30 | $0.90 | $0.30 | $0.45 | — | — |
| NVIDIA Nemotron 3 Super 120B A12B NVFP4 Fireworks · open | — | 262K | — | — | — | — | — | — |
| NVIDIA Nemotron Nano 12B v2 Fireworks · open | — | 128K | — | — | — | — | — | — |
| NVIDIA Nemotron Nano 9B v2 Fireworks · open | — | 128K | — | — | — | — | — | — |
| OpenChat 3.5 0106 Fireworks · open | — | 8K | — | — | — | — | — | — |
| OpenHermes 2 Mistral 7B Fireworks · open | — | 33K | — | — | — | — | — | — |
| OpenHermes 2.5 Mistral 7B Fireworks · open | — | 33K | — | — | — | — | — | — |
| Mistral 7B OpenOrca Fireworks · open | — | 33K | — | — | — | — | — | — |
| Phi-3 Mini 128k Instruct Fireworks · open | — | 131K | — | — | — | — | — | — |
| Phi-3.5 Vision Instruct Fireworks · open | — | 32K | — | — | — | — | — | — |
| Phind CodeLlama 34B Python v1 Fireworks · open | — | 16K | — | — | — | — | — | — |
| Phind CodeLlama 34B v1 Fireworks · open | — | 16K | — | — | — | — | — | — |
| Phind CodeLlama 34B v2 Fireworks · open | — | 16K | — | — | — | — | — | — |
| Pythia 12B Fireworks · open | — | 2K | — | — | — | — | — | — |
| Qwen QWQ 32B Preview Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5 14B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5 7B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Qwen1.5 72B Chat Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2 72B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2 7B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2-VL 2B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2-VL 72B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2-VL 7B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5 0.5B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5 14B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Qwen2.5 14B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5 1.5B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5 32B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Qwen2.5 32B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5 72B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Qwen2.5 72B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5 7B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Qwen2.5 7B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 0.5B Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 0.5B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 14B Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 14B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 1.5B Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 1.5B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 32B Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 32B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 32B Instruct 128K Fireworks · open | — | 131K | — | — | — | — | — | — |
| Qwen2.5-Coder 32B Instruct 32K RoPE Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 32B Instruct 64k Fireworks · open | — | 66K | — | — | — | — | — | — |
| Qwen2.5-Coder 3B Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 3B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 7B Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Coder 7B Instruct Fireworks · open | — | 33K | — | — | — | — | — | — |
| Qwen2.5-Math 72B Instruct Fireworks · open | — | 4K | — | — | — | — | — | — |
| Qwen2.5-VL 32B Instruct Fireworks · open | — | 128K | — | — | — | — | — | — |
| Qwen2.5-VL 3B Instruct Fireworks · open | — | 128K | — | — | — | — | — | — |
| Qwen2.5-VL 72B Instruct Fireworks · open | — | 128K | — | — | — | — | — | — |
| Qwen2.5-VL 7B Instruct Fireworks · open | — | 128K | — | — | — | — | — | — |
| Qwen3 0.6B Fireworks · open | — | 41K | — | — | — | — | — | — |
| Qwen3 14B Fireworks · open | — | 131K | $0.35 | $1.40 | $0.35 | $0.61 | — | — |
| Qwen3 1.7B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Qwen3 235B A22B Fireworks · open | — | 131K | $0.70 | $2.80 | $0.70 | $1.22 | — | — |
| Qwen3 235B A22B Instruct 2507 Fireworks · open | — | 131K | $0.09 | $0.58 | $0.09 | $0.21 | — | — |
| Qwen3 235B A22B Thinking 2507 Fireworks · open | — | 131K | $0.30 | $3.00 | $0.30 | $0.97 | — | — |
| Qwen3 30B-A3B Fireworks · open | — | 41K | $0.09 | $0.45 | $0.09 | $0.18 | — | — |
| Qwen3 30B A3B Instruct 2507 Fireworks · open | — | 128K | $0.10 | $0.30 | $0.01 −90% | $0.11 | — | — |
| Qwen3 30B A3B Thinking 2507 Fireworks · open | — | 262K | — | — | — | — | — | — |
| Qwen3 32B Fireworks · open | — | 131K | $0.70 | $2.80 | $0.70 | $1.22 | — | — |
| Qwen3 4B Fireworks · open | — | 128K | $0.03 | $0.03 | $0.03 | $0.03 | — | — |
| Qwen 3 4B Instruct 2507 Fireworks · open | — | 262K | — | — | — | — | — | — |
| Qwen3 8B Fireworks · open | — | 131K | $0.18 | $0.70 | $0.18 | $0.31 | — | — |
| Qwen3 Coder 30B A3B Instruct Fireworks · open | — | 262K | $0.45 | $2.25 | $0.45 | $0.90 | — | — |
| Qwen3 Coder 480B A35B Instruct Fireworks · open | — | 262K | $1.50 | $7.50 | $1.50 | $3.00 | — | — |
| Qwen3 Coder 480B Instruct BF16 Fireworks · open | — | 262K | — | — | — | — | — | — |
| Qwen3 Omni 30B A3B Instruct Fireworks · open | — | 66K | $0.25 | $0.97 | $0.25 | $0.43 | — | — |
| Qwen3 VL 235B A22B Instruct Fireworks · open | — | 131K | $0.30 | $1.50 | $0.30 | $0.60 | — | — |
| Qwen3 VL 235B A22B Thinking Fireworks · open | — | 131K | $0.98 | $3.95 | $0.98 | $1.72 | — | — |
| Qwen3 VL 30B A3B Instruct Fireworks · open | — | 131K | $0.20 | $0.70 | $0.20 | $0.33 | — | — |
| Qwen3 VL 30B A3B Thinking Fireworks · open | — | 131K | $0.20 | $1.00 | $0.20 | $0.40 | — | — |
| Qwen3-VL-8B-Instruct Fireworks · open | — | 131K | $0.08 | $0.50 | $0.08 | $0.18 | — | — |
| Qwen 3.5 122B A10B Fireworks · open | — | 262K | — | — | — | — | — | — |
| Qwen3.5 27B Fireworks · open | — | 262K | — | — | — | — | — | — |
| Qwen 3.5 35B A3B Fireworks · open | — | 262K | — | — | — | — | — | — |
| Qwen3.5 397B A17B Fireworks · open | — | 262K | — | — | — | — | — | — |
| Qwen3.5 9B Fireworks · open | — | 262K | — | — | — | — | — | — |
| Qwen3.6 27B Fireworks · open | — | 262K | — | — | — | — | — | — |
| Qwen3.6-35B-A3B Fireworks · open | — | 262K | — | — | — | — | — | — |
| QWQ 32B Fireworks · open | — | 131K | — | — | — | — | — | — |
| Rolm OCR Fireworks · open | — | 128K | — | — | — | — | — | — |
| Seed OSS 36B Instruct Fireworks · open | — | 524K | — | — | — | — | — | — |
| Snorkel Mistral PairRM DPO Fireworks · open | — | 33K | — | — | — | — | — | — |
| Step-3.7-Flash-NVFP4 Fireworks · open | — | 262K | — | — | — | — | — | — |
| Toppy M 7B Fireworks · open | — | 33K | — | — | — | — | — | — |
| Zephyr 7B Beta Fireworks · open | — | 33K | — | — | — | — | — | — |
| Antigravity Agent Preview Google | — | 131K | — | — | — | — | — | — |
| Deep Research Max Preview (Apr-21-2026) Google | — | 131K | — | — | — | — | — | — |
| Deep Research Preview (Apr-21-2026) Google | — | 131K | — | — | — | — | — | — |
| Deep Research Pro Preview (Dec-12-2025) Google | — | 131K | — | — | — | — | — | — |
| Gemini 2.0 Flash Google | — | 1M | $0.10 | $0.40 | $0.03 −75% | $0.14 | — | — |
| Gemini 2.0 Flash 001 Google | — | 1M | $0.10 | $0.40 | $0.03 −75% | $0.14 | — | — |
| Gemini 2.0 Flash-Lite Google | — | 1M | $0.07 | $0.30 | $0.07 | $0.13 | — | — |
| Gemini 2.0 Flash-Lite 001 Google | — | 1M | $0.07 | $0.30 | $0.07 | $0.13 | — | — |
| Gemini 2.5 Computer Use Preview 10-2025 Google | — | 131K | — | — | — | — | — | — |
| Gemini 2.5 Flash Google | — | 1M | $0.30 | $2.50 | $0.07 −75% | $0.76 | — | — |
| Gemini 2.5 Flash-Lite Google | — | 1M | $0.10 | $0.40 | $0.01 −90% | $0.14 | — | — |
| Gemini 2.5 Pro Google | — | 1M | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| Gemini 3.1 Flash Lite Google | — | 1M | $0.25 | $1.50 | $0.03 −90% | $0.47 | — | — |
| Gemini 3.1 Flash Lite Preview Google | — | 1M | $0.25 | $1.50 | $0.03 −90% | $0.47 | — | — |
| Gemini 3 Flash Preview Google | — | 1M | $0.50 | $3.00 | $0.05 −90% | $0.94 | — | — |
| Gemini 3 Pro Preview Google | — | 1M | $2.00 | $12.00 | $0.20 −90% | $3.76 | — | — |
| Gemini Flash Latest Google | — | 1M | $0.30 | $2.50 | $0.07 −75% | $0.76 | — | — |
| Gemini Flash-Lite Latest Google | — | 1M | $0.10 | $0.40 | $0.03 −75% | $0.14 | — | — |
| Gemini Pro Latest Google | — | 1M | — | — | — | — | — | — |
| Gemma 4 26B A4B IT Google · open | — | 262K | — | — | — | — | — | — |
| Gemma 4 31B IT Google · open | — | 262K | — | — | — | — | — | — |
| Nano Banana Pro Google | — | 131K | — | — | — | — | — | — |
| GPT-3.5-turbo OpenAI | — | 16K | $0.50 | $1.50 | $0.00 −100% | $0.54 | — | — |
| GPT-3.5-turbo-0125 OpenAI | — | 16K | $0.50 | $1.50 | $0.00 −100% | $0.54 | — | — |
| GPT-3.5-turbo-1106 OpenAI | — | 16K | $0.50 | $1.50 | $0.00 −100% | $0.54 | — | — |
| GPT-3.5-turbo-16k OpenAI | — | — | — | — | — | — | — | — |
| GPT-3.5-turbo-instruct OpenAI | — | — | — | — | — | — | — | — |
| GPT-3.5-turbo-instruct-0914 OpenAI | — | — | — | — | — | — | — | — |
| GPT-4 OpenAI | — | 8K | $30.00 | $60.00 | $30.00 | $37.50 | — | — |
| GPT-4-0613 OpenAI | — | 8K | $30.00 | $60.00 | $30.00 | $37.50 | — | — |
| GPT-4.1 OpenAI | — | 1M | $2.00 | $8.00 | $0.50 −75% | $2.88 | — | — |
| GPT-4.1-2025-04-14 OpenAI | — | 1M | $2.00 | $8.00 | $0.50 −75% | $2.88 | — | — |
| GPT-4.1 Mini OpenAI | — | 1M | $0.40 | $1.60 | $0.10 −75% | $0.58 | — | — |
| GPT-4.1 Mini-2025-04-14 OpenAI | — | 1M | $0.40 | $1.60 | $0.10 −75% | $0.58 | — | — |
| GPT-4.1 Nano OpenAI | — | 1M | $0.10 | $0.40 | $0.03 −75% | $0.14 | — | — |
| GPT-4.1 Nano-2025-04-14 OpenAI | — | 1M | $0.10 | $0.40 | $0.03 −75% | $0.14 | — | — |
| GPT-4-turbo OpenAI | — | 128K | $10.00 | $30.00 | $10.00 | $15.00 | — | — |
| GPT-4-turbo-2024-04-09 OpenAI | — | 128K | $10.00 | $30.00 | $10.00 | $15.00 | — | — |
| GPT-4o OpenAI | — | 128K | $2.50 | $10.00 | $1.25 −50% | $3.86 | — | — |
| GPT-4o-2024-05-13 OpenAI | — | 128K | $5.00 | $15.00 | $5.00 | $7.50 | — | — |
| GPT-4o-2024-08-06 OpenAI | — | 128K | $2.50 | $10.00 | $1.25 −50% | $3.86 | — | — |
| GPT-4o-2024-11-20 OpenAI | — | 128K | $2.50 | $10.00 | $1.25 −50% | $3.86 | — | — |
| GPT-4o Mini OpenAI | — | 128K | $0.15 | $0.60 | $0.07 −50% | $0.23 | — | — |
| GPT-4o Mini-2024-07-18 OpenAI | — | 128K | $0.15 | $0.60 | $0.07 −50% | $0.23 | — | — |
| GPT-5 OpenAI | — | 400K | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| GPT-5.1 OpenAI | — | 400K | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| GPT-5.1-2025-11-13 OpenAI | — | 400K | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| GPT-5.1 Chat OpenAI | — | 128K | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| GPT-5.1 Codex OpenAI | — | 400K | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| GPT-5.1 Codex Max OpenAI | — | 400K | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| GPT-5.2 OpenAI | — | 400K | $1.75 | $14.00 | $0.17 −90% | $4.16 | — | — |
| GPT-5.2-2025-12-11 OpenAI | — | 400K | $1.75 | $14.00 | $0.17 −90% | $4.16 | — | — |
| GPT-5.2 Chat OpenAI | — | 128K | $1.75 | $14.00 | $0.17 −90% | $4.16 | — | — |
| GPT-5.2 Codex OpenAI | — | 400K | $1.75 | $14.00 | $0.17 −90% | $4.16 | — | — |
| GPT-5.2 Pro OpenAI | — | 400K | $21.00 | $168.00 | $21.00 | $57.75 | — | — |
| GPT-5.2 Pro-2025-12-11 OpenAI | — | 400K | $21.00 | $168.00 | $21.00 | $57.75 | — | — |
| GPT-5-2025-08-07 OpenAI | — | 400K | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| GPT-5.3 Chat OpenAI | — | 128K | $1.75 | $14.00 | $0.17 −90% | $4.16 | — | — |
| GPT-5.3 Codex OpenAI | — | 400K | $1.75 | $14.00 | $0.17 −90% | $4.16 | — | — |
| GPT-5.4 OpenAI | — | 1.1M | $2.50 | $15.00 | $0.25 −90% | $4.70 | — | — |
| GPT-5.4-2026-03-05 OpenAI | — | 1.1M | $2.50 | $15.00 | $0.25 −90% | $4.70 | — | — |
| GPT-5.4 Mini OpenAI | — | 400K | $0.75 | $4.50 | $0.07 −90% | $1.41 | — | — |
| GPT-5.4 Mini-2026-03-17 OpenAI | — | 400K | $0.75 | $4.50 | $0.07 −90% | $1.41 | — | — |
| GPT-5.4 Nano OpenAI | — | 400K | $0.20 | $1.25 | $0.02 −90% | $0.39 | — | — |
| GPT-5.4 Nano-2026-03-17 OpenAI | — | 400K | $0.20 | $1.25 | $0.02 −90% | $0.39 | — | — |
| GPT-5.4 Pro OpenAI | — | 1.1M | $30.00 | $180.00 | $30.00 | $67.50 | — | — |
| GPT-5.4 Pro-2026-03-05 OpenAI | — | 1.1M | $30.00 | $180.00 | $30.00 | $67.50 | — | — |
| GPT-5.5-2026-04-23 OpenAI | — | 1.1M | $5.00 | $30.00 | $0.50 −90% | $9.39 | — | — |
| GPT-5.5 Pro-2026-04-23 OpenAI | — | 1.1M | $30.00 | $180.00 | $30.00 | $67.50 | — | — |
| GPT-5 Chat OpenAI | — | 400K | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| GPT-5 Codex OpenAI | — | 400K | $1.25 | $10.00 | $0.13 −90% | $2.97 | — | — |
| GPT-5 Mini-2025-08-07 OpenAI | — | 400K | $0.25 | $2.00 | $0.03 −90% | $0.59 | — | — |
| GPT-5 Nano OpenAI | — | 400K | $0.05 | $0.40 | $0.01 −90% | $0.12 | — | — |
| GPT-5 Nano-2025-08-07 OpenAI | — | 400K | $0.05 | $0.40 | $0.01 −90% | $0.12 | — | — |
| GPT-5 Pro OpenAI | — | 400K | $15.00 | $120.00 | $15.00 | $41.25 | — | — |
| GPT-5 Pro-2025-10-06 OpenAI | — | 400K | $15.00 | $120.00 | $15.00 | $41.25 | — | — |
| o1 OpenAI | — | 200K | $15.00 | $60.00 | $7.50 −50% | $23.16 | — | — |
| o1-2024-12-17 OpenAI | — | 200K | $15.00 | $60.00 | $7.50 −50% | $23.16 | — | — |
| o1 Pro OpenAI | — | 200K | $150.00 | $600.00 | $150.00 | $262.50 | — | — |
| o1 Pro-2025-03-19 OpenAI | — | 200K | $150.00 | $600.00 | $150.00 | $262.50 | — | — |
| o3 OpenAI | — | 200K | $2.00 | $8.00 | $0.50 −75% | $2.88 | — | — |
| o3-2025-04-16 OpenAI | — | 200K | $2.00 | $8.00 | $0.50 −75% | $2.88 | — | — |
| o3 Mini OpenAI | — | 200K | $1.10 | $4.40 | $0.55 −50% | $1.70 | — | — |
| o3 Mini-2025-01-31 OpenAI | — | 200K | $1.10 | $4.40 | $0.55 −50% | $1.70 | — | — |
| o4 Mini OpenAI | — | 200K | $1.10 | $4.40 | $0.28 −75% | $1.58 | — | — |
| o4 Mini-2025-04-16 OpenAI | — | 200K | $1.10 | $4.40 | $0.28 −75% | $1.58 | — | — |
| o4 Mini-deep-research OpenAI | — | 200K | $2.00 | $8.00 | $0.50 −75% | $2.88 | — | — |
| Grok 4.20 (Non-Reasoning) xAI | — | 1M | $1.25 | $2.50 | $0.20 −84% | $1.13 | — | — |
| Grok 4.20 (Reasoning) xAI | — | 1M | $1.25 | $2.50 | $0.20 −84% | $1.13 | — | — |
| Grok 4.20 Multi-Agent xAI | — | 1M | $1.25 | $2.50 | $0.20 −84% | $1.13 | — | — |
| Grok Build 0.1 xAI | — | 256K | $1.00 | $2.00 | $0.20 −80% | $0.92 | — | — |
Showing 343 of 343 models across OpenAI, Anthropic, Google, xAI, and Fireworks. A “—” means no published per-token list price, or no Zumik measurement for that model yet. Prices are USD per 1M tokens.
The reuse-adjusted column blends input price against the cache-read rate at your assumed reuse, plus a 25% output share. Capture rates come from the prompt-cache capture benchmark. Read how the providers differ on prompt caching.
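The reuse-adjusted blend described above can be sketched as a small Python function. This is a minimal illustration, not Zumik's implementation: the function and parameter names are hypothetical, and the default reuse of 0.55 is an assumption inferred from the published rows (it reproduces the table's figures to the cent), not a value the page states. Only the 25% output share comes from the text.

```python
def reuse_adjusted(input_price, cache_read_price, output_price,
                   reuse=0.55, output_share=0.25):
    """Blend prices (USD per 1M tokens) into one reuse-adjusted figure.

    reuse=0.55 is an inferred assumption; output_share=0.25 is from the page.
    """
    # Input side: the reused fraction is billed at the cache-read rate,
    # the remainder at the full input price.
    blended_input = (1 - reuse) * input_price + reuse * cache_read_price
    # Weight the blended input and the output price by their token shares.
    return (1 - output_share) * blended_input + output_share * output_price


def cache_discount(input_price, cache_read_price):
    """Cache-read discount as a fraction of the input price (0.9 means -90%)."""
    return 1 - cache_read_price / input_price


if __name__ == "__main__":
    # Claude Opus 4.8 row: $5.00 input, $0.50 cache read, $25.00 output.
    print(round(reuse_adjusted(5.00, 0.50, 25.00), 2))   # 8.14, as in the table
    # GPT-5.5 Pro row: no cache discount, so the blend equals the input price.
    print(round(reuse_adjusted(30.00, 30.00, 180.00), 2))  # 67.5
```

Under these assumptions, a model with no cache-read discount gains nothing from reuse, which is why GPT-5.5 Pro stays at $67.50 while models with a 90% discount drop well below their list-price blend. Raising `reuse` widens that gap.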
Route to the right model by intent.
Use aliases like code.fast and auto.cheapest, and let Zumik resolve to the best model under policy.