Compare 66 AI models by token price, performance, and utility.
Find the cheapest AI API for your needs.
Last updated: April 7, 2026 · Prices in USD per 1M tokens · Source: OpenRouter API
Utility Score = weighted combination of 4 factors, on a 0-100 scale
💡 Select a profile above (Developer/Enterprise/Indie) or customize weights. Toggle "📊 Show Scores" to see per-factor breakdown.
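As a rough illustration, a weighted score of this kind can be sketched as below. The page does not name the four factors or the profile weights, so the factor names (`price`, `performance`, `context`, `speed`), the example weights, and the per-factor scores are all hypothetical.

```python
from typing import Dict

# Hypothetical factor names and weights; the page does not specify them.
DEVELOPER_WEIGHTS: Dict[str, float] = {
    "price": 0.4,
    "performance": 0.3,
    "context": 0.2,
    "speed": 0.1,
}


def utility_score(factor_scores: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted average of per-factor scores (each 0-100); result is 0-100."""
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    weighted = sum(weights[name] * factor_scores[name] for name in weights)
    # Dividing by the weight sum keeps the result on the 0-100 scale
    # even when custom weights do not add up to exactly 1.
    return weighted / total_weight


# Made-up per-factor scores for a single model.
example_scores = {"price": 90.0, "performance": 88.0, "context": 60.0, "speed": 70.0}
print(round(utility_score(example_scores, DEVELOPER_WEIGHTS), 1))
```

Normalizing by the weight sum is a design choice made here so that customized weights stay comparable across profiles; the site's actual formula may differ.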
Prices may vary by region.
How much would each model cost based on your actual usage?
~3000 messages/month
💡 Estimate based on 70% input / 30% output ratio. Actual costs vary by use case.
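The 70% input / 30% output estimate can be reproduced with a few lines of arithmetic. This is a minimal sketch: the figure of 1,000 tokens per message is an assumption (the page gives only a message count), and the function name is illustrative. The prices are the GPT-5.4 and DeepSeek V3.2 rates quoted on this page.

```python
def monthly_cost_usd(
    messages_per_month: int,
    tokens_per_message: int,
    input_price_per_m: float,
    output_price_per_m: float,
    input_share: float = 0.70,
) -> float:
    """Estimate monthly spend in USD from per-1M-token prices."""
    total_tokens = messages_per_month * tokens_per_message
    input_tokens = total_tokens * input_share
    output_tokens = total_tokens - input_tokens
    cost = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return cost / 1_000_000  # prices are quoted per 1M tokens


# Assumption: 1,000 tokens per message (not stated on the page).
MESSAGES = 3000
TOKENS_PER_MESSAGE = 1000

gpt_5_4 = monthly_cost_usd(MESSAGES, TOKENS_PER_MESSAGE, 2.50, 15.00)
deepseek_v3_2 = monthly_cost_usd(MESSAGES, TOKENS_PER_MESSAGE, 0.26, 0.38)
print(f"GPT-5.4:       ${gpt_5_4:.2f}/month")
print(f"DeepSeek V3.2: ${deepseek_v3_2:.2f}/month")
```

Because output tokens are usually priced several times higher than input tokens, shifting `input_share` changes the estimate noticeably for models such as GPT-5.4.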
CheapTokenz is an AI token price comparison platform tracking 66 models across 15 providers including OpenAI, xAI Grok, Anthropic, Google, DeepSeek and Mistral. Find the cheapest AI API for your business.
The cheapest paid model is Llama 3.1 8B at $0.07/1M total (input + output). DeepSeek V3.2 at $0.64/1M offers the best performance-to-price ratio (88 MMLU), followed by DeepSeek V3.1 at $0.90/1M (86.5 MMLU). Free options: Gemma 3 27B (74 MMLU) and Gemma 3 12B (70 MMLU). Note: Llama 3.3 70B is no longer free ($0.42/1M).
OpenAI o3 leads at 95 MMLU, followed by Claude Opus 4.6 (94.5), then GPT-5.4 Pro and o4-mini (both 94). The top budget performer is DeepSeek R1-0528 (91 MMLU) at only $2.60/1M total.
GPT-5.4: $2.50/1M input, $15.00/1M output ($17.50 total). GPT-5.4 Pro: $30/$180 ($210 total). GPT-5.4 mini: $0.75/$4.50 ($5.25 total). GPT-5 mini: $0.25/$2.00 ($2.25 total). New: GPT-5 nano at $0.05/$0.40 ($0.45 total), the cheapest GPT model to date.
Yes, DeepSeek is roughly 5-27x cheaper than GPT-5.4 by combined input + output price. DeepSeek V3.2: $0.26/$0.38 ($0.64 total) vs GPT-5.4: $2.50/$15 ($17.50 total). DeepSeek V3.1: $0.15/$0.75 ($0.90 total). DeepSeek R1: $0.70/$2.50 ($3.20 total), with 91 MMLU.
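The multiples quoted here follow from dividing the GPT-5.4 total by each DeepSeek total. The sketch below only rechecks that arithmetic from the prices on this page; the helper names are illustrative.

```python
def total_price(input_price: float, output_price: float) -> float:
    """Combined price per 1M input tokens plus 1M output tokens."""
    return input_price + output_price


def times_cheaper(baseline_total: float, other_total: float) -> float:
    """How many times cheaper `other_total` is than `baseline_total`."""
    return baseline_total / other_total


gpt_5_4_total = total_price(2.50, 15.00)

deepseek_totals = {
    "DeepSeek V3.2": total_price(0.26, 0.38),
    "DeepSeek V3.1": total_price(0.15, 0.75),
    "DeepSeek R1": total_price(0.70, 2.50),
}

for name, total in deepseek_totals.items():
    ratio = times_cheaper(gpt_5_4_total, total)
    print(f"{name}: ${total:.2f} total, {ratio:.1f}x cheaper than GPT-5.4")
```

The low end of the range comes from DeepSeek R1 and the high end from DeepSeek V3.2.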
Grok 4 (xAI): 93 MMLU, 256K context, $3/$15 ($18 total). Grok 4.20: $2/$6 ($8 total) for multi-agent tasks. Grok 4 Fast: 2M context, $0.20/$0.50 ($0.70 total). Grok Code Fast: $0.20/$1.50 ($1.70 total) for code generation.
Claude Opus 4.6: $5/$25 ($30 total), the flagship tier. Claude Sonnet 4.6: $3/$15 ($18 total), roughly on par with GPT-5.4 ($17.50). Claude Haiku 4.5: $1/$5 ($6 total) vs GPT-5.4 nano: $0.20/$1.25 ($1.45 total). The new Opus 4.5/4.6 models are premium-priced but capable.
Top free: Gemma 3 27B (Google, 74 MMLU, 131K ctx), Gemma 3 12B (Google, 70 MMLU, 32K ctx). ⚠️ Llama 3.3 70B is no longer free on OpenRouter; it now costs $0.10/$0.32 ($0.42/1M total).
Grok 4 Fast: 2M tokens. GPT-5.4 Pro/mini/nano, GPT-4.1, Gemini 2.5 Pro/Flash: ~1M tokens. MiniMax M1: 1M-token context at $2.60/1M total, billed as the cheapest long-context option.
Gemini 2.5 Pro: $1.25/$10 ($11.25) — 36% cheaper than GPT-5.4 ($17.50). Gemini 3 Flash: $0.50/$3 ($3.50). Gemini 3.1 Pro: $2/$12 ($14) ≈ GPT-5.2.
DeepSeek V3.2 ($0.64, 88 MMLU), Qwen3.5 397B ($2.73, 87.5 MMLU), Qwen3 Max ($4.68, 86.5 MMLU), ERNIE 4.5 ($1.38, 84 MMLU), Kimi K2 ($2.75, 81 MMLU), HunYuan Pro ($0.71, 80 MMLU).
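One way to compare these models on value is MMLU points per dollar of total price. The page does not define its performance-to-price ratio, so this metric is an assumption; the data is taken from the list above.

```python
from typing import List, Tuple

# (model, total USD per 1M input + 1M output tokens, MMLU) from the list above.
MODELS: List[Tuple[str, float, float]] = [
    ("DeepSeek V3.2", 0.64, 88.0),
    ("Qwen3.5 397B", 2.73, 87.5),
    ("Qwen3 Max", 4.68, 86.5),
    ("ERNIE 4.5", 1.38, 84.0),
    ("Kimi K2", 2.75, 81.0),
    ("HunYuan Pro", 0.71, 80.0),
]


def mmlu_per_dollar(total_price: float, mmlu: float) -> float:
    """Assumed value metric: benchmark points bought per dollar of total price."""
    return mmlu / total_price


# Highest value first.
ranked = sorted(MODELS, key=lambda m: mmlu_per_dollar(m[1], m[2]), reverse=True)

for name, price, mmlu in ranked:
    print(f"{name}: {mmlu_per_dollar(price, mmlu):.1f} MMLU points per $")
```

A linear points-per-dollar metric favors very cheap models, so it is best read alongside the raw MMLU score rather than on its own.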