Leaderboard

Best intelligence per dollar

Quality index divided by reuse-adjusted blended cost at 50% reuse. Rewards models that are both capable and cheap to run once caching is working.

#	Model	Provider	Quality / $	List blended	Cache discount
1	OpenAI gpt-oss-120b	Fireworks	325.7	$0.26	−90%
2	GPT-5 Mini	OpenAI	121.0	$0.69	−90%
3	Grok 4.3	xAI	72.7	$1.56	−84%
4	Kimi K2.6	Fireworks	61.4	$1.71	−83%
5	DeepSeek-V4-Pro	Fireworks	54.5	$2.17	−92%
6	GLM 5.1	Fireworks	51.7	$2.15	−81%
7	Claude Haiku 4.5	Anthropic	45.7	$2.00	−90%
8	Gemini 3.5 Flash	Google	32.8	$3.38	−90%
9	Gemini 3.1 Pro Preview	Google	24.6	$4.50	−90%
10	Claude Sonnet 4.6	Anthropic	17.6	$6.00	−90%
11	Claude Opus 4.8	Anthropic	11.9	$10.00	−90%
12	Claude Opus 4.7	Anthropic	11.8	$10.00	−90%
13	GPT-5.5	OpenAI	10.4	$11.25	−90%
14	Claude Fable 5	Anthropic	6.0	$20.00	−90%
15	GPT-5.5 Pro	OpenAI	1.3	$67.50	0%

Method

intelligence ÷ reuseAdjustedBlended(model, 0.5, 0.25).
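The page does not define reuseAdjustedBlended, so the following is only a sketch of one plausible reading. It assumes that the second argument (0.5) is the share of input tokens served from cache, that the third argument (0.25) is the output-token weight in the blended price, and that the cache discount applies only to cached input tokens. The Model fields, the snake_case function names, and the example numbers are all hypothetical and are not taken from the leaderboard.

```python
from dataclasses import dataclass


@dataclass
class Model:
    """Hypothetical record; field names are assumptions, not the site's schema."""
    name: str
    intelligence: float    # quality index
    input_price: float     # list price for input tokens (USD per unit of tokens)
    output_price: float    # list price for output tokens (same unit)
    cache_discount: float  # 0.90 means cached input costs 10% of list


def reuse_adjusted_blended(model: Model, reuse: float, output_share: float) -> float:
    """Blended price when a fraction `reuse` of input tokens hits the cache.

    Assumed reading of reuseAdjustedBlended(model, 0.5, 0.25):
    reuse = 0.5 and output_share = 0.25.
    """
    cached_input = model.input_price * (1.0 - model.cache_discount)
    # Mix full-price and discounted input according to the reuse rate.
    effective_input = (1.0 - reuse) * model.input_price + reuse * cached_input
    # Weight input against output; output tokens are assumed never cached.
    return (1.0 - output_share) * effective_input + output_share * model.output_price


def quality_per_dollar(model: Model, reuse: float = 0.5, output_share: float = 0.25) -> float:
    """Quality index divided by the reuse-adjusted blended price."""
    return model.intelligence / reuse_adjusted_blended(model, reuse, output_share)


if __name__ == "__main__":
    # Made-up model, purely for illustration.
    example = Model("example-model", intelligence=50.0,
                    input_price=1.0, output_price=5.0, cache_discount=0.90)
    print(reuse_adjusted_blended(example, 0.0, 0.25))  # list blended, no reuse
    print(reuse_adjusted_blended(example, 0.5, 0.25))  # blended at 50% reuse
    print(quality_per_dollar(example))
```

Under these assumptions, setting reuse to 0 gives the list blended price, and a larger cache discount lowers only the input portion of the cost, so models with expensive output tokens gain less from caching.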

Other lenses

Rank the same models differently.

Which model is cheapest once a typical agent prefix is served from cache?

Which model responds fastest when the prefix is already cached?

Which models balance code quality with reuse economics?

Which models convert reuse opportunity into billed savings most reliably?

A diagnostic measures your real reuse and re-ranks the catalog for the way you actually call models.