How each provider really caches

The provider you route to decides how much of your reuse opportunity becomes a billed saving. These pages break down caching mechanics, batch tiers, BYOK support, and the proprietary capture numbers we measure on each.

Caching mechanics, side by side.

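| Provider | Cache type | Read discount | Write premium | Batch discount | BYOK |
| --- | --- | --- | --- | --- | --- |
| OpenAI | automatic | 75% | none | 50% | Yes |
| Anthropic | explicit | 90% | +25% | 50% | Yes |
| Google Gemini | implicit | 75% | none | 50% | Yes |
| xAI | context | 75% | none | - | Yes |
| Fireworks AI | automatic | 74% | none | 40% | Yes |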
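
To see how a read discount and a write premium combine, here is a minimal sketch of the arithmetic. It assumes a simplified billing model that is ours, not any provider's documented formula: every input token is billed either at the base price, at the discounted cache-read price, or at the cache-write price. The names `CachePricing` and `relative_input_cost` are hypothetical, the percentages are copied from the table, and batch discounts are not modeled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CachePricing:
    """Cache pricing for one provider, as fractions of the base input price."""
    read_discount: float   # e.g. 0.75 means cached reads cost 25% of base
    write_premium: float   # e.g. 0.25 means cache writes cost 125% of base


# Figures copied from the comparison table; "none" is stored as 0.0.
PRICING = {
    "OpenAI": CachePricing(read_discount=0.75, write_premium=0.0),
    "Anthropic": CachePricing(read_discount=0.90, write_premium=0.25),
    "Google Gemini": CachePricing(read_discount=0.75, write_premium=0.0),
    "xAI": CachePricing(read_discount=0.75, write_premium=0.0),
    "Fireworks AI": CachePricing(read_discount=0.74, write_premium=0.0),
}


def relative_input_cost(pricing: CachePricing,
                        read_share: float,
                        write_share: float = 0.0) -> float:
    """Return input cost as a multiple of the uncached cost.

    read_share:  fraction of input tokens served from cache.
    write_share: fraction of input tokens written to cache.
    The remaining tokens are billed at the base price (assumed model).
    """
    if read_share < 0 or write_share < 0 or read_share + write_share > 1:
        raise ValueError("shares must be non-negative and sum to at most 1")
    uncached_share = 1.0 - read_share - write_share
    return (
        uncached_share
        + read_share * (1.0 - pricing.read_discount)
        + write_share * (1.0 + pricing.write_premium)
    )


if __name__ == "__main__":
    # Compare providers at the same assumed traffic mix:
    # 80% cache reads, 10% cache writes, 10% uncached.
    for name, pricing in PRICING.items():
        cost = relative_input_cost(pricing, read_share=0.8, write_share=0.1)
        print(f"{name}: {cost:.3f}x of uncached input cost")
```

Under this model a write premium only pays off when the cached prefix is read back often enough, which is why the read share matters as much as the headline discount.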

Go deeper

Caching is where the savings live or die.

The prompt-caching guides explain exactly how to capture each provider's discount, and which mistakes quietly throw it away.

Use any provider. Capture every discount.

Bring your own keys or use managed accounts, and let Zumik capture provider-native caching and batch tiers.