Median realized reuse ratio

The median share of input tokens served from cache once provider caching is configured through Zumik.

Updated 2026-06-09Rolling 6 months, agent workloads only
+14 % of input tokens over window
39
Jan
42
Feb
46
Mar
48
Apr
51
May
53
Jun

Measured in % of input tokens.

What this trend means.

Realized reuse keeps climbing as prompt ordering improves. The ceiling is opportunity; the gap to it is usually prompt construction, not infrastructure.

Tie-in

Trends describe the population; a diagnostic measures you. The two together tell you whether you are ahead of or behind the curve on reuse.

Measure your reuse against the trend.

Run a diagnostic and see where your workload sits relative to these aggregate patterns.