Median realized reuse ratio
The median share of input tokens served from cache once provider caching is configured through Zumik.
+14 percentage points of input tokens over the window (Jan to Jun)
Jan: 39
Feb: 42
Mar: 46
Apr: 48
May: 51
Jun: 53
Measured in % of input tokens.
What this trend means.
Realized reuse keeps climbing as prompt ordering improves. The ceiling is the opportunity; the gap between realized reuse and that ceiling usually comes from prompt construction, not infrastructure.
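To make the prompt-ordering point concrete, here is a minimal sketch, not a description of how Zumik or any provider measures reuse. It assumes cache reuse is limited to the leading run of tokens that a request shares with the previous one, and it uses whitespace splitting as a stand-in for a real tokenizer. The helper names (shared_prefix_len, reusable_share, volatile_first, stable_first) and the sample prompts are hypothetical.

```python
# Illustrative only: whitespace "tokens" stand in for a real tokenizer,
# and reuse is assumed to be limited to the shared leading prefix.

def shared_prefix_len(a: list[str], b: list[str]) -> int:
    """Count the leading tokens that two prompts have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def reusable_share(prev: list[str], curr: list[str]) -> float:
    """Fraction of the current prompt that repeats the previous prompt's prefix."""
    if not curr:
        return 0.0
    return shared_prefix_len(prev, curr) / len(curr)


# Hypothetical stable content (identical on every request).
SYSTEM = "You are a support assistant .".split()
DOCS = "Refund policy : 30 days .".split()


def volatile_first(question: str) -> list[str]:
    """Per-request text first: the shared prefix breaks at the first token."""
    return question.split() + SYSTEM + DOCS


def stable_first(question: str) -> list[str]:
    """Stable text first: the shared prefix covers all of SYSTEM and DOCS."""
    return SYSTEM + DOCS + question.split()


q1 = "Where is my order ?"
q2 = "Can I get a refund ?"

share_volatile = reusable_share(volatile_first(q1), volatile_first(q2))
share_stable = reusable_share(stable_first(q1), stable_first(q2))

print(f"volatile first: {share_volatile:.0%} of tokens reusable")
print(f"stable first:   {share_stable:.0%} of tokens reusable")
```

Under these assumptions, the same content yields no reusable prefix when the per-request text comes first and a large one when the stable text comes first. Real providers apply their own tokenization, minimum cacheable lengths, and expiry rules, so treat the numbers as directional only.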
Tie-in
Trends describe the population; a diagnostic measures you. The two together tell you whether you are ahead of or behind the curve on reuse.
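As a rough illustration of that comparison, the sketch below computes a workload's reuse ratio as cached input tokens divided by total input tokens, then reports the gap to the trend median for a given month. The monthly values are the ones charted above. The function names and the (cached, total) tuple layout are hypothetical, and aggregating token counts across requests is an assumption; the diagnostic itself may aggregate differently.

```python
# Trend medians from the chart above, in % of input tokens.
TREND = {"Jan": 39, "Feb": 42, "Mar": 46, "Apr": 48, "May": 51, "Jun": 53}


def reuse_ratio(cached_input_tokens: int, total_input_tokens: int) -> float:
    """Share of input tokens served from cache, as a percentage."""
    if total_input_tokens <= 0:
        raise ValueError("total_input_tokens must be positive")
    return 100.0 * cached_input_tokens / total_input_tokens


def workload_reuse(requests: list[tuple[int, int]]) -> float:
    """Aggregate reuse over (cached, total) token counts per request (assumed layout)."""
    cached = sum(c for c, _ in requests)
    total = sum(t for _, t in requests)
    return reuse_ratio(cached, total)


def gap_to_trend(workload_pct: float, month: str) -> float:
    """Positive means ahead of the trend median; negative means behind it."""
    return workload_pct - TREND[month]


# Hypothetical workload: three requests of 1,000 input tokens each.
sample = [(600, 1000), (200, 1000), (700, 1000)]
pct = workload_reuse(sample)
print(f"workload reuse: {pct:.1f} %")
print(f"gap to Jun median: {gap_to_trend(pct, 'Jun'):+.1f} points")
```

A negative gap here only says the sample workload sits below the population median for that month; it does not say why.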
Measure your reuse against the trend.
Run a diagnostic and see where your workload sits relative to these aggregate patterns.