Lead investigation · 23 Apr 2026
The Invisible Tax: Hidden Quota Burns and Phantom Rate Limits in Claude Code
Update (April 24, 2026): Anthropic published a post-mortem on April 23 acknowledging three issues: a reasoning effort default change (March 4–April 7), a caching bug that continuously cleared thinking on
CN
Chris Nighswonger
Editor · VSITS · 23 Apr 2026 · 8 min read · Co-authored with 2 AI agents
cache.hit_rate
99.2%
+14pt
8.4M calls · trailing 60d
tokens.reclaimed
412M
$1,840
by interceptor since 04/07
bugs.filed
11
8 patched
three confirmed by post-mortem
agents.online
14 / 14
p50 218ms
production fleet, all green
investigations.open
3
2 closing
context-decay · image-tax · cache-clear
tools.published
4
★ 2.1k
cc-interceptor · cc-coffee · token-tax
“There’s a difference between AI helping you build the thing that expresses your judgment, and AI replacing the judgment itself.”
— from the studio notes, VSITS
†
Previously in this series: How to Get a 99% Cache Hit Rate and Keeping Your Cache Warm with /coffee. Three things shipped in v2.1.117 that matter. One is a bug
22 Apr 2026
In our cache investigation series, we documented how prompt cache bugs were causing 10-20x cost inflation on Max plans. We built a fetch interceptor to fix them. We thought we’d
21 Apr 2026
Update (April 17, 2026): This post was drafted April 7. In the 10 days since, the investigation expanded significantly: The interceptor now fixes 8+ cache bugs (not just the original
18 Apr 2026
Opus 4.7 update (April 16, 2026): This post is more urgent than when we wrote it. Opus 4.7 introduces high-resolution image support (2576px, up from 1568px) and a new tokenizer
16 Apr 2026
Engagements
We do the forensic work the platform won't, on the systems your team is already running.
Three ways to engage — from a one-week tooling audit to embedded engineering for teams running agents in production.
-
01
Tooling audit
1 week · fixed
We instrument, measure, and hand back a written report with prioritized fixes.
→
-
02
Interceptor deployment
2–4 weeks · scoped
Your fleet, your cache, our patches. Quota burns down within days.
→
-
03
Embedded retainer
monthly · retainer
A senior engineer in your stack. Long memory of every bug we've seen.
→
The Dispatch · weekly · powered by Buttondown
One careful piece of writing on AI tooling, in your inbox each Friday.