Lead investigation · 23 Apr 2026

The Invisible Tax: Hidden Quota Burns and Phantom Rate Limits in Claude Code

Update (April 24, 2026): Anthropic published a post-mortem on April 23 acknowledging three issues: a reasoning effort default change (March 4–April 7), a caching bug that continuously cleared thinking on

Chris Nighswonger Editor · VSITS · 23 Apr 2026 · 8 min read · Co-authored with 2 AI agents

Read the investigation → All 18 posts

8.4M

Tool calls audited

Bugs filed

10–20×

Quota inflation

412M

Tokens reclaimed

Mar 4 – Apr 7 Window of the silent reasoning-effort regression

8 / 11 Confirmed by Anthropic post-mortem (24 Apr)

2,576 px Opus 4.7 image limit — silently doubling token spend

Live readings · meter.veritassuperaitsolutions.com · refreshed nightly last write · — · awaiting cron

cache.hit_rate

99.2% +14pt

8.4M calls · trailing 60d

tokens.reclaimed

412M $1,840

by interceptor since 04/07

bugs.filed

11 8 patched

three confirmed by post-mortem

agents.online

14 / 14 p50 218ms

production fleet, all green

investigations.open

3 2 closing

context-decay · image-tax · cache-clear

tools.published

4 ★ 2.1k

cc-interceptor · cc-coffee · token-tax

“There’s a difference between AI helping you build the thing that expresses your judgment, and AI replacing the judgment itself.”

— from the studio notes, VSITS †

Recent dispatches

The full archive · 18 →

№ 018 General 5 min read

Claude Code v2.1.117: What Changed, What the Data Shows, and Should You Upgrade

Previously in this series: How to Get a 99% Cache Hit Rate and Keeping Your Cache Warm with /coffee. Three things shipped in v2.1.117 that matter. One is a bug

22 Apr 2026

№ 017 AI & Development 8 min read

The Bugs Go Deeper: Silent Context Degradation in Claude Code

In our cache investigation series, we documented how prompt cache bugs were causing 10-20x cost inflation on Max plans. We built a fetch interceptor to fix them. We thought we’d

21 Apr 2026

№ 016 AI & Development 8 min read

What Debugging Claude Code’s Cache Taught Us About AI Tooling Economics

Update (April 17, 2026): This post was drafted April 7. In the 10 days since, the investigation expanded significantly: The interceptor now fixes 8+ cache bugs (not just the original

18 Apr 2026

№ 015 AI & Development 7 min read

Images That Won’t Die and Directories That Shouldn’t Be There

Opus 4.7 update (April 16, 2026): This post is more urgent than when we wrote it. Opus 4.7 introduces high-resolution image support (2576px, up from 1568px) and a new tokenizer

16 Apr 2026

Engagements

We do the forensic work the platform won't, on the systems your team is already running.

Three ways to engage — from a one-week tooling audit to embedded engineering for teams running agents in production.

01

Tooling audit

1 week · fixed We instrument, measure, and hand back a written report with prioritized fixes.

→
02

Interceptor deployment

2–4 weeks · scoped Your fleet, your cache, our patches. Quota burns down within days.

→
03

Embedded retainer

monthly · retainer A senior engineer in your stack. Long memory of every bug we've seen.

→