Tag: prompt-caching
-
№ 019
AI & Development
8 min read →
56x Leverage: What Claude Code’s Max Plan Actually Costs at Today’s API Rates
We've spent the past several weeks dissecting Claude Code's cache management bugs, building fixes, and documenting hidden costs. There's still one question we owe a clean answer to: what does
-
№ 019
AI & Development
7 min read →
Images That Won’t Die and Directories That Shouldn’t Be There
Opus 4.7 update (April 16, 2026): This post is more urgent than when we wrote it. Opus 4.7 introduces high-resolution image support (2576px, up from 1568px) and a new tokenizer
-
№ 019
AI & Development
10 min read →
The Undocumented TTL Downgrade: How Exceeding Your Quota Makes Everything Worse
Two Ways to Lose Your Cache: The TTL Mechanisms Nobody Told You About. In Part 3, we described the quota monitoring we built into our fetch interceptor, reading anthropic-ratelimit-unified-5h-utilization
-
№ 019
AI & Development
25 min read →
The Three-Layer Gate: What Actually Happens When You Cross Your Claude Code Quota
VSITS LLC, April 2026. This post is a follow-up to Friday's "The 5-Minute Baseline", which documented
-
№ 019
AI & Development
17 min read →
The 5-Minute Baseline: What We Found in Claude Code’s Tools Array
Update (2026-04-13): Our original recommendation to pin Claude Code to v2.1.81 is now outdated. Anthropic has partially fixed the resume-scatter bug in v2.1.90 and further improved cache stability in v2.1.104.
-
№ 019
AI & Development
8 min read →
Building a Fetch Interceptor to Fix Claude Code’s Cache — Without Touching Claude Code
In Part 2, we mapped out three bugs that break Claude Code's prompt caching: resume block scatter, fingerprint instability, and non-deterministic tool ordering. Each one independently busts the cache. Together,
-
№ 019
AI & Development
7 min read →
How Claude Code’s Prompt Cache Actually Works — And Three Ways It Breaks
In Part 1, we showed that Claude Code sessions were burning through quota at 26-28% per turn — because the prompt cache was being rebuilt from scratch on every API
-
№ 019
AI & Development
8 min read →
We Taught Our AI Agents to Take Coffee Breaks (And It Saved Us Real Money)
Here's something nobody tells you about Claude Code: when you step away from your keyboard,
-
№ 019
AI & Development
8 min read →
We Burned Through 100% of Our Claude Code Quota in Two Hours. Here’s What We Found.
Claude Code is, in our experience, the best AI coding tool available today. Anthropic's models are genuinely excellent — the reasoning is sharp, the context window is enormous, and when