Tag

prompt-caching

№ 019 AI & Development

56x Leverage: What Claude Code’s Max Plan Actually Costs at Today’s API Rates

We've spent the past several weeks dissecting Claude Code's cache management bugs, building fixes, and documenting hidden costs. There's still one question we owe a clean answer to: what does

29 Apr 2026

1,592 words

8 min read →
№ 019 AI & Development

Images That Won’t Die and Directories That Shouldn’t Be There

Opus 4.7 update (April 16, 2026): This post is more urgent than when we wrote it. Opus 4.7 introduces high-resolution image support (2576px, up from 1568px) and a new tokenizer

16 Apr 2026

1,233 words

7 min read →
№ 019 AI & Development

The Undocumented TTL Downgrade: How Exceeding Your Quota Makes Everything Worse

Two Ways to Lose Your Cache: The TTL Mechanisms Nobody Told You About In Part 3, we described the quota monitoring we built into our fetch interceptor — reading anthropic-ratelimit-unified-5h-utilization

14 Apr 2026

1,847 words

10 min read →
№ 019 AI & Development

The Three-Layer Gate: What Actually Happens When You Cross Your Claude Code Quota

The Three-Layer Gate: What Actually Happens When You Cross Your Claude Code Quota VSITS LLC — April 2026 This post is a follow-up to Friday's "The 5-Minute Baseline", which documented

13 Apr 2026

4,943 words

25 min read →
№ 019 AI & Development

The 5-Minute Baseline: What We Found in Claude Code’s Tools Array

Update (2026-04-13): Our original recommendation to pin Claude Code to v2.1.81 is now outdated. Anthropic has partially fixed the resume-scatter bug in v2.1.90 and further improved cache stability in v2.1.104.

11 Apr 2026

3,361 words

17 min read →
№ 019 AI & Development

Building a Fetch Interceptor to Fix Claude Code’s Cache — Without Touching Claude Code

In Part 2, we mapped out three bugs that break Claude Code's prompt caching: resume block scatter, fingerprint instability, and non-deterministic tool ordering. Each one independently busts the cache. Together,

11 Apr 2026

1,473 words

8 min read →
№ 019 AI & Development

How Claude Code’s Prompt Cache Actually Works — And Three Ways It Breaks

In Part 1, we showed that Claude Code sessions were burning through quota at 26-28% per turn — because the prompt cache was being rebuilt from scratch on every API

9 Apr 2026

1,383 words

7 min read →
№ 019 AI & Development

We Taught Our AI Agents to Take Coffee Breaks (And It Saved Us Real Money)

We Taught Our AI Agents to Take Coffee Breaks (And It Saved Us Real Money) Here's something nobody tells you about Claude Code: when you step away from your keyboard,

7 Apr 2026

1,409 words

8 min read →
№ 019 AI & Development

We Burned Through 100% of Our Claude Code Quota in Two Hours. Here’s What We Found.

Claude Code is, in our experience, the best AI coding tool available today. Anthropic's models are genuinely excellent — the reasoning is sharp, the context window is enormous, and when

7 Apr 2026

1,435 words

8 min read →

prompt-caching

56x Leverage: What Claude Code’s Max Plan Actually Costs at Today’s API Rates

Images That Won’t Die and Directories That Shouldn’t Be There

The Undocumented TTL Downgrade: How Exceeding Your Quota Makes Everything Worse

The Three-Layer Gate: What Actually Happens When You Cross Your Claude Code Quota

The 5-Minute Baseline: What We Found in Claude Code’s Tools Array

Building a Fetch Interceptor to Fix Claude Code’s Cache — Without Touching Claude Code

How Claude Code’s Prompt Cache Actually Works — And Three Ways It Breaks

We Taught Our AI Agents to Take Coffee Breaks (And It Saved Us Real Money)

We Burned Through 100% of Our Claude Code Quota in Two Hours. Here’s What We Found.