Show HN: Claude Code's $200 plan is a 17× subsidy on the raw API

wpnews.pro

cd /news/large-language-models/show-hn-claude-code-s-200-plan-is-a-… · home › topics › large-language-models › article

[ARTICLE · art-15578] src=github.com ↗ pub=2026-05-27T17:25Z topic=large-language-models verified=true sentiment=· neutral

Show HN: Claude Code's $200 plan is a 17× subsidy on the raw API

A developer reverse-engineered their own Claude Code logs to reveal that 64% of the $3,371 monthly bill at Opus 4.7 list rates goes to re-reading the same context every turn, not generating new content. Hidden reasoning tokens account for 84% of output costs and 60% of all re-read data, with caching already serving 98% of input yet failing to eliminate the re-read overhead. The analysis shows Claude Code's $200 plan represents a 17× subsidy on raw API costs, as the same workload without caching would cost approximately $22,630.

read3 min views10 publishedMay 27, 2026

Reverse-engineer a month of your own local Claude Code logs (~/.claude/projects/*/*.jsonl

) into where the tokens, time, and cost actually go — and run it on yours. Reads only local logs; nothing is sent anywhere.

What it found(one month of my own logs — 181 sessions, 25,564 model calls):

You don't pay to generate, you pay to re-read.~29M unique tokens →4.35B billed (~150×), because every turn re-sends the whole ~173K-token context.- The bill is 84% input / 16% output— and re-reading the same context is64%of it.- The biggest line is the one you never see: hidden reasoningis 84% of outputand~60% of everything re-read.~$3,371for the month at Opus 4.7 list rates. Caching already serves 98% of input — and re-reading isstill64% of the bill.Full write-up (all the tables, the why, the main-thread-vs-subagent split) →

[coralbricks.ai/blog/claude-code-token-xray]

pip install -r requirements.txt   # just tiktoken
python3 token_time_breakdown.py
python3 cost.py
python3 main_vs_sidecar.py
python3 reread_breakdown.py

tiktoken is OpenAI's tokenizer, not Claude's, so token

proportionsare reliable to ~±15%, not Claude-exact. The billed-token counts incost.py

come straight from the APIusage

blocks and are exact.

From cost.py

on my logs, priced at Opus 4.7 list rates:

Line item	Cost	Share
Input — re-reading context (cache reads)	$2,176	64%
Input — cache writes	$682	20%
Input — fresh (uncached)	$2	0%
Output — reasoning	$429	13%
Output — tool calls + summaries	$82	2%
Total
$3,371
100%

Caching is the only thing keeping it sane — without it the same work lists at ~$22,630 (~7×). Your numbers will differ; that's the point. Run it on yours.

— the headline table: tokens (marked input/output)token_time_breakdown.py

and wall-clock time per activity (reasoning, running commands, writing tool calls, subagents, summaries, reading/searching, editing) plus the passive-context rows (system prompt + tools, attachments, the typed prompt, injected reminders). One pass, so tokens and time stay consistent. Reasoning isn't stored in plaintext (only an encrypted signature), so it's recovered by subtraction:output − tool_calls − summaries

. Time is reconstructed from event timestamps.— billed token totals (cache reads / cache writes by TTL / fresh input / output) priced at Opus 4.7 list rates, plus the no-caching counterfactual.cost.py

— splits the human-driven main thread from spawned subagents (logged under nestedmain_vs_sidecar.py

*/subagents/*.jsonl

); reports billed tokens, per-model mix, cache-hit rate, turns per agent (per session for the main thread, per subagent for the sidecar), and cost for each, plus the combined total.— per-activityreread_breakdown.py

cumulativeinput: replays each session's context growth to show what each kind of context costs once it's re-read every turn. Reportsunique

vsre-read

tokens per activity (reasoning is the biggest re-read line). The replay is scaled to the measured billed input (exact); the per-activity split is a model.

One person's month on one machine — directional, not a benchmark. Claude Code is dynamic, so your split will differ. That's the point: run it on yours.
A generation-time gap also includes the model reading its context before it writes; Bash time is real execution (commands auto-approved), but code run in the background or a separate terminal isn't counted.
The system-prompt row is estimated from each session's first cache write.

If this helped you see where your Claude Code tokens, time, and cost actually go, please ⭐ the repo — it helps others find it. Curious what your re-read share comes out to.

Apache 2.0 — see the repository LICENSE.

source & further reading

github.com — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/show-hn-claude-code-s-20…

Read original on github.com → github.com/Coral-Bricks-AI/coral-ai/tree/main/cl…

mentioned entities

Claude Code

Opus 4.7

Anthropic

OpenAI

tiktoken

coralbricks.ai

metadata

slugshow-hn-claude-code-s-200-plan-is-a-17x-subsidy-on-the-raw-api

topic#large-language-models

secondary4 topics

sentimentneutral

canonicalgithub.com

navigation

← prevRobinhood opens its platform for…

next →Context Window Packing – Agent P…

── more in #large-language-models 4 stories · sorted by recency

twitter.com · 11 Jul · #large-language-models

GPT 5.6 Ultra better in Claude Code than in Codex?

machinebrief.com · 11 Jul · #large-language-models

The Great AI Shift: Why Open-Source Models Are Taking Over

machinebrief.com · 11 Jul · #large-language-models

AI Model Race Intensifies: A Rapid Sprint for Tech Giants

startupfortune.com · 11 Jul · #large-language-models

Elon Musk Orders Tesla Staff Off Claude Despite Admitting Grok Is Worse

── more on @claude code 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 8 Jul · #artificial-intelligence

SpaceXAI unveils Grok 4.5 AI model ahead of July 2026 public release

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required