I spent $788 on an AI coding agent in one day. Here's the breakdown.

Source: dev.to · Published: 2026-06-13T08:18Z · Topic: ai-agents

A developer spent $788 on API calls from an AI coding agent in a single day, running 3,572 calls across four models. The breakdown reveals that 73% of calls went to the most expensive model, Fable 5, costing $617, while cheaper models could have handled most tasks. The developer built an open-source list of AI gateways with a reproducible cost benchmark to help others optimize model routing.

I left an AI coding agent running for one day. Then I read the invoice.

$788. In about 13 hours.

I'm posting the real breakdown because I think a lot of people are quietly running up this kind of bill without seeing where it goes — and the fix is boring and effective.

One day, 10:21–23:05. 11 sessions, 3,572 API calls across 4 models:

Model	Calls	Output tokens	Cache-read tokens	Cost
Fable 5 ($10/$50)	2,613	1.04M	448M	~$617
Opus 4.8 ($5/$25)	671	769K	248M	~$168
Haiku 4.5 ($1/$5)	242	27K	9M	~$1.70
Sonnet 4.6 ($3/$15)	46	6K	2M	~$0.90
Total	3,572			~$788

Two numbers reframed how I think about this:

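That's not a 2× or 3× gap. Going by the table, Fable 5 cost roughly 34× more per call than Haiku 4.5 (~$0.24 vs ~$0.007) and ~360× more in total (~$617 vs ~$1.70), and I was sending almost everything to the expensive end out of pure default-laziness.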
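The gap can be recomputed from the table rather than taken on faith. Here is a minimal sketch that divides each model's cost by its call count and reports each model's share of calls and spend. The numbers are copied from the table; the names `USAGE`, `per_call_cost`, and `shares` are made up for illustration.

```python
# (calls, approximate cost in USD) per model, copied from the table above.
USAGE = {
    "Fable 5": (2613, 617.00),
    "Opus 4.8": (671, 168.00),
    "Haiku 4.5": (242, 1.70),
    "Sonnet 4.6": (46, 0.90),
}


def per_call_cost(usage):
    """Return {model: average cost of one call in USD}."""
    return {model: cost / calls for model, (calls, cost) in usage.items()}


def shares(usage):
    """Return {model: (fraction of all calls, fraction of total cost)}."""
    total_calls = sum(calls for calls, _ in usage.values())
    total_cost = sum(cost for _, cost in usage.values())
    return {
        model: (calls / total_calls, cost / total_cost)
        for model, (calls, cost) in usage.items()
    }


if __name__ == "__main__":
    unit = per_call_cost(USAGE)
    for model, (call_share, cost_share) in shares(USAGE).items():
        print(
            f"{model:<11} ${unit[model]:.4f}/call  "
            f"{call_share:6.1%} of calls  {cost_share:6.1%} of cost"
        )
    # Two different ratios: average cost per call vs. total spend per model.
    print(f"per-call ratio:    {unit['Fable 5'] / unit['Haiku 4.5']:.0f}x")
    print(f"total-spend ratio: {USAGE['Fable 5'][1] / USAGE['Haiku 4.5'][1]:.0f}x")
```

Printing the per-call ratio and the total-spend ratio side by side keeps the two from being confused, since they differ by about an order of magnitude on this data.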

Notice 448M + 248M = ~700M cache-read tokens. Agentic coding re-sends a big context every turn; cache reads are billed at ~0.1× input, which is the only reason this was $788 and not several thousand. The flip side: anything that breaks your cache (a changed timestamp, reordered tool list, a proxy that normalizes prompts) silently re-bills at full input price. On this volume, a broken cache is a 10× event.
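To put a number on the "10× event", here is a rough sketch that prices the day's cache-read tokens twice: once at the ~0.1× rate and once at full input price. It assumes the "$10/$50" notation in the table means input/output price in USD per 1M tokens, which the table does not spell out. The names `CACHE_READS` and `cache_read_cost` are illustrative.

```python
# (input price in USD per 1M tokens, cache-read tokens in millions).
# ASSUMPTION: "$10/$50" in the table is read as input/output price per 1M tokens.
CACHE_READS = {
    "Fable 5": (10.0, 448),
    "Opus 4.8": (5.0, 248),
    "Haiku 4.5": (1.0, 9),
    "Sonnet 4.6": (3.0, 2),
}

CACHE_READ_MULTIPLIER = 0.1  # the "~0.1x input" rate quoted in the text


def cache_read_cost(reads, multiplier):
    """Total USD for all cache-read tokens, billed at multiplier x input price."""
    return sum(price * mtok * multiplier for price, mtok in reads.values())


if __name__ == "__main__":
    cached = cache_read_cost(CACHE_READS, CACHE_READ_MULTIPLIER)
    uncached = cache_read_cost(CACHE_READS, 1.0)  # every read re-billed as fresh input
    print(f"billed as cache reads: ${cached:,.0f}")
    print(f"billed as fresh input: ${uncached:,.0f}")
```

Under that assumption, cache reads account for roughly $574 of the day's bill, and the same tokens at full input price would cost about $5,735. The rest of the $788 would be output tokens plus input the table does not itemize.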

I didn't conclude "stop using good models." I concluded "stop sending everything to them." The pattern:

This is exactly what an AI gateway / model router does — it's the layer that lets you express "cheap by default, escalate when it's hard" once, instead of hard-coding a model everywhere. I've since taken the flagship out of the default path, and the same workload now lands in the low tens of dollars a day.
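"Cheap by default, escalate when it's hard" can be sketched in a few lines. This is not the author's routing policy or any particular gateway's API. It is a hypothetical `route` function that walks the models from cheapest to most expensive and stops at the first answer that passes a caller-supplied check. `call_model` and `is_good_enough` are placeholders you would wire to your own client and your own acceptance test (tests passing, a linter, a schema check).

```python
from dataclasses import dataclass
from typing import Callable, Sequence

# Tiers ordered by the list prices in the table, cheapest first.
TIERS = ("Haiku 4.5", "Sonnet 4.6", "Opus 4.8", "Fable 5")


@dataclass
class Attempt:
    model: str   # the model whose answer was returned
    answer: str


def route(
    task: str,
    tiers: Sequence[str],
    call_model: Callable[[str, str], str],       # hypothetical: (model, task) -> answer
    is_good_enough: Callable[[str, str], bool],  # hypothetical: (task, answer) -> ok?
) -> Attempt:
    """Try each tier in order and return the first answer that passes the check."""
    if not tiers:
        raise ValueError("tiers must not be empty")
    answer = ""
    for model in tiers:
        answer = call_model(model, task)
        if is_good_enough(task, answer):
            return Attempt(model, answer)
    # Nothing passed: fall back to the answer from the last (most capable) tier.
    return Attempt(tiers[-1], answer)


if __name__ == "__main__":
    # Stub client and check, for demonstration only.
    def fake_call(model: str, task: str) -> str:
        return f"[{model}] {task}"

    def fake_check(task: str, answer: str) -> bool:
        # Pretend only Opus 4.8 and above can handle tasks marked "hard".
        return "hard" not in task or "Opus" in answer or "Fable" in answer

    print(route("rename a variable", TIERS, fake_call, fake_check).model)
    print(route("hard: redesign the scheduler", TIERS, fake_call, fake_check).model)
```

The trade-off is that a failed cheap attempt is paid for before the escalation. That only wins when most tasks pass at a low tier, which is what the call breakdown above suggests for this workload.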

While digging into routing I built an open-source, pain-point-organized list of AI gateways, with a reproducible cost benchmark that prices concrete workloads (including a coding scenario with reasoning tokens) across 11 models, computed by a unit-tested script. Plug in your own token mix and see your real number before the invoice does: github.com/cuihuan/awesome-ai-gateway
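For a feel of what "plug in your own token mix" means, here is a small stand-in calculator. It is not the repository's benchmark script. It assumes the table's prices are USD per 1M input/output tokens and that cache reads bill at 0.1× input. It ignores cache writes and any other surcharge. `TokenMix`, `PRICES`, `price`, and `compare` are illustrative names.

```python
from dataclasses import dataclass

# ASSUMPTION: (input, output) prices in USD per 1M tokens, read off the table.
PRICES = {
    "Fable 5": (10.0, 50.0),
    "Opus 4.8": (5.0, 25.0),
    "Sonnet 4.6": (3.0, 15.0),
    "Haiku 4.5": (1.0, 5.0),
}
CACHE_READ_MULTIPLIER = 0.1  # ASSUMPTION: cache reads bill at 0.1x input price


@dataclass(frozen=True)
class TokenMix:
    fresh_input: int  # uncached input tokens
    cache_read: int   # input tokens served from the prompt cache
    output: int       # generated tokens


def price(mix: TokenMix, input_price: float, output_price: float) -> float:
    """USD cost of one workload at the given per-1M-token prices."""
    per_in = input_price / 1_000_000
    per_out = output_price / 1_000_000
    return (
        mix.fresh_input * per_in
        + mix.cache_read * per_in * CACHE_READ_MULTIPLIER
        + mix.output * per_out
    )


def compare(mix: TokenMix) -> list[tuple[str, float]]:
    """Price the same workload on every model, cheapest first."""
    costs = [(model, price(mix, *p)) for model, p in PRICES.items()]
    return sorted(costs, key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical session: 1M fresh input, 10M cache reads, 100K output tokens.
    session = TokenMix(fresh_input=1_000_000, cache_read=10_000_000, output=100_000)
    for model, usd in compare(session):
        print(f"{model:<11} ${usd:,.2f}")
```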

If you're running agents daily — have you actually looked at your per-model breakdown? I'd bet most of the bill is one model doing work a cheaper one could.


Read original on dev.to → dev.to/_7a561cb4673b6d2a455c5/i-spent-788-on-an-…
