cd /news/artificial-intelligence/coinbase-cut-its-ai-spend-in-half-wi… · home topics artificial-intelligence article
[ARTICLE · art-45539] src=dev.to ↗ pub= topic=artificial-intelligence verified=true sentiment=· neutral

Coinbase Cut Its AI Spend in Half Without Throttling Engineers - Here's the Playbook

Coinbase halved its AI spending without restricting engineer access, CEO Brian Armstrong revealed on X. The company achieved this by routing traffic to cheaper open-weight models like GLM 5.2 and Kimi 2.7, with 91% of engineers never hitting previous usage limits. The strategy signals growing enterprise pressure on premium AI providers like Anthropic and OpenAI.

read1 min views1 publishedJun 30, 2026

Coinbase halved its AI spend while token usage kept growing exponentially. CEO Brian Armstrong posted the breakdown on X this week — five concrete levers, no access caps, and 91% of engineers never hit the old usage limits.

That last point matters. This isn't a story about restricting developers. It's a story about routing smarter.

"We're experimenting with defaulting to open weight GLM 5.2 and Kimi 2.7 through our LLM gateway, while still encouraging engineers to choose the right model for the task."

— Brian Armstrong, CEO Coinbase

Armstrong outlined five levers Coinbase pulled:

GLM 5.2 runs at roughly $1.40/$4.40 per million input/output tokens. Anthropic Opus 4.8 is $5/$25 — a 3–6x price differential that compounds fast at Coinbase-scale token volumes.

Coinbase isn't alone. Snowflake's CEO found GLM 5.2 competitive with Opus 4.7 at a fraction of the cost. Lindy, an AI startup, moved off Claude entirely to DeepSeek v4. These aren't one-off experiments — they're signals that enterprise budget pressure is shifting real workloads to cheaper open-weight models.

That's direct revenue pressure on Anthropic and OpenAI, both of which are approaching or actively building towards IPO moments that require durable enterprise revenue growth.

If you're running AI infra at any scale, three of Coinbase's five tactics are independently implementable right now: Open-weight Chinese models (GLM, Kimi, DeepSeek) carry licensing and data residency considerations worth checking against your compliance requirements — especially in regulated industries. Routing policies can also introduce silent quality degradation at edge cases, which Armstrong's post doesn't address. Test before you trust.

Source: Let's Data Science · Armstrong's X post (June 28, 2026) ✏️ Drafted with KewBot (AI), edited and approved by Drew.

── more in #artificial-intelligence 4 stories · sorted by recency
── more on @coinbase 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/coinbase-cut-its-ai-…] indexed:0 read:1min 2026-06-30 ·