Intelligence Per Dollar


Source: tomtunguz.com · Published: 2026-06-03 · Topic: artificial-intelligence

Microsoft added average token usage to a model release card yesterday, introducing a dual benchmark that measures both performance and cost. The card shows Microsoft's model scoring 71.6 on SWE-Bench Verified using roughly one-third the tokens of Claude Haiku 4.5, signaling a shift toward evaluating intelligence per dollar. The change reflects growing pressure on AI companies to compete on cost efficiency as enterprises like Uber and Microsoft rein in AI spending after exceeding their budgets.


Yesterday Microsoft added a new metric to a model release card, one that will likely become a standard.

Average token usage.

In the first row, the Microsoft model hits 71.6 on SWE-Bench Verified using about a third of the tokens Claude Haiku 4.5 burns.

Benchmarks are now measured on two dimensions: overall performance & the cost to achieve that intelligence.
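One way to read a two-dimensional benchmark is as a Pareto comparison: a model is only worth considering if no other model matches its score with fewer tokens. The sketch below is illustrative, not how any release card is computed. The model names, scores, and token figures are hypothetical placeholders, and `BenchmarkRow`, `dominates`, and `pareto_frontier` are names invented for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BenchmarkRow:
    name: str
    score: float       # benchmark score, higher is better
    avg_tokens: float  # average tokens per task (relative units), lower is better


def dominates(a: BenchmarkRow, b: BenchmarkRow) -> bool:
    """True if `a` is no worse than `b` on both axes and strictly better on one."""
    no_worse = a.score >= b.score and a.avg_tokens <= b.avg_tokens
    strictly_better = a.score > b.score or a.avg_tokens < b.avg_tokens
    return no_worse and strictly_better


def pareto_frontier(rows: list[BenchmarkRow]) -> list[BenchmarkRow]:
    """Keep only the rows that no other row dominates."""
    return [r for r in rows if not any(dominates(other, r) for other in rows)]


# Hypothetical rows: none of these numbers come from the article.
rows = [
    BenchmarkRow("model_a", score=70.0, avg_tokens=1.0),
    BenchmarkRow("model_b", score=70.0, avg_tokens=3.0),  # same score, 3x the tokens
    BenchmarkRow("model_c", score=75.0, avg_tokens=4.0),  # best score, most tokens
]

frontier = pareto_frontier(rows)
print([r.name for r in frontier])
```

Here `model_b` drops out because `model_a` reaches the same score with a third of the tokens, while `model_c` stays because nothing else matches its score.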

This is yet another sign that the era of subsidies [1] and tokenmaxxing [2] is ending.

Even the most valuable companies in the world cannot afford state-of-the-art intelligence for every conceivable use case. [3] Uber capped employee AI spending after blowing through its budget in four months. [4]

This new dual benchmark answers the buyer’s only question: what is my intelligence per dollar?

Artificial Analysis already benchmarks this. [6] GPT 5.5 & Claude Opus 4.8 land within a point of each other on the Intelligence Index, around 60. Running the index costs $3,357 on GPT 5.5 & $4,685 on Opus 4.8. Same answer, 40% more expensive.
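The 40% figure can be checked with a few lines of arithmetic. This is a minimal sketch using the two run costs quoted above and treating both models as scoring about 60 on the index, which is an approximation since the article only says they land within a point of each other. The function names are made up for this example.

```python
def cost_premium(baseline_cost: float, other_cost: float) -> float:
    """Fractional extra cost of `other_cost` relative to `baseline_cost`."""
    return (other_cost - baseline_cost) / baseline_cost


def dollars_per_point(run_cost: float, index_score: float) -> float:
    """Cost of running the index divided by the score it produced."""
    return run_cost / index_score


gpt_cost = 3357.0    # dollars to run the index on GPT 5.5 (from the article)
opus_cost = 4685.0   # dollars to run the index on Opus 4.8 (from the article)
index_score = 60.0   # approximate score, assumed equal for both models

premium = cost_premium(gpt_cost, opus_cost)
print(f"Opus 4.8 premium over GPT 5.5: {premium:.1%}")  # 39.6%
print(f"GPT 5.5:  ${dollars_per_point(gpt_cost, index_score):.2f} per index point")
print(f"Opus 4.8: ${dollars_per_point(opus_cost, index_score):.2f} per index point")
```

The premium works out to roughly 39.6%, which rounds to the 40% quoted in the text.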

Model companies must now compete on both dimensions. The application layer will compete one level up, on dollars per outcome: what a closed ticket, a shipped PR, or a resolved support case actually costs.
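Dollars per outcome is token spend divided by completed results. The sketch below shows that conversion under stated assumptions: the token volume, the price per million tokens, and the ticket count are all hypothetical, and the helper names are invented for illustration.

```python
def spend_from_tokens(tokens: int, price_per_million_tokens: float) -> float:
    """Convert a token count into dollars at a flat per-million-token price."""
    return tokens / 1_000_000 * price_per_million_tokens


def dollars_per_outcome(total_spend: float, outcomes_completed: int) -> float:
    """Average cost of one finished result (closed ticket, shipped PR, ...)."""
    if outcomes_completed <= 0:
        raise ValueError("need at least one completed outcome")
    return total_spend / outcomes_completed


# Hypothetical month for a support agent: none of these figures are from the article.
tokens_used = 50_000_000   # total tokens consumed
price = 2.0                # assumed dollars per million tokens
tickets_closed = 400       # outcomes actually delivered

spend = spend_from_tokens(tokens_used, price)
cost_per_ticket = dollars_per_outcome(spend, tickets_closed)
print(f"Spend: ${spend:.2f}, cost per closed ticket: ${cost_per_ticket:.2f}")
```

Under these assumptions, a model that uses a third of the tokens for the same number of closed tickets cuts the cost per ticket to a third, provided the per-token price is the same.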

Every layer in the stack now has to price the same way the customer thinks: per result, not per token.

[1] [The Unsustainable Subsidy](https://tomtunguz.com/ai-model-inflation/): The era of AI subsidies is ending.

[2] [Tokenmaxxing](https://tomtunguz.com/tokenmaxxing/): Models that game benchmarks with extra tokens are losing their edge.

[3] Microsoft cancels Claude Code licenses, shifting developers to GitHub Copilot CLI: Microsoft cancelled Claude Code licenses across its Experiences and Devices division (Windows, Microsoft 365, Outlook, Teams, Surface) after engineering usage outran budgets.

[4] Uber caps employee AI spending after blowing through budget in 4 months.

[5] Salesforce Spends $300M on AI, Freezes Engineering Hires.

[6] AI Model & API Providers Analysis: Independent analysis of AI model costs.


Read original on tomtunguz.com → www.tomtunguz.com/tokens-per-result/
