Claude Fable 5 costs $6K just to benchmark, highlighting the soaring price of frontier AI

wpnews.pro

cd /news/large-language-models/claude-fable-5-costs-6k-just-to-benc… · home › topics › large-language-models › article

[ARTICLE · art-31885] src=cryptobriefing.com ↗ pub=2026-06-17T23:23Z topic=large-language-models verified=true sentiment=· neutral

Claude Fable 5 costs $6K just to benchmark, highlighting the soaring price of frontier AI

Anthropic's Claude Fable 5 model cost $6,227.74 to benchmark on the Artificial Analysis Intelligence Index, consuming 87 million output tokens at $50 per million. The model scored 64.9, topping the index and surpassing OpenAI's GPT-5.5, highlighting the soaring costs of frontier AI evaluation.

read2 min views33 publishedJun 17, 2026

Anthropic's new flagship reasoning model tops the Artificial Analysis Intelligence Index but burns through 87 million output tokens in the process

Running a benchmark suite on Anthropic’s newest AI model now costs roughly what a used Honda Civic does. Claude Fable 5, the company’s latest flagship reasoning model, racked up a bill of $6,227.74 just to complete the Artificial Analysis Intelligence Index evaluations.

The model launched on June 9, 2026, and immediately claimed the top spot on the Intelligence Index with a score of 64.9. That dethroned its predecessor, Claude Opus 4.8, and left OpenAI’s GPT-5.5 in the rearview at 58.6.

The price of being the best #

Claude Fable 5 is priced at $10 per million input tokens and $50 per million output tokens. The benchmark evaluation consumed 87 million output tokens, which is why the final tab landed north of $6K.

Reasoning models produce far more output tokens than standard chat models because they work through problems step by step. That 87 million token output figure is a direct consequence of asking a reasoning model to grind through coding challenges, logic puzzles, and complex multi-step evaluations.

Anthropic does offer a 90% discount on prompt caching for repeated input hits, but that doesn’t help much when your primary cost driver is output generation.

What you get for the money #

On SWE-Bench Pro, a benchmark that tests a model’s ability to solve real-world software engineering problems, Fable 5 scored 80.3% accuracy. That’s a substantial improvement over Claude Opus 4.8’s 69.2% and ahead of GPT-5.5’s 58.6%.

The model also ships with a 1 million token context window and handles both image and text inputs.

Disclosure: This article was edited by Editorial Team. For more information on how we create and review content, see our

Editorial Policy.

source & further reading

cryptobriefing.com — original article AI-related ETFs now account for a record 19% of all US ETF trading volume Amazon’s debt nearly doubles to $129B amid $220B data center spending spree Amazon’s Anthropic investment boosts S&P 500 profit growth

~/api · this article 200

$curl api.wpnews.pro/v1/news/claude-fable-5-costs-6k-…

Read original on cryptobriefing.com → cryptobriefing.com/claude-fable-5-benchmark-cost…

mentioned entities

Anthropic

Claude Fable 5

Artificial Analysis Intelligence Index

Claude Opus 4.8

OpenAI

GPT-5.5

SWE-Bench Pro

metadata

slugclaude-fable-5-costs-6k-just-to-benchmark-highlighting-the-soaring-price-of-ai

topic#large-language-models

secondary2 topics

sentimentneutral

canonicalcryptobriefing.com

navigation

← prevVlk: MemAct for the IDE – persis…

next →Most Engineers Use AI. Few Engin…

── more in #large-language-models 4 stories · sorted by recency

dev.to · 2 Aug · #large-language-models

I Ran 8 AI APIs Through the Same 50 Prompts — Here's the Real Cost Breakdown

byteiota.com · 2 Aug · #large-language-models

Claude Opus 4.1 Retires August 5: Migrate to 4.8 Now

hmans.dev · 2 Aug · #large-language-models

Chatto Is Robots

pub.towardsai.net · 2 Aug · #large-language-models

DeepSeek V4-Flash vs GLM-5.2: The 1.7-Point Win Collapses When You Swap the Harness

── more on @anthropic 3 stories trending now

wpnews · 1 Aug · #ai-products

OpenAI Atlas Shuts Down August 9: Migration Guide

wpnews · 1 Aug · #ai-agents

Quality Isn't Accidental — Maker/Checker Separation and Automated Validation

wpnews · 2 Aug · #developer-tools

Agent-Browser – Browser Automation for AI

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required