Claude Sonnet 5 – benchmark results

Source: artificialanalysis.ai · Published: 2026-06-30T20:09Z · Topic: large-language-models

Anthropic released Claude Sonnet 5 (Adaptive Reasoning, Max Effort) on June 30, 2026. It scores 53 on the Artificial Analysis Intelligence Index, well above the average of 8 among comparable models. The model accepts text and image input, outputs text, has a 1M-token context window, and is listed as competitively priced at $0.00 per 1M input and output tokens.

Published Jun 30, 2026 · 4 min read

Model summary

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is among the leading models in intelligence and is well priced compared with other models in a similar price range. The model supports text and image input, outputs text, and has a 1M-token context window.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) scores 53 on the Artificial Analysis Intelligence Index, placing it well above average among comparable models (averaging 8). During the Intelligence Index evaluation it generated 300M tokens, which is very verbose compared with the average of 37M.
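To put the two comparisons above on the same footing, the short sketch below expresses each figure as a multiple of its class average. The numbers are the ones quoted in this article; the helper name `ratio_to_average` is hypothetical and only illustrates the arithmetic.

```python
# Figures quoted in the article for Claude Sonnet 5 (Adaptive Reasoning, Max Effort).
INDEX_SCORE = 53        # Artificial Analysis Intelligence Index score
INDEX_AVERAGE = 8       # average among comparable models
TOKENS_USED_M = 300     # output tokens generated during the evaluation, in millions
TOKENS_AVERAGE_M = 37   # average among comparable models, in millions


def ratio_to_average(value: float, average: float) -> float:
    """Return how many times larger `value` is than `average` (hypothetical helper)."""
    if average <= 0:
        raise ValueError("average must be positive")
    return value / average


score_ratio = ratio_to_average(INDEX_SCORE, INDEX_AVERAGE)
token_ratio = ratio_to_average(TOKENS_USED_M, TOKENS_AVERAGE_M)

print(f"Index score: {score_ratio:.1f}x the class average")
print(f"Tokens used: {token_ratio:.1f}x the class average")
```

On these figures the model scores roughly 6.6 times the class average while generating roughly 8.1 times as many tokens.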

Pricing for Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is $0.00 per 1M input tokens (competitively priced, average: $0.00) and $0.00 per 1M output tokens (competitively priced, average: $0.00).

Reasoning	Yes. This page shows the reasoning version of this model; a non-reasoning variant may also exist.
Input modality	Supports: text, image
Output modality	Supports: text
Context window	1M tokens (~1,500 A4 pages in 12 pt Arial)

Metrics are compared against models of the same class:

Non-reasoning models → compared only with other non-reasoning models
Reasoning models → compared across both reasoning and non-reasoning models
Open weights models → compared only with other open weights models of the same size class:

- Tiny: ≤4B parameters
- Small: 4B–40B parameters
- Medium: 40B–150B parameters
- Large: >150B parameters

Proprietary models → compared across proprietary and open weights models of the same price range, using a blended 3:1 input/output price ratio:

- <$0.15 per 1M tokens
- $0.15–$1 per 1M tokens
- >$1 per 1M tokens
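The comparison classes above can be sketched as two small lookup functions. This is an illustration under stated assumptions, not Artificial Analysis's implementation: the blended price is assumed to be a weighted mean with three parts input to one part output, boundary values are assumed to fall in the lower size class and the middle price tier, and the top price tier is taken to mean above $1. All function names are hypothetical.

```python
def blended_price(input_price: float, output_price: float) -> float:
    """Blend per-1M-token prices, assuming a 3:1 input/output weighted mean."""
    return (3 * input_price + output_price) / 4


def price_tier(blended: float) -> str:
    """Map a blended price (USD per 1M tokens) to one of the three price ranges."""
    if blended < 0.15:
        return "<$0.15"
    if blended <= 1.0:  # assumption: exactly $1 stays in the middle tier
        return "$0.15-$1"
    return ">$1"


def size_class(params_billions: float) -> str:
    """Map an open weights parameter count (in billions) to a size class."""
    if params_billions <= 4:
        return "Tiny"
    if params_billions <= 40:  # assumption: upper bounds are inclusive
        return "Small"
    if params_billions <= 150:
        return "Medium"
    return "Large"


# Hypothetical prices, chosen only to exercise the functions.
example = blended_price(input_price=2.00, output_price=10.00)
print(example, price_tier(example))  # 4.0 >$1
print(size_class(70))                # Medium
```

Because the blend weights input three times as heavily as output, a model with cheap input and expensive output lands in a lower tier than its output price alone would suggest.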

Intelligence (updated)

Charts: Artificial Analysis Intelligence Index; Artificial Analysis Intelligence Index by Open Weights / Proprietary

Intelligence Evaluations

- [GDPval-AA v2](/evaluations/gdpval-aa) (updated): agentic real-world work tasks, (Elo-500)/2000
- [𝜏³-Banking](/evaluations/tau3-banking) (new): agentic tool use
- Agentic coding & terminal use
- Coding
- Reasoning & knowledge
- Scientific reasoning
- Physics reasoning
- Knowledge
- 1 - hallucination rate
- Long context reasoning
- [AA-Briefcase](/evaluations/aa-briefcase) (new): agentic knowledge work, (Elo-500)/2000
- Instruction following
- Long-horizon agentic tasks
- Kubernetes incident root-cause analysis
- Visual reasoning
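Two of the evaluations listed above (GDPval-AA v2 and AA-Briefcase) are reported as (Elo-500)/2000. The sketch below applies that formula literally and inverts it; the function names and the example rating are hypothetical, and no clamping is applied because the source does not mention any.

```python
def normalize_elo(elo: float) -> float:
    """Apply the (Elo - 500) / 2000 scaling quoted for GDPval-AA v2 and AA-Briefcase."""
    return (elo - 500) / 2000


def denormalize_elo(score: float) -> float:
    """Invert the scaling to recover an Elo rating from a normalized score."""
    return score * 2000 + 500


# Hypothetical rating, used only to show the arithmetic.
print(normalize_elo(1500))   # 0.5
print(denormalize_elo(0.5))  # 1500.0
```

Under this scaling an Elo of 500 maps to 0 and an Elo of 2500 maps to 1, which puts Elo-based results on a scale comparable to the other evaluations.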

AA-Briefcase (new)

Chart: AA-Briefcase Elo

Openness

Chart: Artificial Analysis Openness Index: Score

Intelligence Index Comparisons

Chart: Intelligence vs. Cost per Intelligence Index Task

Token Use (updated)

Chart: Output Tokens per Intelligence Index Task

Price and Cost (updated)

Charts: Cost per Intelligence Index Task; Cost to Run Artificial Analysis Intelligence Index; Pricing: Cache Hit, Input, and Output

Context Window

Chart: Context Window

Speed (updated)

Measured by Output Speed (tokens per second)

Charts: Output Speed; Time per Intelligence Index Task

Latency

Measured by Time (seconds) to First Token

Chart: Latency: Time To First Answer Token

End-to-End Response Time

Seconds to output 500 tokens, calculated from time to first token, 'thinking' time for reasoning models, and output speed.

Chart: End-to-End Response Time
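The end-to-end metric described above combines three measurements. A minimal sketch follows, assuming the components simply add up and that thinking time is measured separately from time to first token; the function name, parameter names, and example values are hypothetical rather than taken from the source.

```python
def end_to_end_seconds(
    time_to_first_token_s: float,
    output_tokens_per_s: float,
    thinking_s: float = 0.0,
    output_tokens: int = 500,
) -> float:
    """Estimate seconds to receive `output_tokens` tokens of answer.

    Assumes total = time to first token + thinking time + generation time.
    """
    if output_tokens_per_s <= 0:
        raise ValueError("output_tokens_per_s must be positive")
    generation_s = output_tokens / output_tokens_per_s
    return time_to_first_token_s + thinking_s + generation_s


# Hypothetical measurements: 1 s to first token, 10 s of thinking, 100 tokens/s.
print(end_to_end_seconds(1.0, 100.0, thinking_s=10.0))  # 16.0
```

For a reasoning model the thinking term can dominate the total, so a fast output speed does not by itself imply a short end-to-end time.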

Model Size (Open Weights Models Only)

Chart: Model Size: Total and Active Parameters

Frequently Asked Questions

Common questions about Claude Sonnet 5 (Adaptive Reasoning, Max Effort)

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) was released on June 30, 2026.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) was created by Anthropic.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) scores 53 on the Artificial Analysis Intelligence Index, placing it well above average among other reasoning models in a similar price tier (median: 8).

When evaluated on the Intelligence Index, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) generated 300M output tokens, which is at the higher end compared to other reasoning models in a similar price tier (median: 37M).

Yes, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is a reasoning model. It uses extended thinking or chain-of-thought reasoning to work through complex problems before providing an answer.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) supports text and image input.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) supports text output.

Yes, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) supports image input and can analyze, describe, and answer questions about images.

Yes, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is multimodal. It can process text and image input and generate text output.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) has a context window of 1.0M tokens. This determines how much text and conversation history the model can process in a single request.
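As a rough illustration of what a 1.0M-token window means in practice, the sketch below uses the page's own estimate of about 1,500 A4 pages per window to derive a tokens-per-page figure and check whether a request would fit. Real token counts depend on the tokenizer and the text, and counting reserved output tokens against the window is an assumption here; all names are hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 1_000_000
PAGES_PER_WINDOW = 1_500  # from the "~1500 A4 pages" estimate quoted on the page
TOKENS_PER_PAGE = CONTEXT_WINDOW_TOKENS / PAGES_PER_WINDOW  # roughly 667


def estimate_tokens(pages: float) -> int:
    """Rough token estimate for a document of `pages` A4 pages."""
    return round(pages * TOKENS_PER_PAGE)


def fits_in_context(
    document_pages: float,
    history_tokens: int = 0,
    reserved_output_tokens: int = 0,
) -> bool:
    """Check whether document, history, and reserved output fit in one request."""
    needed = estimate_tokens(document_pages) + history_tokens + reserved_output_tokens
    return needed <= CONTEXT_WINDOW_TOKENS


print(estimate_tokens(300))                         # 200000
print(fits_in_context(300, history_tokens=50_000))  # True
print(fits_in_context(1_500, history_tokens=1))     # False
```

An estimate like this is only a first check; an exact count requires running the text through the provider's tokenizer.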

No, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is proprietary. The model weights are not publicly available.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is a proprietary model and Anthropic has not disclosed the model size or parameter count.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) achieves a score of 53 on the Artificial Analysis Intelligence Index. This composite benchmark evaluates models across reasoning, knowledge, mathematics, and coding.

Check the providers page for current API availability and to compare providers. [View providers](/models/claude-sonnet-5/providers)

Read the original at artificialanalysis.ai/models/claude-sonnet-5
