cd /news/large-language-models/claude-sonnet-5-benchmark-results · home topics large-language-models article
[ARTICLE · art-45494] src=artificialanalysis.ai ↗ pub= topic=large-language-models verified=true sentiment=↑ positive

Claude Sonnet 5 – benchmark results

Anthropic released Claude Sonnet 5 (Adaptive Reasoning, Max Effort) on June 30, 2026, scoring 53 on the Artificial Analysis Intelligence Index, well above the average of 8 among comparable models. The model supports text and image input, outputs text, has a 1m token context window, and is competitively priced at $0.00 per 1M input and output tokens.

read4 min views1 publishedJun 30, 2026
Claude Sonnet 5 – benchmark results
Image: source

API Provider Benchmarks

Model summary

IntelligenceUpdated

Speed

Input Price

Output Price

Verbosity

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is amongst the leading models in intelligence and well priced when comparing to other models of similar price. The model supports text and image input, outputs text, and has a 1m tokens context window.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) scores 53 on the Artificial Analysis Intelligence Index, placing it well above average among comparable models (averaging 8). When evaluating the Intelligence Index, it generated 300M tokens, which is very verbose in comparison to the average of 37M.

Pricing for Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is $0.00 per 1M input tokens (competitively priced, average: $0.00) and $0.00 per 1M output tokens (competitively priced, average: $0.00).

Reasoning Yes This page shows the reasoning version of this model. A non-reasoning variant may also exist.
Input modality Supports: text, image
Output modality Supports: text
Context window 1m ~1500 A4 pages of size 12 Arial font

Metrics are compared against models of the same class:

  • Non-reasoning models → compared only with other non-reasoning models
  • Reasoning models → compared across both reasoning and non-reasoning
  • Open weights models → compared only with other open weights models of the same size class:
- Tiny: ≤4B parameters
- Small: 4B–40B parameters
- Medium: 40B–150B parameters
- Large: >150B parameters
  • Proprietary models → compared across proprietary and open weights models of the same price range, using a blended 3:1 input/output price ratio:

  • <$0.15 per 1M tokens

  • $0.15–$1 per 1M tokens

  • $1 per 1M tokens Highlights

IntelligenceUpdated #

Artificial Analysis Intelligence Index

Artificial Analysis Intelligence Index by Open Weights / Proprietary

Intelligence Evaluations

[GDPval-AA v2](/evaluations/gdpval-aa)Updated

Agentic real-world work tasks, (Elo-500)/2000

[𝜏³-Banking](/evaluations/tau3-banking)New

Agentic tool use

Agentic coding & terminal use

Coding

Reasoning & knowledge

Scientific reasoning

Physics reasoning

Knowledge

1 - hallucination rate

Long context reasoning

[AA-Briefcase](/evaluations/aa-briefcase)New

Agentic knowledge work, (Elo-500)/2000

Instruction following

Long-horizon agentic tasks

Kubernetes incident root-cause analysis

Visual reasoning

AA-BriefcaseNew #

AA-Briefcase Elo

Openness #

Artificial Analysis Openness Index: Score

Intelligence Index Comparisons #

Intelligence vs. Cost per Intelligence Index Task

Token UseUpdated #

Output Tokens per Intelligence Index Task

Price and CostUpdated #

Cost per Intelligence Index Task

Cost to Run Artificial Analysis Intelligence Index

Pricing: Cache Hit, Input, and Output

Context Window #

Context Window

SpeedUpdated #

Measured by Output Speed (tokens per second)

Output Speed

Time per Intelligence Index Task

Latency #

Measured by Time (seconds) to First Token

Latency: Time To First Answer Token

End-to-End Response Time #

Seconds to output 500 tokens, calculated based on time to first token, 'thinking' time for reasoning models, and output speed

End-to-End Response Time

Model Size (Open Weights Models Only) #

Model Size: Total and Active Parameters

Frequently Asked Questions #

Common questions about Claude Sonnet 5 (Adaptive Reasoning, Max Effort)

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) was released on June 30, 2026.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) was created by Anthropic.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) scores 53 on the Artificial Analysis Intelligence Index, placing it well above average among other reasoning models in a similar price tier (median: 8).

When evaluated on the Intelligence Index, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) generated 300M output tokens, which is at the higher end compared to other reasoning models in a similar price tier (median: 37M).

Yes, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is a reasoning model. It uses extended thinking or chain-of-thought reasoning to work through complex problems before providing an answer.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) supports text and image input.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) supports text output.

Yes, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) supports image input and can analyze, describe, and answer questions about images.

Yes, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is multimodal. It can process text and image input and generate text output.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) has a context window of 1.0M tokens. This determines how much text and conversation history the model can process in a single request.

No, Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is proprietary. The model weights are not publicly available.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) is a proprietary model and Anthropic has not disclosed the model size or parameter count.

Claude Sonnet 5 (Adaptive Reasoning, Max Effort) achieves a score of 53 on the Artificial Analysis Intelligence Index. This composite benchmark evaluates models across reasoning, knowledge, mathematics, and coding.

Check the providers page for current API availability. [View providers](/models/claude-sonnet-5/providers)

Visit the providers page to see availability. [Compare providers](/models/claude-sonnet-5/providers)
── more in #large-language-models 4 stories · sorted by recency
── more on @anthropic 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/claude-sonnet-5-benc…] indexed:0 read:4min 2026-06-30 ·