Claude’s new model is more ‘honest’ when it messes up

wpnews.pro

cd /news/artificial-intelligence/claudes-new-model-is-more-honest-whe… · home › topics › artificial-intelligence › article

[ARTICLE · art-16666] src=theverge.com ↗ pub=2026-05-28T17:00Z topic=artificial-intelligence verified=true sentiment=↑ positive

Claude’s new model is more ‘honest’ when it messes up

Anthropic released Claude Opus 4.8 on Thursday, claiming the model is more likely to flag uncertainties about its work and less likely to make unsupported claims. The AI lab said early testers found Opus 4.8 is roughly four times less likely than its predecessor to allow flaws in code to pass unremarked. The update also lets users direct the amount of effort Claude puts into a task and introduces dynamic workflows for running hundreds of parallel subagents in a single session.

read1 min views11 publishedMay 28, 2026

Anthropic is releasing Claude Opus 4.8 on Thursday, and the company is touting the model’s “honesty.”

Opus 4.8 is launching on Thursday.

According to Anthropic, it trains “all [its] models to be honest — for instance, to avoid making claims that they can’t support.” But it notes that “a general problem with AI models is that they sometimes jump to conclusions, confidently presenting their work as making progress despite thin evidence.”

The AI lab claims that early testers have found that Opus 4.8 “is more likely to flag uncertainties about its work and less likely to make unsupported claims.” In the company’s evaluations, Opus 4.8 is “around 4x less likely than its predecessor to allow flaws in code it’s written to pass unremarked.”

In addition to the honesty improvements, with Opus 4.8, users can direct the amount of effort Claude puts into a task. Higher-effort responses will use more tokens, giving users the option of lower-effort responses if they don’t want to burn through their rate limits as quickly.

Anthropic is also launching a feature called “dynamic workflows” in research preview, which the company says will let Claude “take on even bigger tasks.” With dynamic workflows, “Claude can plan the work and then run hundreds of parallel subagents in a single session (and with Opus 4.8, the agents can run for even longer). It then verifies its outputs before reporting back to the user.”

Follow topics and authors from this story to see more like this in your personalized homepage feed and to receive email updates.

source & further reading

theverge.com — original article Lorde says Ray-Ban Meta AI glasses are ‘not sexy’ Apple’s failed self-driving car program left a legacy of powerful AI chips The fight against AI data centers is just beginning

~/api · this article 200

$curl api.wpnews.pro/v1/news/claudes-new-model-is-mor…

Read original on theverge.com → www.theverge.com/ai-artificial-intelligence/9390…

mentioned entities

Anthropic

Claude Opus 4.8

Claude

metadata

slugclaudes-new-model-is-more-honest-when-it-messes-up

topic#artificial-intelligence

secondary4 topics

sentimentpositive

canonicaltheverge.com

navigation

← prevThe Pulse: a trend of trying to …

next →Anthropic launches Opus 4.8, wit…

── more in #artificial-intelligence 4 stories · sorted by recency

machinebrief.com · 12 Jul · #artificial-intelligence

How AI is Redefining Software Engineering Teams: Insights from Anthropic

startupfortune.com · 12 Jul · #artificial-intelligence

Anthropic's New Lens Shows What Claude Is Thinking Before It Speaks

forbes.com.au · 12 Jul · #artificial-intelligence

Australia Tops Claude Usage

sourcefeed.dev · 13 Jul · #artificial-intelligence

AWS MCP Plugin Puts Agents Under CloudTrail

── more on @anthropic 3 stories trending now

wpnews · 23 May · #artificial-intelligence

AccessLens — a blind person's lanyard, powered by Gemma 4 on-device

wpnews · 21 May · #developer-tools

Antigravity CLI: A Hands-On Guide to Google's Terminal Coding Agent

wpnews · 8 Jul · #artificial-intelligence

SpaceXAI unveils Grok 4.5 AI model ahead of July 2026 public release

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required