Thinking Past the Answer: Evaluating Harmful Overthinking in Large Reasoning Models

[ARTICLE · art-19893] src=arxiv.org pub=2026-06-03T04:00Z topic=large-language-models verified=true sentiment=· neutral

A new study on Large Reasoning Models (LRMs) has found that continuing to reason after reaching a correct answer can cause the model to deviate from that answer, a phenomenon termed "harmful overthinking." Researchers introduced a prefix-level evaluation protocol to distinguish harmless verbose reasoning from harmful overthinking, and discovered that stopping at the first correct prefix improved accuracy by up to 21% across multimodal and language-only benchmarks. The findings challenge the assumption that longer reasoning is always beneficial, revealing that current models are limited not only by their reasoning ability but also by their inability to stop at the right time.

1 min read · published Jun 3, 2026

arXiv:2606.02835v1. Abstract: Large Reasoning Models (LRMs) improve performance by spending more test-time compute on explicit intermediate reasoning traces, yet the assumption that longer reasoning is consistently beneficial remains under-examined. While recent evidence shows that additional reasoning can lead models to overthink, we ask: "Once a model has reached the correct answer, does further reasoning refine the solution, or deviate from it?" To study the dynamics after correctness, we introduce a prefix-level trajectory evaluation protocol grounded in reasoning sufficiency, defined as the minimum reasoning budget a model needs to first generate the correct answer. This allows us to disentangle verbose overthinking, where additional reasoning is redundant but harmless, from harmful overthinking, where continued reasoning destabilizes an already-correct trajectory. Starting from multimodal benchmarks, we find that many instances considered reasoning-intensive require surprisingly little reasoning. Moreover, stopping at the first correct prefix improves accuracy over standard reasoning by up to 21%, revealing that current models are limited not only by their ability to reason, but also by their inability to stop at the right time. Furthermore, while common efficiency strategies like early stopping substantially reduce verbose overthinking (by up to 50%), they fail to mitigate harmful overthinking. Failure analysis reveals that correctness deviations are mainly driven by logical drift and visual reinterpretation. Finally, we show that our findings generalize to language-only reasoning benchmarks, highlighting harmful overthinking as a broader reliability risk. Code available at https://simonecaldarella.github.io/thinking-past-the-answer.
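The prefix-level idea in the abstract can be illustrated with a small sketch. It is not the authors' code: it assumes each reasoning trace has already been cut into prefixes and that the answer produced after each prefix has been graded, giving one boolean per prefix. The names `label_trajectory`, `standard_accuracy`, and `first_correct_accuracy`, and the category strings, are hypothetical and chosen here for illustration only.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class TrajectoryLabel:
    sufficiency: Optional[int]  # index of the first correct prefix, None if never correct
    final_correct: bool         # whether the full-length reasoning ends on the correct answer
    category: str               # illustrative label, not the paper's terminology verbatim


def label_trajectory(prefix_correct: Sequence[bool]) -> TrajectoryLabel:
    """Label one trajectory from per-prefix correctness flags (assumed input format)."""
    if not prefix_correct:
        raise ValueError("a trajectory needs at least one prefix")
    final_correct = bool(prefix_correct[-1])
    # Reasoning sufficiency: the earliest prefix at which the answer is correct.
    first = next((i for i, ok in enumerate(prefix_correct) if ok), None)
    if first is None:
        category = "never_correct"
    elif not final_correct:
        # Correct at some point, wrong at the end: continued reasoning hurt.
        category = "harmful_overthinking"
    elif first < len(prefix_correct) - 1:
        # Correct early and still correct at the end: extra reasoning was redundant.
        category = "verbose_overthinking"
    else:
        category = "no_overthinking"
    return TrajectoryLabel(first, final_correct, category)


def standard_accuracy(trajectories: Sequence[Sequence[bool]]) -> float:
    """Accuracy when the model reasons to the end of every trajectory."""
    return sum(bool(t[-1]) for t in trajectories) / len(trajectories)


def first_correct_accuracy(trajectories: Sequence[Sequence[bool]]) -> float:
    """Accuracy if generation stopped at the first correct prefix (an oracle bound)."""
    return sum(any(t) for t in trajectories) / len(trajectories)


if __name__ == "__main__":
    toy = [
        [False, True, True],   # verbose overthinking
        [False, True, False],  # harmful overthinking
        [False, False],        # never correct
        [False, True],         # stops right at the first correct prefix
    ]
    for t in toy:
        print(label_trajectory(t))
    print(standard_accuracy(toy), first_correct_accuracy(toy))
```

The gap between the two accuracy functions is the kind of quantity the abstract reports as "up to 21%". How the paper treats trajectories that flip away from the correct answer and later recover is not stated in the abstract, so this sketch simply counts them as verbose.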

source & further reading

arxiv.org — original article
