
Stop When Further Reasoning Won't Help: Attention-State Adaptive Generation in Reasoning Models

Researchers propose ASAG, a training-free method that monitors attention distributions to detect when a reasoning model has reached a conclusion, stopping generation early. Applied to DeepSeek-R1-Distill and Qwen3 models, ASAG improves average accuracy by 3.2% while reducing generated tokens by nearly 40% on Qwen3-8B across nine benchmarks.

Published Jun 16, 2026 · 1 min read

arXiv:2606.15070v1

Abstract: By incorporating test-time compute scaling, large reasoning models (LRMs) can solve complex problems through explicit chain-of-thought (CoT) reasoning processes. However, they often suffer from overthinking, resulting in redundant token outputs and degraded accuracy. Current methods to mitigate this issue remain limited: training-based approaches require substantial computational resources, while training-free methods rely on well-crafted prompts or unreliable confidence signals. In this work, we investigate early stopping from the perspective of attention distributions and propose a simple method, ASAG, which infers the model's reasoning state and adaptively adjusts the generation strategy. The proposed framework is training-free and plug-and-play, enabling seamless integration into existing LRMs. Extensive experiments on nine benchmarks demonstrate consistent improvements across mainstream LRMs with varying parameter scales, including the DeepSeek-R1-Distill and Qwen3 series. Specifically, ASAG improves average accuracy by 3.2% while reducing the number of generated tokens by nearly 40% across all reasoning tasks on Qwen3-8B.
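The abstract does not say which attention statistic ASAG monitors or how it decides to stop, so the following is only a minimal sketch of the general idea of attention-based early stopping, not the authors' method. It assumes, purely for illustration, that a "concluded" reasoning state shows up as low-entropy (sharply peaked) attention for several consecutive decoding steps. The names `attention_entropy` and `stop_step`, and the `threshold` and `patience` parameters, are hypothetical.

```python
import math
from typing import Iterable, Optional, Sequence


def attention_entropy(weights: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a single attention distribution."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("attention weights must have positive mass")
    entropy = 0.0
    for w in weights:
        p = w / total  # normalize in case the row does not sum to 1
        if p > 0:
            entropy -= p * math.log(p)
    return entropy


def stop_step(
    attention_rows: Iterable[Sequence[float]],
    threshold: float = 0.5,  # assumed entropy cutoff, not from the paper
    patience: int = 3,       # assumed number of consecutive low-entropy steps
) -> Optional[int]:
    """Return the index of the first decoding step at which to stop, or None.

    Assumed rule: stop once attention entropy has stayed below `threshold`
    for `patience` consecutive steps.
    """
    streak = 0
    for step, row in enumerate(attention_rows):
        if attention_entropy(row) < threshold:
            streak += 1
        else:
            streak = 0  # diffuse attention resets the streak
        if streak >= patience:
            return step
    return None


if __name__ == "__main__":
    diffuse = [0.25, 0.25, 0.25, 0.25]  # attention spread evenly
    peaked = [0.97, 0.01, 0.01, 0.01]   # attention locked onto one token
    trace = [diffuse, diffuse, peaked, peaked, peaked, diffuse]
    print(stop_step(trace))
```

In a real setup the rows would come from the model's own attention maps at each decoding step, and the cutoff and patience would need tuning per model. The sketch only shows how a training-free, plug-in stopping rule of this kind could be wired around an existing generation loop.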
