When Do LLMs Reason? A Dynamical Systems View via Entropy Phase Transitions

wpnews.pro

cd /news/large-language-models/when-do-llms-reason-a-dynamical-syst… · home › topics › large-language-models › article

[ARTICLE · art-13545] src=arxiv.org ↗ pub=2026-05-25T04:00Z topic=large-language-models verified=true sentiment=· neutral

When Do LLMs Reason? A Dynamical Systems View via Entropy Phase Transitions

Researchers have found that chain-of-thought reasoning in large language models is only beneficial when early-stage entropy dynamics show consistent reduction, according to a new study on arXiv. The team introduced EDRM, a lightweight routing framework that uses early decoding entropy to selectively apply reasoning, achieving up to 55% token reduction and 4.7% accuracy gains across 15 benchmarks. The findings challenge the default use of CoT reasoning, suggesting it should be invoked adaptively rather than universally.

read1 min views6 publishedMay 25, 2026

arXiv:2605.22873v1 Announce Type: new Abstract: Chain-of-thought (CoT) reasoning has become the default strategy for enhancing LLM capabilities, yet its application raises a fundamental question: when is explicit reasoning actually beneficial? Empirical evidence reveals a striking paradox: CoT often provides marginal or even negative gains on factual and open-ended tasks while multiplying token consumption. In this work, we show that LLM reasoning is not a static property of tasks or models, but a \emph{dynamic decoding state} that emerges during generation. Through systematic analysis, we find early-stage entropy dynamics provide a reliable signal of this state: tasks benefiting from CoT exhibit consistent entropy reduction, while others display unstable or increasing patterns. This behavior can be interpreted as a phase-transition-like shift from a high-entropy exploratory regime to a low-entropy structured reasoning regime. Based on these insights, we propose \textbf{EDRM} (Entropy Dynamics-based Reasoning Manifold), a lightweight and training-free routing framework that leverages early decoding entropy to adaptively select inference strategies. EDRM embeds entropy trajectories into a compact and interpretable manifold representation, enabling both zero-shot deployment and fine-grained instance-level adaptation. Across 15 benchmarks and 4 LLMs of varying scales and architectures, EDRM consistently outperforms static baselines. At the dataset level, EDRM achieves \textbf{41--55%} token reduction while improving accuracy with as few as 50 calibration samples. At the instance level, it further improves accuracy by up to \textbf{4.7%} while maintaining \textbf{27--45%} token savings. These results suggest that reasoning should be invoked selectively rather than by default, and demonstrate the effectiveness of entropy-driven decoding control for efficient and adaptive LLM inference.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/when-do-llms-reason-a-dy…

Read original on arxiv.org → arxiv.org/abs/2605.22873

mentioned entities

EDRM

CoT

metadata

slugwhen-do-llms-reason-a-dynamical-systems-view-via-entropy-phase-transitions

topic#large-language-models

secondary4 topics

sentimentneutral

canonicalarxiv.org

navigation

← prevThe Eternal Sloptember

next →Samsung memory workers call off …

── more in #large-language-models 4 stories · sorted by recency

machinebrief.com · 10 Jul · #large-language-models

Revamping Neural Topology: The Cost of Precision

404media.co · 10 Jul · #large-language-models

AI Fiction Is Easy to Detect Because It's Stupid and Bad, Research Finds

machinebrief.com · 10 Jul · #large-language-models

Why AI Struggles in Real-World Negotiations

machinebrief.com · 10 Jul · #large-language-models

Are Large Language Models Really Getting Smarter?

── more on @edrm 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 8 Jul · #artificial-intelligence

SpaceXAI unveils Grok 4.5 AI model ahead of July 2026 public release

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required