ReasoningFlow: Discourse Structures for Understanding LLM Reasoning Traces

wpnews.pro

Source: arxiv.org · Published: 2026-06-05T04:00Z · Topic: large-language-models

Researchers have developed ReasoningFlow, a framework that maps the non-linear discourse structures of large reasoning model (LRM) traces into directed acyclic graphs (DAGs) to support evaluation and monitoring. After manually annotating 31 traces and then scaling to automatic annotation of 1,260 traces across three tasks and five models, the team reports that LRMs produce structurally similar traces, that most erroneous steps are not used to derive final answers, and that mechanistic causal dependencies between steps do not reflect the language-level discourse structure. The graphs also reveal fine-grained behaviors such as local verification and self-reflection, which the authors say can be used to improve trace monitorability.

arXiv:2606.05402v1

Abstract: Large reasoning models (LRMs) produce reasoning traces with non-linear structures, such as backtracking and self-correction, that complicate the evaluation and monitoring of the reasoning process. We introduce ReasoningFlow, a framework that captures the discourse structures of LRM reasoning traces into fine-grained directed acyclic graphs (DAGs). We develop and validate our annotation schema through careful manual annotation of 31 traces (2.1k steps), achieving high inter-annotator agreement, then scale to automatic annotation of 1,260 traces (247.7k steps) spanning three tasks (math, science, argumentation) and five models (Qwen2.5-32B-Inst, QwQ-32B, DeepSeek-V3, DeepSeek-R1, GPT-oss-120B). By analyzing ReasoningFlow graphs, we find: (1) LRMs exhibit structurally similar traces, despite being trained from different base models and potentially non-overlapping post-training data. (2) ReasoningFlow reveals diverse fine-grained reasoning behaviors (e.g., local verification, self-reflection, and assumptions) that can be used for better reasoning trace monitorability. (3) In LRMs, most of the erroneous steps are not used to derive final answers. (4) Mechanistic causal dependencies between steps do not reflect the language-level discourse structure. We release the dataset and code at https://github.com/jinulee-v/reasoningflow.
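The third finding, that most erroneous steps are not used to derive final answers, can be read as a reachability question on the trace graph. The sketch below is an illustration only and is not taken from the ReasoningFlow code release: the step names, the toy edge list, the `erroneous` set, and the helper `ancestors_of` are all hypothetical, and it assumes a simplified reading in which an edge `(a, b)` means step `b` builds on step `a`. The paper's actual schema uses its own node and edge labels, which are not reproduced here.

```python
from collections import defaultdict


def ancestors_of(final_step, edges):
    """Return every step from which final_step is reachable in the DAG.

    edges is an iterable of (source, target) pairs; by assumption,
    target builds on source.
    """
    parents = defaultdict(set)
    for src, dst in edges:
        parents[dst].add(src)

    seen = set()
    stack = [final_step]
    while stack:
        node = stack.pop()
        for parent in parents[node]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


# Hypothetical toy trace: the model tries one approach (s2), backtracks,
# and reaches the final answer (s4) through a second approach (s3).
edges = [
    ("s1", "s2"),  # s2: first attempt, built on the problem restatement s1
    ("s1", "s3"),  # s3: second attempt after backtracking, also built on s1
    ("s3", "s4"),  # s4: final answer, derived from s3 only
]
erroneous = {"s2"}  # assumed error labels for this toy example

used = ancestors_of("s4", edges)
unused_errors = erroneous - used
print(sorted(used))           # steps the final answer depends on
print(sorted(unused_errors))  # erroneous steps that never feed the answer
```

In this toy graph the erroneous step `s2` has no path to `s4`, so it counts as an error that does not contribute to the final answer. Under the same assumptions, the fraction `len(unused_errors) / len(erroneous)` would give a per-trace version of the statistic the abstract summarizes.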

source & further reading

Read original on arxiv.org → arxiv.org/abs/2606.05402
