{"slug": "narration-of-thought-inference-time-scaffolding-for-defeasible-ethical-reasoning", "title": "Narration-of-Thought: Inference-Time Scaffolding for Defeasible Ethical Reasoning in Large Language Models", "summary": "Researchers introduced narration-of-thought (NoT), a system prompt that structures chain-of-thought reasoning into five sections to improve ethical reasoning in large language models. On 100 DailyDilemmas scenarios, NoT reduced stakeholder collapse from up to 31% to under 1% and uncertainty suppression from up to 72% to 1-24% across four models. The method requires no additional training and provides an auditable reasoning trace for dependable AI deployment.", "body_md": "arXiv:2606.26366v1 Announce Type: new\nAbstract: Standard chain-of-thought on moral dilemmas exhibits two failure modes: stakeholder collapse (the trace names at most one party with a stake in the outcome) and uncertainty suppression (no explicit unknowns or hedges before committing to an action). We introduce narration-of-thought (NoT), a system prompt that structures chain-of-thought into five sections: protagonist, stakeholders, two-step consequences, uncertainty, then commitment. NoT adds no training, parameters, or fine-tuning. On 100 DailyDilemmas scenarios across four generators from three vendors, NoT cuts stakeholder collapse from up to 31% to under 1% and uncertainty suppression from up to 72% to 1-24% on every model. A matched-budget verbose-CoT control rules out token spend as the active ingredient; NoT retains Cliff's delta advantages of +0.79 to +0.90 on stakeholder count and +0.65 to +0.93 on uncertainty score for three of four generators, and a section ablation attributes each shift to its specific sub-instruction. Textual-gradient descent initialised at NoT improves the scaffold further; a cross-family training judge (different vendor from the generator) dominates an in-family one on every measured axis. Extended to a five-round multi-stakeholder debate protocol, the scaffold converts a 6% standoff into 95% full consensus on a calibration set and 100% combined convergence on a DailyDilemmas replication. The resulting traces externalise the stakeholders, consequences, and uncertainty grounding each commitment, providing an auditable substrate for dependable agentic deployment.", "url": "https://wpnews.pro/news/narration-of-thought-inference-time-scaffolding-for-defeasible-ethical-reasoning", "canonical_source": "https://arxiv.org/abs/2606.26366", "published_at": "2026-06-26 04:00:00+00:00", "updated_at": "2026-06-26 04:19:36.400325+00:00", "lang": "en", "topics": ["large-language-models", "ai-safety", "ai-ethics", "natural-language-processing", "ai-research"], "entities": ["arXiv", "DailyDilemmas", "Narration-of-Thought", "Cliff's delta"], "alternates": {"html": "https://wpnews.pro/news/narration-of-thought-inference-time-scaffolding-for-defeasible-ethical-reasoning", "markdown": "https://wpnews.pro/news/narration-of-thought-inference-time-scaffolding-for-defeasible-ethical-reasoning.md", "text": "https://wpnews.pro/news/narration-of-thought-inference-time-scaffolding-for-defeasible-ethical-reasoning.txt", "jsonld": "https://wpnews.pro/news/narration-of-thought-inference-time-scaffolding-for-defeasible-ethical-reasoning.jsonld"}}