{"slug": "uncertainty-decomposition-for-clarification-seeking-in-llm-agents", "title": "Uncertainty Decomposition for Clarification Seeking in LLM Agents", "summary": "Researchers introduced a prompt-based uncertainty decomposition method that separates action confidence from request uncertainty, enabling LLM agents to proactively seek clarification when task specifications are ambiguous. Tested on new clarification-augmented benchmarks across five LLM backbones, the method improved clarification F1 by up to 73% over existing approaches, demonstrating generalizable gains.", "body_md": "arXiv:2606.19559v1 Announce Type: new\nAbstract: Recent position papers argue that the classical aleatoric/epistemic uncertainty framework is insufficient for interactive large language model (LLM) agents and call for underspecification-aware, decomposed, and communicable uncertainty representations that can unlock new agent capabilities such as proactive clarification seeking and shared mental-model building. Practical deployment constraints -- black-box APIs, interactive latency budgets, and the absence of labeled trajectories -- rule out logprob-based, multi-sampling, and training-based methods, leaving prompt-based estimation as the most viable family for surfacing such signals at deployment time. We answer this call with a simple prompt-based decomposition that separates action confidence from request uncertainty (u), enabling the agent to ask for clarification when the task specification is ambiguous. To evaluate it, we introduce two clarification-augmented benchmarks (WebShop-Clarification and ALFWorld-Clarification) in which 50% of tasks are deliberately underspecified, and systematically compare the proposed decomposition against ReAct+UE and Uncertainty-Aware Memory (UAM) across five LLM backbones (GPT-5.1, DeepSeek-v3.2-exp, GLM-4.7, Qwen3.5-35B, GPT-OSS-120B) on these variants together with the standard WebShop, ALFWorld, and REAL benchmarks for fault detection. Averaged across the five backbones, the proposed decomposition improves clarification F1 on ALFWorld-Clarification by 73% over ReAct+UE and by 36% over UAM, and leads clarification F1 on every backbone on WebShop-Clarification and on four of five backbones on ALFWorld-Clarification, indicating that the gains generalize beyond a single LLM.", "url": "https://wpnews.pro/news/uncertainty-decomposition-for-clarification-seeking-in-llm-agents", "canonical_source": "https://arxiv.org/abs/2606.19559", "published_at": "2026-06-19 04:00:00+00:00", "updated_at": "2026-06-19 04:03:23.566512+00:00", "lang": "en", "topics": ["large-language-models", "ai-agents", "ai-research", "natural-language-processing"], "entities": ["WebShop", "ALFWorld", "GPT-5.1", "DeepSeek-v3.2-exp", "GLM-4.7", "Qwen3.5-35B", "GPT-OSS-120B", "ReAct+UE"], "alternates": {"html": "https://wpnews.pro/news/uncertainty-decomposition-for-clarification-seeking-in-llm-agents", "markdown": "https://wpnews.pro/news/uncertainty-decomposition-for-clarification-seeking-in-llm-agents.md", "text": "https://wpnews.pro/news/uncertainty-decomposition-for-clarification-seeking-in-llm-agents.txt", "jsonld": "https://wpnews.pro/news/uncertainty-decomposition-for-clarification-seeking-in-llm-agents.jsonld"}}