Towards Reliable and Robust LLM Planning: Symbolic Feedback-Driven Iterative Self-Refinement Framework

wpnews.pro

cd /news/large-language-models/towards-reliable-and-robust-llm-plan… · home › topics › large-language-models › article

[ARTICLE · art-42939] src=arxiv.org ↗ pub=2026-06-29T04:00Z topic=large-language-models verified=true sentiment=↑ positive

Towards Reliable and Robust LLM Planning: Symbolic Feedback-Driven Iterative Self-Refinement Framework

Researchers propose a symbolic feedback-driven iterative self-refinement framework to improve the robustness and reliability of large language models in long-horizon planning tasks. The framework uses natural language prompting, a symbolic verifier, and a plan recognizer to enhance feasibility and correctness, demonstrating consistent improvements in empirical results.

read1 min views1 publishedJun 29, 2026

arXiv:2606.27757v1 Announce Type: new Abstract: Large language models (LLMs) have attracted widespread attention from academia and industry, yet their deployment raises critical security concerns regarding robustness and reliability. Planning, a core component of intelligent behavior, remains challenging for LLMs, which often produce infeasible or incorrect solutions in long-horizon decision-making tasks due to inherent complexity. In this paper, we propose a symbolic feedback-driven iterative self-refinement framework to enhance the robustness and reliability of LLMs in long-horizon planning. Specifically, a natural language prompting mechanism is introduced to map logical symbols into natural language descriptions, enabling LLMs to better capture task constraints and semantics. We further design a symbolic verifier that identifies errors and converts them into corrective instructions interpretable by the LLM, thereby guiding self-refinement. In addition, we leverage a plan recognizer to infer goal reachability, facilitating more effective guidance toward desired goals. Empirical results demonstrate that the proposed framework consistently improves both feasibility and correctness in long-horizon planning tasks. This highlights its effectiveness in enhancing the reliability of LLM-based planning and potential to enable more trustworthy AI systems.

source & further reading

arxiv.org — original article

── more in #large-language-models 4 stories · sorted by recency

arxiv.org · 29 Jun · #large-language-models

Grounded Iterative Language Planning: How Parameterized World Models Reduce Hallucination Propagation in LLM Agents

arxiv.org · 29 Jun · #large-language-models

Position: The Term "Machine Unlearning" Is Overused in LLMs

arxiv.org · 29 Jun · #large-language-models

Ko-WideSearch: A Korean Breadth-Search Benchmark for Exhaustive Set Enumeration by Web Agents

arxiv.org · 29 Jun · #large-language-models

ATOD: Annealed Turn-aware On-policy Distillation for Multi-turn Autonomous Agents

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required