cd /news/large-language-models/grounded-iterative-language-planning… · home topics large-language-models article
[ARTICLE · art-42940] src=arxiv.org ↗ pub= topic=large-language-models verified=true sentiment=↑ positive

Grounded Iterative Language Planning: How Parameterized World Models Reduce Hallucination Propagation in LLM Agents

Researchers introduced Grounded Iterative Language Planning (GILP), a hybrid method combining a small parameterized world model with LLM-based agent reasoning, reducing hallucinated-state rates from 0.176 to 0.035 on graph-structured planning benchmarks and boosting success rates from 0.668 to 0.838 with only 22% extra LLM calls.

read1 min views1 publishedJun 29, 2026

arXiv:2606.27806v1 Announce Type: new Abstract: World models for language agents come in two useful forms. An agent-based world model calls an LLM API and reasons flexibly in language, but its errors appear as hallucinated state changes that are hard to score with ordinary regression losses. A parameterized world model is a trained transition predictor; its errors are easier to measure with quantities such as NodeMSE, delta accuracy, and validity accuracy, but it is usually weaker as a standalone planner. We compare these two families on four graph-structured planning benchmarks and introduce operational hallucination metrics for the agent-based case. The comparison motivates \textbf{Grounded Iterative Language Planning} (GILP), which trains only a small parameterized backbone and combines it with API-based agent reasoning. The backbone supplies valid actions, predicted state deltas, risk, and value; the LLM drafts an action and imagined delta; and a consistency gate asks for revision when the two disagree. On real GPT-4o-mini calls, GILP reduces hallucinated-state rate from 0.176 to 0.035. In calibrated simulator ablations, it raises success from 0.668 to 0.838 while adding only ~22% extra LLM calls.

── more in #large-language-models 4 stories · sorted by recency
── more on @gpt-4o-mini 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/grounded-iterative-l…] indexed:0 read:1min 2026-06-29 ·