04:00
2026-05-29
arxiv.org
large-language-models
Thoughts-as-Planning: Latent World Models for Chain-of-Thoughts Optimization via Reinforcement Planning
Researchers have introduced Thoughts-as-Planning, a framework that formalizes reasoning-chain optimization in large language models as a sequential decision-making process over a latent semantic space…