How AIClaw Keeps Agent Plans Out of Chat History with Runtime Plan State

Source: dev.to · Published: 2026-06-13T16:35Z · Topic: ai-agents

AIClaw's Runtime Plan State feature keeps agent plans out of chat history by treating them as runtime state owned by the executor. The system validates, persists, and streams the plan separately from the assistant message, preventing the final answer from being buried under noisy TODO updates. This design enforces a plan lifecycle and injects only a compact summary into each model call to conserve context budget.

Most agent products eventually hit the same UX problem: complex tasks need planning, but users do not want the final answer buried under noisy TODO updates.

AIClaw handles that with an existing core feature called Runtime Plan State. Instead of storing planning as ordinary assistant text, AIClaw treats the plan as runtime state owned by the executor. The model can propose or revise a plan, but the harness validates it, persists it, streams it live, and links the final snapshot to the assistant message after execution finishes.

This post is not announcing a brand-new feature. It is a deeper look at how AIClaw already implements planning in a way that stays useful during execution without polluting the conversation itself.

If an agent writes plans directly into chat, several issues show up quickly:

AIClaw's approach is to separate these concerns:

The repository README describes this directly: AIClaw uses Plan State instead of chat-visible TODO blocks, and streaming chat plus execution logs show the plan separately from the assistant answer.
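As a rough illustration of that separation, the sketch below models a stream that carries two kinds of events and folds them into separate places: message text goes into the answer, plan snapshots go into a side field. The names Event, EventKind, Turn, and Collect are hypothetical and are not taken from the AIClaw codebase; the real wire format may look quite different.

```go
package main

import (
	"fmt"
	"strings"
)

// EventKind is a hypothetical discriminator for stream events.
type EventKind string

const (
	EventMessageDelta EventKind = "message_delta" // a chunk of assistant text
	EventPlanSnapshot EventKind = "plan_snapshot" // the current plan, sent separately
)

// Event is one item on the (assumed) stream.
type Event struct {
	Kind EventKind
	Text string   // set for message deltas
	Plan []string // set for plan snapshots
}

// Turn is what gets stored once execution finishes: the answer text
// and a link to the last plan snapshot, kept in separate fields.
type Turn struct {
	Answer    string
	FinalPlan []string
}

// Collect folds a stream into a Turn. Plan snapshots never touch Answer.
func Collect(events []Event) Turn {
	var b strings.Builder
	var t Turn
	for _, ev := range events {
		switch ev.Kind {
		case EventMessageDelta:
			b.WriteString(ev.Text)
		case EventPlanSnapshot:
			t.FinalPlan = ev.Plan // keep only the latest snapshot
		}
	}
	t.Answer = b.String()
	return t
}

func main() {
	turn := Collect([]Event{
		{Kind: EventPlanSnapshot, Plan: []string{"read files: running", "run tests: pending"}},
		{Kind: EventMessageDelta, Text: "The fix is "},
		{Kind: EventPlanSnapshot, Plan: []string{"read files: completed", "run tests: completed"}},
		{Kind: EventMessageDelta, Text: "in place."},
	})
	fmt.Println(turn.Answer)
	fmt.Println(turn.FinalPlan)
}
```

The point of the sketch is only that plan updates and answer text travel side by side but are never concatenated, so the stored answer stays free of TODO noise.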

At a high level, AIClaw's execution loop does this:

The plan has a small lifecycle instead of being treated like free-form prose:

pending -> running -> completed
                  -> failed
                  -> blocked
pending -> skipped

That lifecycle matters because the harness can enforce behavior the model should not be trusted to enforce by itself.
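A lifecycle like the one drawn above can be enforced with a small transition table. The following is a minimal sketch, not AIClaw's actual code: the Status type, the allowed map, and CanTransition are hypothetical names, and treating completed, failed, blocked, and skipped as terminal is an assumption read off the diagram (the real harness may allow retries or unblocking).

```go
package main

import "fmt"

// Status is a hypothetical name for a plan item's lifecycle state.
type Status string

const (
	Pending   Status = "pending"
	Running   Status = "running"
	Completed Status = "completed"
	Failed    Status = "failed"
	Blocked   Status = "blocked"
	Skipped   Status = "skipped"
)

// allowed lists only the transitions shown in the lifecycle diagram.
// States without an entry are treated as terminal (an assumption).
var allowed = map[Status][]Status{
	Pending: {Running, Skipped},
	Running: {Completed, Failed, Blocked},
}

// CanTransition reports whether moving from one status to another
// is part of the lifecycle.
func CanTransition(from, to Status) bool {
	for _, next := range allowed[from] {
		if next == to {
			return true
		}
	}
	return false
}

func main() {
	fmt.Println(CanTransition(Pending, Running))   // true
	fmt.Println(CanTransition(Pending, Completed)) // false
	fmt.Println(CanTransition(Completed, Running)) // false
}
```

With a table like this, a model-proposed update such as pending -> completed can be rejected by the harness instead of being trusted as written.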

In internal/agent/plan.go, the internal plan control tool supports actions such as set, update, revise, and read. But the important part is not the tool surface. The important part is ownership: the model proposes or revises, while PlanManager normalizes and validates state.

That split keeps the model flexible without giving it full control over task state.
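To make the ownership split concrete, here is a minimal sketch of what harness-side normalization of a model-proposed plan could look like. Item, validStatus, and Normalize are hypothetical names; the article only says that PlanManager normalizes and validates state. The specific rules (trimming titles, defaulting an empty status to pending, and demoting extra running items to pending) are assumptions chosen for illustration.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// Item is a hypothetical plan item as proposed by the model.
type Item struct {
	Title  string
	Status string
}

var validStatus = map[string]bool{
	"pending": true, "running": true, "completed": true,
	"failed": true, "blocked": true, "skipped": true,
}

// Normalize validates a proposed plan and returns a cleaned copy.
// It keeps the first running item and demotes any later ones to
// pending (an assumed policy for "a single running item").
func Normalize(items []Item) ([]Item, error) {
	if len(items) == 0 {
		return nil, errors.New("plan has no items")
	}
	out := make([]Item, 0, len(items))
	seenRunning := false
	for _, it := range items {
		it.Title = strings.TrimSpace(it.Title)
		if it.Title == "" {
			return nil, errors.New("plan item has an empty title")
		}
		if it.Status == "" {
			it.Status = "pending"
		}
		if !validStatus[it.Status] {
			return nil, fmt.Errorf("unknown status %q", it.Status)
		}
		if it.Status == "running" {
			if seenRunning {
				it.Status = "pending"
			}
			seenRunning = true
		}
		out = append(out, it)
	}
	return out, nil
}

func main() {
	plan, err := Normalize([]Item{
		{Title: "read config", Status: "running"},
		{Title: "run tests", Status: "running"},
		{Title: "write summary"},
	})
	if err != nil {
		fmt.Println("rejected:", err)
		return
	}
	for _, it := range plan {
		fmt.Printf("%-14s %s\n", it.Title, it.Status)
	}
}
```

The model's input is treated as a proposal: it is either cleaned up or rejected with an error, and only the normalized copy becomes plan state.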

For example, AIClaw enforces that only one plan item can be running at a time. The tests in internal/agent/plan_test.go explicitly verify that if multiple items are proposed as running, the plan is normalized back to a single running item.

One subtle but important implementation detail is that AIClaw does not inject the full plan history into every model call.

The PromptBlock path in internal/agent/plan.go builds a compact <plan_state> block.

The design notes in docs/design/agent-improvements.md call this out clearly: only the goal, current running item, remaining pending summary, and recent revision reason are injected each round so the full history does not consume context budget.

This is a practical design choice. Planning helps the model stay oriented, but dumping the whole plan transcript back into the prompt every round would work against that goal.
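The compact summary described above could be built along these lines. This is a sketch under assumptions: the Plan and Item types, the compactPlanBlock function, and the field labels inside the block are all hypothetical, since the article names the <plan_state> tag and the four ingredients but not the exact layout.

```go
package main

import (
	"fmt"
	"strings"
)

// Item and Plan are hypothetical shapes for runtime plan state.
type Item struct {
	Title  string
	Status string
}

type Plan struct {
	Goal               string
	Items              []Item
	LastRevisionReason string
}

// compactPlanBlock renders only the goal, the running item, a count of
// pending items, and the latest revision reason. Completed, failed, and
// skipped items are deliberately left out to save context budget.
func compactPlanBlock(p Plan) string {
	running := ""
	pending := 0
	for _, it := range p.Items {
		switch it.Status {
		case "running":
			if running == "" {
				running = it.Title
			}
		case "pending":
			pending++
		}
	}

	var b strings.Builder
	b.WriteString("<plan_state>\n")
	fmt.Fprintf(&b, "goal: %s\n", p.Goal)
	if running != "" {
		fmt.Fprintf(&b, "running: %s\n", running)
	}
	fmt.Fprintf(&b, "pending: %d remaining\n", pending)
	if p.LastRevisionReason != "" {
		fmt.Fprintf(&b, "last_revision: %s\n", p.LastRevisionReason)
	}
	b.WriteString("</plan_state>")
	return b.String()
}

func main() {
	fmt.Println(compactPlanBlock(Plan{
		Goal: "fix the failing build",
		Items: []Item{
			{Title: "read build log", Status: "completed"},
			{Title: "patch config", Status: "running"},
			{Title: "run tests", Status: "pending"},
			{Title: "write summary", Status: "pending"},
		},
		LastRevisionReason: "log pointed at config, not code",
	}))
}
```

The block grows with the number of distinct fields, not with the number of steps already finished, which is what keeps the per-round cost roughly constant.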

The main run loop in internal/agent/run.go refreshes plan state before each model call. When tool or LLM work fails, the harness can mark the current step as failed. When a step succeeds, the harness can complete it and advance to the next pending one.

That behavior is also covered by tests.

This is the difference between "the model wrote a checklist" and "the system is actually operating a task state machine."
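The step handling described above, where the harness rather than the model settles the current step, might be sketched as follows. Item, errStepFailed, and settle are hypothetical names, and the policy of not starting the next step after a failure is an assumption; the article does not say what the run loop in internal/agent/run.go does next in that case.

```go
package main

import (
	"errors"
	"fmt"
)

// Item is a hypothetical plan item.
type Item struct {
	Title  string
	Status string
}

// errStepFailed stands in for any tool or LLM error.
var errStepFailed = errors.New("step failed")

// settle is called by the harness after a step finishes.
// On error it marks the running item failed and stops.
// On success it completes the running item and starts the
// first pending one, so at most one item is running.
func settle(items []Item, stepErr error) {
	cur := -1
	for i := range items {
		if items[i].Status == "running" {
			cur = i
			break
		}
	}
	if cur == -1 {
		return // nothing is running, nothing to settle
	}
	if stepErr != nil {
		items[cur].Status = "failed"
		return
	}
	items[cur].Status = "completed"
	for i := range items {
		if items[i].Status == "pending" {
			items[i].Status = "running"
			return
		}
	}
}

func main() {
	items := []Item{
		{Title: "read files", Status: "running"},
		{Title: "run tests", Status: "pending"},
		{Title: "write summary", Status: "pending"},
	}
	settle(items, nil)           // first step succeeded
	settle(items, errStepFailed) // second step failed
	for _, it := range items {
		fmt.Printf("%-14s %s\n", it.Title, it.Status)
	}
}
```

Because the transitions happen in harness code, the plan stays consistent even if the model forgets to update it after a tool call.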

From the product side, Runtime Plan State gives AIClaw a cleaner split between response and observability:

That matters for real tool-using agents. If an agent reads files, runs commands, searches the web, or delegates to sub-agents, users need to inspect progress and failures without turning the final answer into a debug trace.

Imagine an AIClaw agent is asked to:

With Runtime Plan State, the plan can exist as structured execution state while the tool timeline records the underlying work. If the test step fails, AIClaw can mark that step as failed and continue the state transition logic cleanly. If the work completes, the final answer can stay focused on outcome, not internal bookkeeping.

That is a better fit for production-style agent work than chat-visible TODO spam.

AIClaw's design makes a strong distinction between the assistant's answer, the plan state, and the tool timeline.

Those should not all be the same thing.

A lot of agent systems blur the line between them. AIClaw's Runtime Plan State is interesting precisely because it does not.

If you are building self-hosted agents and want both cleaner chat UX and better execution observability, this is one of the AIClaw features worth studying in the codebase.

AIClaw is open source here: github.com/chowyu12/aiclaw

Read original on dev.to → dev.to/chowyu12/how-aiclaw-keeps-agent-plans-out…
