{"slug": "agent-frameworks-create-workflows-production-needs-run-receipts", "title": "Agent frameworks create workflows. Production needs run receipts.", "summary": "A developer building Armorer argues that agent frameworks like LangGraph, CrewAI, and AutoGen create workflows but fail to provide operational visibility. Armorer is a local control plane that generates run receipts capturing tool inventory, side effects, approvals, and recovery state. The project aims to make agents operable by providing session logs, approval history, and safe rollback capabilities.", "body_md": "Everyone is comparing agent frameworks: LangGraph, CrewAI, AutoGen, OpenAI Agents SDK, Claude Code, Codex, MCP routers, custom harnesses.\n\nThat comparison matters, but it misses the layer that starts hurting once the demo works.\n\nThe framework creates the workflow. It does not automatically answer:\n\n- what is installed and running locally?\n- which tools, MCP servers, skills, and providers are mounted?\n- what repo, files, or workspace state were in scope?\n- what did the agent change?\n- which actions created side effects?\n- which actions required approval, warning, redaction, block, or review?\n- what evidence came from tests, evals, traces, or browser checks?\n- what can be retried, resumed, rolled back, or cleaned up safely?\n\nThat is the layer we are building Armorer for: a local control plane around agents.\n\nThe split we are converging on:\n\n- **Armorer**: sessions, jobs, tool inventory, config, approvals, run records, and recovery\n- **Armorer Guard**: fast runtime decisions on proposed tool calls and model/tool-output transitions\n\nThe goal is not to replace agent frameworks. 
It is to make agents operable once they exist.\n\nThe artifact I keep coming back to is a **run receipt**.\n\nA useful agent run receipt should capture:\n\n- the agent/app, version, and config\n- the mounted tools, MCP servers, skills, and providers\n- the workspace/repo/files in scope\n- checkpoints before and after the run\n- tool calls and side effects\n- approval and review decisions\n- test/eval/check evidence\n- retry, resume, rollback, and cleanup state\n\nWithout this, debugging agent runs turns into transcript archaeology.\n\nWith it, operating agents starts to feel more like operating software again.\n\nRepos:\n\nQuestions I would love feedback on:\n\n- What is the minimum useful run receipt for an agent session?\n- Which approval events should become first-class history?\n- Where should MCP/tool metadata stop and runtime policy begin?\n- What recovery action do you wish your agent harness exposed after a bad run?", "url": "https://wpnews.pro/news/agent-frameworks-create-workflows-production-needs-run-receipts", "canonical_source": "https://dev.to/armorer_labs/agent-frameworks-create-workflows-production-needs-run-receipts-5eb6", "published_at": "2026-06-13 18:01:30+00:00", "updated_at": "2026-06-13 18:14:44.949997+00:00", "lang": "en", "topics": ["ai-agents", "developer-tools", "ai-infrastructure", "ai-tools", "mlops"], "entities": ["Armorer", "LangGraph", "CrewAI", "AutoGen", "OpenAI Agents SDK", "Claude Code", "Codex", "MCP"], "alternates": {"html": "https://wpnews.pro/news/agent-frameworks-create-workflows-production-needs-run-receipts", "markdown": "https://wpnews.pro/news/agent-frameworks-create-workflows-production-needs-run-receipts.md", "text": "https://wpnews.pro/news/agent-frameworks-create-workflows-production-needs-run-receipts.txt", "jsonld": "https://wpnews.pro/news/agent-frameworks-create-workflows-production-needs-run-receipts.jsonld"}}