Agent frameworks create workflows. Production needs run receipts.

wpnews.pro

cd /news/ai-agents/agent-frameworks-create-workflows-pr… · home › topics › ai-agents › article

[ARTICLE · art-26373] src=dev.to ↗ pub=2026-06-13T18:01Z topic=ai-agents verified=true sentiment=· neutral

Agent frameworks create workflows. Production needs run receipts.

A developer building Armorer argues that agent frameworks like LangGraph, CrewAI, and AutoGen create workflows but fail to provide operational visibility. Armorer is a local control plane that generates run receipts capturing tool inventory, side effects, approvals, and recovery state. The project aims to make agents operable by providing session logs, approval history, and safe rollback capabilities.

read2 min views19 publishedJun 13, 2026

Everyone is comparing agent frameworks: LangGraph, CrewAI, AutoGen, OpenAI Agents SDK, Claude Code, Codex, MCP routers, custom harnesses.

That comparison matters, but it misses the layer that starts hurting once the demo works.

The framework creates the workflow. It does not automatically answer:

what is installed and running locally?
which tools, MCP servers, skills, and providers are mounted?
what repo, files, or workspace state were in scope?
what did the agent change?
which actions created side effects?
which actions required approval, warning, redaction, block, or review?
what evidence came from tests, evals, traces, or browser checks?
what can be retried, resumed, rolled back, or cleaned up safely?

That is the layer we are building Armorer for: a local control plane around agents.

The split we are converging on:

Armorer: sessions, jobs, tool inventory, config, approvals, run records, and recovery #

Armorer Guard: fast runtime decisions on proposed tool calls and model/tool-output transitions

The goal is not to replace agent frameworks. It is to make agents operable once they exist.

The artifact I keep coming back to is a run receipt.

A useful agent run receipt should capture:

the agent/app, version, and config
the mounted tools, MCP servers, skills, and providers
the workspace/repo/files in scope
checkpoints before and after the run
tool calls and side effects
approval and review decisions
test/eval/check evidence
retry, resume, rollback, and cleanup state

Without this, debugging agent runs turns into transcript archaeology.

With it, operating agents starts to feel more like operating software again.

Repos:

Questions I would love feedback on:

What is the minimum useful run receipt for an agent session?
Which approval events should become first-class history?
Where should MCP/tool metadata stop and runtime policy begin?
What recovery action do you wish your agent harness exposed after a bad run?

source & further reading

dev.to — original article Block AI Crawlers: The 15 Bots That Matter AI Worms in Word: How Document-Borne Threats Self-Propagate AI Weekly: Opus 5 Lands, MCP Goes Stateless, and AMD Ships Helios

~/api · this article 200

$curl api.wpnews.pro/v1/news/agent-frameworks-create-…

Read original on dev.to → dev.to/armorer_labs/agent-frameworks-create-work…

mentioned entities

Armorer

LangGraph

CrewAI

AutoGen

OpenAI Agents SDK

Claude Code

Codex

MCP

metadata

slugagent-frameworks-create-workflows-production-needs-run-receipts

topic#ai-agents

secondary4 topics

sentimentneutral

canonicaldev.to

navigation

← prevApple Wrote a $1 Billion Check t…

next →Ask HN: Did we witness the "Trin…

── more in #ai-agents 4 stories · sorted by recency

byteiota.com · 29 Jul · #ai-agents

Microsoft Agent Framework Harness: What It Actually Does

agentsafelabs.com · 29 Jul · #ai-agents

Framework Doesn’t Matter (Much). Model Does.

promptcube3.com · 28 Jul · #ai-agents

Which AI Agent Framework Actually Works for Production?

dev.to · 29 Jul · #ai-agents

What AI agents actually pay for — six weeks of data from 101 pay-per-call endpoints

── more on @armorer 3 stories trending now

wpnews · 28 Jul · #large-language-models

How to Download and Run Kimi K3 Open Weights

wpnews · 16 Jul · #artificial-intelligence

Women entrepreneurs are less likely to leverage AI—but more likely to benefit from it

wpnews · 28 Jul · #artificial-intelligence

How Claude Code and VS Code turned Anthropic from a safety lab into a developer phenomenon

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required