cd /news/ai-agents/agent-evidence-levels-ael-grading-th… · home topics ai-agents article
[ARTICLE · art-47564] src=dev.to ↗ pub= topic=ai-agents verified=true sentiment=· neutral

Agent Evidence Levels (AEL): grading the evidence your AI agent leaves behind

A developer building an agent firewall created Agent Evidence Levels (AEL), an open standard that grades how much of an AI agent's recorded actions an outside party can verify without trusting the vendor or operator. The standard includes levels AEL-0 through AEL-4, a reference checker, and a conformance corpus, all open-sourced on GitHub. The developer aims to donate the standard to a neutral home once it gains traction.

read2 min views1 publishedJul 4, 2026

I build an agent firewall, and the question I keep hitting is not "did it block the attack." It is "how would anyone else know what my agent did, without taking my word for it." Most tools answer that with "we keep tamper-proof logs" and stop. That phrase claims the strongest property that still requires trusting whoever holds the signing key. So I wrote down a way to grade the gap, as an open standard, and shipped it with a checker so nobody has to trust me about it either.

Agent Evidence Levels (AEL) grades a record of what an AI agent did by one question: how much of it can an outside party verify, and how much omission can they detect, without trusting the vendor or the operator? It runs AEL-0 through AEL-4, and it ships with a runnable reference checker and a conformance corpus, so a grade is something you demonstrate, not something you assert.

A grade is the minimum across the required dimensions, cumulative from AEL-0. There is a reproducibility suffix, R, for when the recorded decision can be re-derived from the recorded inputs.

No level proves completeness against the party holding the signing keys. A keyholder can construct a clean history, sign every part of it, and pass every internal check. Omission-evidence is bought only with additional signed evidence, one verified keyholder at a time, and organizational independence stays declared unless it is established outside AEL. Each level states plainly the limit it does not cover. That honesty is the point of the scale.

The spec, the reference checker, and the conformance corpus are public and open-source. It is authored under my company and meant to be donated to a neutral home once the vocabulary has a life of its own. I would rather find the holes now than defend them later, so if a level claims more than the checker proves, open an issue and show me.

github.com/luckyPipewrench/agent-evidence-levels Run the checker on your own agent's evidence, or on a vendor's, and read the grade for yourself.

── more in #ai-agents 4 stories · sorted by recency
── more on @agent evidence levels 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/agent-evidence-level…] indexed:0 read:2min 2026-07-04 ·