Agent Evidence Levels (AEL): grading the evidence your AI agent leaves behind

wpnews.pro

cd /news/ai-agents/agent-evidence-levels-ael-grading-th… · home › topics › ai-agents › article

[ARTICLE · art-47564] src=dev.to ↗ pub=2026-07-04T02:15Z topic=ai-agents verified=true sentiment=· neutral

Agent Evidence Levels (AEL): grading the evidence your AI agent leaves behind

A developer building an agent firewall created Agent Evidence Levels (AEL), an open standard that grades how much of an AI agent's recorded actions an outside party can verify without trusting the vendor or operator. The standard includes levels AEL-0 through AEL-4, a reference checker, and a conformance corpus, all open-sourced on GitHub. The developer aims to donate the standard to a neutral home once it gains traction.

read2 min views1 publishedJul 4, 2026

I build an agent firewall, and the question I keep hitting is not "did it block the attack." It is "how would anyone else know what my agent did, without taking my word for it." Most tools answer that with "we keep tamper-proof logs" and stop. That phrase claims the strongest property that still requires trusting whoever holds the signing key. So I wrote down a way to grade the gap, as an open standard, and shipped it with a checker so nobody has to trust me about it either.

Agent Evidence Levels (AEL) grades a record of what an AI agent did by one question: how much of it can an outside party verify, and how much omission can they detect, without trusting the vendor or the operator? It runs AEL-0 through AEL-4, and it ships with a runnable reference checker and a conformance corpus, so a grade is something you demonstrate, not something you assert.

A grade is the minimum across the required dimensions, cumulative from AEL-0. There is a reproducibility suffix, R, for when the recorded decision can be re-derived from the recorded inputs.

No level proves completeness against the party holding the signing keys. A keyholder can construct a clean history, sign every part of it, and pass every internal check. Omission-evidence is bought only with additional signed evidence, one verified keyholder at a time, and organizational independence stays declared unless it is established outside AEL. Each level states plainly the limit it does not cover. That honesty is the point of the scale.

The spec, the reference checker, and the conformance corpus are public and open-source. It is authored under my company and meant to be donated to a neutral home once the vocabulary has a life of its own. I would rather find the holes now than defend them later, so if a level claims more than the checker proves, open an issue and show me.

github.com/luckyPipewrench/agent-evidence-levels Run the checker on your own agent's evidence, or on a vendor's, and read the grade for yourself.

source & further reading

dev.to — original article Workflow Series (06): Security — Cross-Step Injection Propagation and Four Defense Principles The Agent Can Drive. You Still Need to Know the Route. David Just Beat Goliath on Terminal-Bench 2.1

~/api · this article 200

$curl api.wpnews.pro/v1/news/agent-evidence-levels-ae…

Read original on dev.to → dev.to/luckypipewrench/agent-evidence-levels-ael…

mentioned entities

Agent Evidence Levels

AEL

GitHub

luckyPipewrench

metadata

slugagent-evidence-levels-ael-grading-the-evidence-your-ai-agent-leaves-behind

topic#ai-agents

secondary4 topics

sentimentneutral

canonicaldev.to

navigation

← prevFable departure skill creation

next →Argentina's plan for AI-run comp…

── more in #ai-agents 4 stories · sorted by recency

dev.to · 4 Jul · #ai-agents

Workflow Series (06): Security — Cross-Step Injection Propagation and Four Defense Principles

dev.to · 4 Jul · #ai-agents

AI Won’t Replace Developers — But Developers Who Use AI Will Replace Those Who Don’

byteiota.com · 4 Jul · #ai-agents

Meta AI Agents Are Stalling: What Zuckerberg Admitted

dev.to · 4 Jul · #ai-agents

Cocos Creator 3.x Debugging: 17 Skills the Gray Screen Won't Tell You About

── more on @agent evidence levels 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required