CI gates for AI-generated PRs need re-derivable evidence

wpnews.pro

cd /news/developer-tools/ci-gates-for-ai-generated-prs-need-r… · home › topics › developer-tools › article

[ARTICLE · art-35245] src=dev.to ↗ pub=2026-06-21T01:08Z topic=developer-tools verified=true sentiment=· neutral

CI gates for AI-generated PRs need re-derivable evidence

Agent Gate v0.2.1 adds evidence snapshots to AI-generated pull request findings, enabling third parties to re-derive why a CI gate fired. The GitHub Action checks deterministic merge evidence without using LLMs at runtime, and the new feature provides canonical material alongside stable finding IDs for auditability. The developer behind Agent Gate aims to make failure modes visible, reproducible, and tunable before promoting warnings to merge gates.

read3 min views1 publishedJun 21, 2026

When a CI gate flags an AI-generated PR, the important question is not only "what did it flag?"

It is also:

"Could someone else come back later and re-derive why this finding fired?"

That is the reason I added evidence snapshots to Agent Gate v0.2.1.

Agent Gate is a GitHub Action for AI-generated pull requests.

It does not review code with an LLM. It checks deterministic merge evidence in CI:

The Action does not checkout PR code, call LLMs at runtime, or execute repository scripts.

In v0.2.0, Agent Gate added stable finding IDs.

That gave every finding a short audit handle, for example:

agf_987ab9ddb8c1b299

That is useful for references, comments, future override workflows, and log-based debugging.

But an ID by itself is not proof. If someone sees the ID later, they still need to know what recorded material produced it.

v0.2.1 adds evidenceSnapshot

to public findings.

The split is:

findingId = short audit handle
evidenceSnapshot = canonical material used to derive that handle

The snapshot is intentionally boring. It contains stable rule material such as:

It does not include timestamps, report order, risk score, version, commit SHA, or mutable display text.

Example compact log output:

Agent Gate: NEEDS HUMAN DECISION
Decision: warn
Risk score: 49 / 100
Why: Agent-generated PRs must include an agent-gate contract.
Recommended next step: Add a PR contract before relying on scope checks.
Policy status: warning today; eligible to become a merge gate after tuning.

Findings:
- error agf_be0c2c2a66312aff contract/missing
- error agf_987ab9ddb8c1b299 risk/high-risk-path .github/workflows/agent-gate.yml
- warn agf_6016e753491255d7 workflow/dangerous-pattern .github/workflows/agent-gate.yml

The compact log stays short, but the JSON and Markdown reports carry the fuller evidence.

Example JSON shape:

{
  "findingId": "agf_987ab9ddb8c1b299",
  "ruleId": "risk/high-risk-path",
  "severity": "error",
  "path": ".github/workflows/agent-gate.yml",
  "evidenceSnapshot": {
    "ruleId": "risk/high-risk-path",
    "severity": "error",
    "path": ".github/workflows/agent-gate.yml",
    "evidence": [
      {
        "label": "changed_file",
        "value": ".github/workflows/agent-gate.yml"
      }
    ]
  }
}

For me, the bar for promoting a finding from warning to blocking is:

A third party should be able to re-derive the finding from recorded evidence.

That does not mean the check is magically correct.

It means the failure mode is visible, reproducible, and tunable.

A repo can start in warn mode, observe which findings are useful, and only later promote low-noise findings into merge gates.

Agent Gate still does not prove semantic correctness.

Matching test-file evidence is not proof that the tests cover the behavior. It is change evidence / self-consistency evidence.

Maintainer override storage is also not implemented yet. That is probably the next hard design question: if someone bypasses a finding, where should that override live so it is durable enough to inspect later?

CODEOWNERS / reviewer evidence and package dependency drift are also future work.

If you maintain a repo where coding agents open PRs, I would love feedback on whether this kind of evidence is useful or too noisy in observe mode.

Repo:

https://github.com/sjh9714/Agent-Gate

Disclosure: I maintain Agent Gate. v0.2.1 is still a prerelease; I would start in warn mode before treating any finding as a merge gate.

source & further reading

dev.to — original article Building a Practical AI Assistant with Python: From Prompt to Production Thinking Output is cheap. Judgment is the job. Struggling with Slow AI Responses: Building a Streaming Chat UI with SSE

~/api · this article 200

$curl api.wpnews.pro/v1/news/ci-gates-for-ai-generate…

Read original on dev.to → dev.to/sjh9714/ci-gates-for-ai-generated-prs-nee…

mentioned entities

Agent Gate

GitHub

sjh9714

metadata

slugci-gates-for-ai-generated-prs-need-re-derivable-evidence

topic#developer-tools

secondary2 topics

sentimentneutral

canonicaldev.to

navigation

← prevUC Berkeley Robot Learns Motor T…

next →Mistral Vibe: Coding Agent With …

── more in #developer-tools 4 stories · sorted by recency

dev.to · 21 Jun · #developer-tools

I Built an Afriex MCP Prompt Cookbook So Developers Never Have to Stare at a Blank Prompt Again

github.com · 21 Jun · #developer-tools

Show HN: Rlsgate – Block the Supabase RLS leak before you deploy (CLI)

byteiota.com · 20 Jun · #developer-tools

Cloudflare Temporary Accounts Let AI Agents Deploy Without OAuth Hell

dev.to · 21 Jun · #developer-tools

Why you still do not trust your AI's memory

── more on @agent gate 3 stories trending now

wpnews · 20 Jun · #ai-safety

SR 11-7 Model Risk for AI Systems: What Banks Actually Need to Build

wpnews · 20 Jun · #ai-agents

Amazon Bedrock AgentCore Memory: Build AI Agents That Remember

wpnews · 20 Jun · #artificial-intelligence

Building a Voice AI Platform with 28 Modules in Python

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required