I Built a Dual-Pool Adversarial Review System for AI Agents — And It Actually Works

Source: dev.to · Published 2026-06-27T14:53Z

A developer built a dual-pool adversarial review system for AI agents that uses real engineers with searchable philosophies instead of abstract roles. The system employs a fixed pool for stability and a random pool for divergence, with managers curating teams per task. Testing on a pull request to the claude-skills repository (18.7K stars) showed the random pool uncovered issues missed by fixed-pool reviewers.


AI code review has a problem: abstract roles produce generic feedback. "Saboteur" says "add error handling." "New Hire" says "this is confusing." Useful? Sometimes. Specific? Rarely.

I built something different: a review system that uses real engineers with searchable philosophies instead of abstract roles. Linus Torvalds doesn't say "consider error handling" — he says "eliminate the special case entirely." That's not a wording difference. That's a completely different action.

Fixed Pool (Convergence)          Random Pool (Divergence)
Digital-twin matched              Web-searched fresh each time
Stability & depth                 Surprise & blind-spot coverage
    │                                    │
    └────────── Cross-orchestrated ──────┘
              explore ←→ exploit

The fixed pool holds 9 workers + 2 managers, curated to match the user's expertise, personality, and goals. Patty McCord (Netflix's former Chief Talent Officer) and Ed Catmull (Pixar's Braintrust creator) serve as the managers, recruiting a team per task instead of using a fixed template.

The random pool draws fresh personas via web search each session. There is no preset list: the manager defines search keywords based on what the task needs. This is where the surprises come from.
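The two pools described above could be modeled roughly as follows. This is a minimal sketch, not the author's implementation: the `Persona`, `FixedPool`, and `RandomPool` names, the `search` hook, and the `stub_search` catalog with its "Reviewer A/B/C" entries are all hypothetical stand-ins, and a real system would call an actual web search where the stub sits.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Persona:
    name: str
    philosophy: str = ""  # short, searchable summary of how this person reviews


@dataclass
class FixedPool:
    """Stable roster curated once for the user (assumed shape)."""
    workers: List[Persona]
    managers: List[Persona]


@dataclass
class RandomPool:
    """No preset list: personas are looked up fresh on every call."""
    search: Callable[[str], List[Persona]]  # hypothetical web-search hook

    def recruit(self, keywords: List[str]) -> List[Persona]:
        # Run one search per manager-chosen keyword and drop duplicate names,
        # keeping the order in which personas were first found.
        found: List[Persona] = []
        seen = set()
        for keyword in keywords:
            for persona in self.search(keyword):
                if persona.name not in seen:
                    seen.add(persona.name)
                    found.append(persona)
        return found


# Placeholder catalog standing in for live web search results.
_CATALOG: Dict[str, List[Persona]] = {
    "api design": [Persona("Reviewer A"), Persona("Reviewer B")],
    "security": [Persona("Reviewer B"), Persona("Reviewer C")],
}


def stub_search(keyword: str) -> List[Persona]:
    return _CATALOG.get(keyword, [])


fixed = FixedPool(
    workers=[Persona(f"Worker {i}") for i in range(1, 10)],  # 9 workers
    managers=[Persona("Patty McCord"), Persona("Ed Catmull")],
)
random_pool = RandomPool(search=stub_search)
fresh_team = random_pool.recruit(["api design", "security"])
```

Keeping the search function injectable means the random pool can be exercised offline with a stub, while the fixed pool stays a plain roster that changes only when someone edits it.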

Each round ends with a one-line report: [Manager] picked [A,B,C]. Found N issues. Verdict: BLOCK/CONCERNS/CLEAN
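The report line above maps naturally onto a small record type. The sketch below is an assumption about how such a record might look; `RoundReport`, `Verdict`, and `summary` are illustrative names, and the verdict is supplied by the caller because the article does not say how it is decided.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Verdict(Enum):
    BLOCK = "BLOCK"
    CONCERNS = "CONCERNS"
    CLEAN = "CLEAN"


@dataclass
class RoundReport:
    manager: str
    team: List[str]
    issues: List[str]
    verdict: Verdict  # chosen by the reviewers, not computed here

    def summary(self) -> str:
        # Mirrors the one-line format quoted in the article.
        picked = ",".join(self.team)
        return (
            f"{self.manager} picked [{picked}]. "
            f"Found {len(self.issues)} issues. "
            f"Verdict: {self.verdict.value}"
        )


report = RoundReport(
    manager="Ed Catmull",
    team=["A", "B", "C"],
    issues=["placeholder issue 1", "placeholder issue 2"],
    verdict=Verdict.CONCERNS,
)
print(report.summary())
# Ed Catmull picked [A,B,C]. Found 2 issues. Verdict: CONCERNS
```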

Next round: new manager, keep at most 2 previous members.
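The rotation rule can be sketched as a single function. This is an illustration under stated assumptions: `next_round` and its parameters are hypothetical, the team size of 3 is inferred from the `[A,B,C]` report format, and random choice stands in for the manager's judgment about whom to keep and whom to recruit.

```python
import random
from typing import List, Optional, Sequence, Tuple


def next_round(
    prev_manager: str,
    prev_team: Sequence[str],
    managers: Sequence[str],
    candidates: Sequence[str],
    team_size: int = 3,        # assumed from the [A,B,C] report format
    max_carryover: int = 2,    # "keep at most 2 previous members"
    rng: Optional[random.Random] = None,
) -> Tuple[str, List[str]]:
    rng = rng or random.Random()

    # Rule 1: the next round gets a different manager.
    others = [m for m in managers if m != prev_manager]
    if not others:
        raise ValueError("need at least two managers to rotate")
    manager = rng.choice(others)

    # Rule 2: carry over no more than max_carryover previous members.
    keep_n = min(max_carryover, len(prev_team), team_size)
    kept = rng.sample(list(prev_team), keep_n)

    # Fill the remaining seats with people who were not on the last team.
    fresh = [c for c in candidates if c not in prev_team]
    need = team_size - len(kept)
    if len(fresh) < need:
        raise ValueError("not enough new candidates to fill the team")
    return manager, kept + rng.sample(fresh, need)


manager, team = next_round(
    prev_manager="Patty McCord",
    prev_team=["A", "B", "C"],
    managers=["Patty McCord", "Ed Catmull"],
    candidates=["A", "B", "C", "D", "E"],
    rng=random.Random(0),
)
```

Capping the carryover forces at least one new voice into every round, which is the point of the rule: continuity from the kept members, fresh perspective from the rest.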

I tested this on my own PR to alirezarezvani/claude-skills (18.7K stars):

The random pool found things both fixed-pool rounds completely missed. Fixed pool reviewers — who know me — were blind to how an outsider would perceive the skill.

Aspect           alirezarezvani adversarial-reviewer  gaurav-yadav adversarial-ai-review  This System
Reviewers        Abstract roles                       Domain agents                       Real people + searchable philosophy
Team formation   Fixed 3-template                     22 agent pairs                      Manager-curated per task
Cross-round      Rotate roles                         Same agent set                      Swap pool + manager + workers
Personalization  None                                 None                                Digital twin matching
Evolution        Static                               Static                              Promote/demote/audit cycle


Read original on dev.to → dev.to/yuhaolin2005/i-built-a-dual-pool-adversar…
