I built a 9-agent AI dev team in a Claude Code plugin — here's what happened A developer built a nine-agent AI development team as a Claude Code plugin, orchestrating specialized agents to take a feature request from requirements analysis through production deployment. The plugin, called claude-dev-pipeline, assigns roles including product manager, architect, backend and frontend engineers, QA tester, code reviewer, and DevOps specialist, with human approval checkpoints at each phase. The system successfully generated a complete authentication feature for a task manager app, including code, tests, and deployment configuration, in approximately 20 minutes. I was building a side project — a simple task manager app. I opened Claude Code, typed: "Add user authentication with email and password login" …and hit enter. Twenty minutes later, I had code. A lot of code. Authentication logic, routes, middleware, even some basic tests. But there was a problem. The frontend me, on a different day had assumed a different API shape. The tests only covered the happy path. There was no architecture decision to reference — I just picked JWT because it felt right. And the docker-compose.yml ? It didn't exist yet. I had AI-generated code, but no real software development workflow. Good software isn't just code. Before you write a single line, you need: Normally, a team handles all of this. A PM writes the spec. An architect proposes options. Engineers implement and review each other's work. A DevOps person sets up CI/CD. What if AI could fill all those roles? I built claude-dev-pipeline — a Claude Code plugin that orchestrates a team of specialized AI agents, each with a specific job. A Claude Code plugin that orchestrates 7 specialized AI agentsto take your feature request all the way from requirements analysis to production deployment — with a human-in-the-loop checkpoint at every phase. Writing a feature involves more than just code. You need: claude-dev-pipeline encodes that workflow as a Claude Code plugin. Each agent is an expert. You stay in control at every gate. | | Agent | Role | Output | |---|---|---|---| | 0 | Discovery | Clarifies vague requirements via dialogue | Confirmed requirement | | 1 | Exploration | Scans existing codebase in parallel | .pipeline/exploration.md | | 2 | PM | Writes a structured PRD with user stories & acceptance criteria | .pipeline/pm.md | | 3 | Architect | Proposes 2–3 architecture options | The idea was simple: instead of one big AI doing everything, use multiple agents in sequence — each expert at one job — with you approving the output at every important gate. Nine agents, nine phases: | | Agent | Role | Output | |---|---|---|---| | 0 | Discovery | Clarifies vague requirements via dialogue | Confirmed requirement | | 1 | Exploration | Scans existing codebase in parallel | .pipeline/exploration.md | | 2 | PM | Writes a structured PRD with user stories & acceptance criteria | .pipeline/pm.md | | 3 | Architect | Proposes 2–3 architecture options with trade-offs | .pipeline/architect.md | | 4a | Backend | Implements REST APIs, services, repositories | src/backend/ | | 4b | Frontend | Implements React UI, hooks, API client | src/frontend/ | | 5 | QA | Writes and runs unit, integration, and E2E tests | tests/ | | 6 | Reviewer | Audits code for security, bugs, and quality confidence ≥ 80 | .pipeline/review.md | | 7 | DevOps | Creates Dockerfile, docker-compose, GitHub Actions CI/CD | deploy/ | Phases 4a and 4b run in parallel. The full flow: User requirement │ Discovery ← asks clarifying questions if vague │ confirmed requirement Exploration ← scans codebase in parallel 2 sub-agents │ exploration.md PM ← you review & approve PRD │ pm.md Architect ← you choose 1 of 3 architecture options │ architect.md ┌────┴────┐ Backend Frontend ← run in parallel └────┬────┘ QA │ tests green Reviewer ← pipeline pauses if Critical issues 3 │ review.md clean DevOps │ 🎉 Done Human-in-the-loop at every critical gate. You approve the PRD before architecture begins. You choose one of three architecture options before code is written. The Reviewer pauses everything if it finds more than 3 critical issues. AI does the work; you stay in control. Backend and Frontend run in parallel. Since both agents were given the same architecture document, they fit together. This shaves real time off the pipeline and eliminates the classic "your API doesn't match what I expected" problem. Every agent writes a structured artifact. The PM writes .pipeline/pm.md . The Architect writes .pipeline/architect.md . These files become the living memory of your project — persistent knowledge that survives across pipeline runs and future features. Git auto-commits after each approved phase. Every milestone is tracked: pipeline: PM — add PRD for