AgentForge–28 production-grade skills that make AI agents ship reliable code

wpnews.pro

Forge production-grade AI agents.

28 battle-tested skills. 6 specialist personas. 5 reference checklists. One mission: make AI agents build software like senior engineers.

AI coding agents are fast. They're also reckless.

They skip specs. They "forget" tests. They ship without review. They treat "it works on my machine" as a success criteria. In short, they build prototypes, not production software.

AgentForge fixes this.

We don't give agents vague suggestions. We give them structured, battle-tested workflows that encode how senior engineers actually build software — the same workflows that power teams at Google, Netflix, and Stripe. Every skill has steps, checkpoints, anti-rationalization defenses, and evidence-based verification. When an agent follows these, it ships code you can trust.

  DEFINE          PLAN           BUILD          VERIFY         REVIEW          SHIP           OPS
 ┌──────┐      ┌──────┐      ┌──────┐      ┌──────┐      ┌──────┐      ┌──────┐      ┌──────┐
 │ Idea │ ───▶ │ Spec │ ───▶ │ Code │ ───▶ │ Test │ ───▶ │  QA  │ ───▶ │  Go  │ ───▶ │ Run  │
 │Refine│      │  PRD │      │ Impl │      │Debug │      │ Gate │      │ Live │      │ Ops  │
 └──────┘      └──────┘      └──────┘      └──────┘      └──────┘      └──────┘      └──────┘
  /spec          /plan          /build        /test         /review       /ship

7 slash commands. 28 skills. 6 phases. Zero excuses.

Other Prompt Packs	AgentForge
Structure
Vague advice	Step-by-step workflows with checkpoints
Verification
"Make sure it works"	Evidence-based exit criteria (tests, builds, runtime data)
Anti-cheating
None	Rationalization tables that call out excuses agents use to skip steps
Scope
Generic coding tips	Full lifecycle: spec → plan → build → verify → review → ship → ops
Quality gates
None	Built-in CI pipeline with 8 automated checks
Cross-reference
Silos	Every skill references related skills; no duplication

7 slash commands that map to the development lifecycle. Each one activates the right skills automatically.

What you're doing	Command	Key principle
Define what to build	`/spec`
Spec before code
Plan how to build it	`/plan`
Small, atomic tasks
Build incrementally	`/build`
One slice at a time
Prove it works	`/test`
Tests are proof
Review before merge	`/review`
Improve code health
Simplify the code	`/code-simplify`
Clarity over cleverness
Ship to production	`/ship`
Faster is safer

Want fewer manual steps once the spec exists? ** /build auto** generates the plan and implements every task in a single approved pass — you approve the plan once, then it runs autonomously. It removes the human stepping

betweentasks, not the verification: every task is still test-driven and committed individually, and it s on failures or risky steps.

Skills also activate automatically based on what you're doing — designing an API triggers api-and-interface-design

, building UI triggers frontend-ui-engineering

, and so on.

Claude Code (recommended)

Marketplace install:

/plugin marketplace add borhen68/SkillEngine
/plugin install agentforge@borhen-agentforge

SSH errors?The marketplace clones repos via SSH. If you don't have SSH keys set up on GitHub, either[add your SSH key]or use the full HTTPS URL to force the HTTPS cloning:

/plugin marketplace add https://github.com/borhen68/SkillEngine.git
/plugin install agentforge@borhen-agentforge

Local / development:

git clone https://github.com/borhen68/SkillEngine.git
claude --plugin-dir /path/to/agentforge

Cursor

Copy any SKILL.md

into .cursor/rules/

, or reference the full skills/

directory. See docs/cursor-setup.md.

Antigravity CLI

Install as a native plugin for skills, subagents, and slash commands. See docs/antigravity-setup.md.

Install from the repo:

agy plugin install https://github.com/borhen68/SkillEngine.git

Install from a local clone:

git clone https://github.com/borhen68/SkillEngine.git
agy plugin install ./agentforge

Gemini CLI

Install as native skills for auto-discovery, or add to GEMINI.md

for persistent context. See docs/gemini-cli-setup.md.

Install from the repo:

gemini skills install https://github.com/borhen68/SkillEngine.git --path skills

Install from a local clone:

gemini skills install ./agentforge/skills/

Windsurf

Add skill contents to your Windsurf rules configuration. See docs/windsurf-setup.md.

OpenCode

Uses agent-driven skill execution via AGENTS.md and the skill

tool.

GitHub Copilot

Use agent definitions from agents/

as Copilot personas and skill content in .github/copilot-instructions.md

. See docs/copilot-setup.md.

**Kiro IDE & CLI **

Skills for Kiro reside under ".kiro/skills/" and can be stored under Project or Global level. Kiro also supports Agents.md. See Kiro docs at https://kiro.dev/docs/skills/

Codex / Other Agents

Skills are plain Markdown - they work with any agent that accepts system prompts or instruction files. See docs/getting-started.md.

The commands above are entry points. The pack includes 28 skills total — 24 lifecycle skills, 4 operations skills, plus the using-agentforge

meta-skill. Each skill is a structured workflow with steps, verification gates, and anti-rationalization tables. You can also reference any skill directly.

Skill	What It Does	Use When

Skill	What It Does	Use When

idea-refine spec-driven-development| Skill | What It Does | Use When | |---|---|---| |

Skill	What It Does	Use When

test-driven-development context-engineering source-driven-development doubt-driven-development frontend-ui-engineering api-and-interface-design| Skill | What It Does | Use When | |---|---|---| |

debugging-and-error-recovery| Skill | What It Does | Use When | |---|---|---| |

code-simplification security-and-hardening performance-optimization| Skill | What It Does | Use When | |---|---|---| |

ci-cd-and-automation deprecation-and-migration documentation-and-adrswhyobservability-and-instrumentation shipping-and-launch| Skill | What It Does | Use When | |---|---|---| |

cost-optimization data-engineering ai-opsPre-configured specialist personas for targeted reviews:

Agent	Role	Perspective

test-engineer security-auditor web-performance-auditor/webperf

site-reliability-engineerQuick-reference material that skills pull in when needed:

Reference	Covers

security-checklist.md performance-checklist.md accessibility-checklist.md reliability-checklist.mdEvery skill follows a consistent anatomy:

┌─────────────────────────────────────────────────┐
│  SKILL.md                                       │
│                                                 │
│  ┌─ Frontmatter ─────────────────────────────┐  │
│  │ name: lowercase-hyphen-name               │  │
│  │ description: Guides agents through [task].│  │
│  │              Use when…                    │  │
│  └───────────────────────────────────────────┘  │                                                                                                
│  Overview         → What this skill does        │
│  When to Use      → Triggering conditions       │
│  Process          → Step-by-step workflow       │
│  Rationalizations → Excuses + rebuttals         │
│  Red Flags        → Signs something's wrong     │
│  Verification     → Evidence requirements       │
└─────────────────────────────────────────────────┘

Key design choices:

Process, not prose. Skills are workflows agents follow, not reference docs they read. Each has steps, checkpoints, and exit criteria.Anti-rationalization. Every skill includes a table of common excuses agents use to skip steps (e.g., "I'll add tests later") with documented counter-arguments.Verification is non-negotiable. Every skill ends with evidence requirements - tests passing, build output, runtime data. "Seems right" is never sufficient.Progressive disclosure. TheSKILL.md

is the entry point. Supporting references load only when needed, keeping token usage minimal.

agent-skills/
├── skills/                            # 28 skills (24 lifecycle + 4 ops + 1 meta)
│   ├── interview-me/                  #   Define
│   ├── idea-refine/                   #   Define
│   ├── spec-driven-development/       #   Define
│   ├── planning-and-task-breakdown/   #   Plan
│   ├── incremental-implementation/    #   Build
│   ├── context-engineering/           #   Build
│   ├── source-driven-development/     #   Build
│   ├── doubt-driven-development/      #   Build
│   ├── frontend-ui-engineering/       #   Build
│   ├── test-driven-development/       #   Build
│   ├── api-and-interface-design/      #   Build
│   ├── browser-testing-with-devtools/ #   Verify
│   ├── debugging-and-error-recovery/  #   Verify
│   ├── code-review-and-quality/       #   Review
│   ├── code-simplification/           #   Review
│   ├── security-and-hardening/        #   Review
│   ├── performance-optimization/      #   Review
│   ├── git-workflow-and-versioning/   #   Ship
│   ├── ci-cd-and-automation/          #   Ship
│   ├── deprecation-and-migration/     #   Ship
│   ├── documentation-and-adrs/        #   Ship
│   ├── observability-and-instrumentation/ # Ship
│   ├── shipping-and-launch/           #   Ship
│   ├── chaos-engineering/             #   Ops
│   ├── cost-optimization/             #   Ops
│   ├── data-engineering/              #   Ops
│   ├── ai-ops/                        #   Ops
│   └── using-agentforge/            #   Meta: how to use this pack
├── agents/                            # 5 specialist personas
├── references/                        # 5 supplementary checklists
├── hooks/                             # Session lifecycle hooks
├── scripts/                           # Validation & build automation
├── .claude/commands/                  # 7 slash commands (Claude Code)
├── .gemini/commands/                  # 7 slash commands (Gemini CLI)
├── commands/                          # 8 slash commands (Antigravity CLI)
├── plugin.json                        # Antigravity plugin manifest
├── package.json                       # Node.js tooling & scripts
├── Makefile                           # Local development workflows
└── docs/                              # Setup guides per tool

This repository includes a comprehensive validation and quality pipeline:

npm install

npm test

make ci

Available commands:

Command	What It Does
`npm run validate`
Validate all skill files for anatomy compliance
`npm run validate:strict`
Same, but warnings block CI
`npm run quality:cross-skill`
Check cross-skill consistency and references
`npm run quality:agents`
Validate agent persona files
`npm run test:hooks`
Test session lifecycle hooks
`npm run build:packages`
Build .zip packages for distribution
`npm run stats`
Show project statistics dashboard

Quality gates enforced:

YAML frontmatter validation (name, description, max length)
Required sections: Overview, When to Use, Common Rationalizations, Red Flags, Verification
Cross-skill reference integrity (no dead links)
Internal markdown link validation
Description quality (must contain both "what" and "when" signals)
Token estimation and size warnings
Code block language specifier checks
Agent persona consistency
Lifecycle coverage completeness

"I watched an AI agent ship a 'working' feature that had no tests, no error handling, and a SQL injection vulnerability. It was 'done' in 20 minutes. It would have taken 2 days to fix in production." — Every engineering lead, 2024-2025

AI agents are incredible accelerators. They're also incredible liability generators — because they optimize for speed, not correctness. They don't know what they don't know, and they don't know that they don't know it.

AgentForge is the guardrail.

Every skill in this pack encodes hard-won judgment from production engineering:

When to write a spec (always, for anything non-trivial)What to test (behavior, not implementation; edge cases, not just happy path)How to review (five axes, not just "does it compile")When to ship (when rollback is faster than fix-forward)

These aren't theoretical ideals. They're the workflows that separate teams that sleep through launches from teams that don't.

This pack draws from the best engineering cultures in the world:

Google: Hyrum's Law, Beyonce Rule, test pyramid, change sizing, trunk-based development, code as liabilityNetflix: Chaos engineering, circuit breakers, graceful degradationStripe: API design, backward compatibility, developer experienceAmazon: Two-pizza teams, service boundaries, operational readiness

Every principle is embedded directly into the step-by-step workflows agents follow — not as footnotes, but as non-negotiable steps.

We accept contributions that make agents more reliable, not more clever.

See docs/skill-anatomy.md for the format specification and CONTRIBUTING.md for guidelines. Every PR goes through the same quality gates the skills enforce — eat your own dog food.

MIT — use these skills in your projects, teams, and tools. Build something great.

source & further reading

github.com — original article

AgentForge–28 production-grade skills that make AI agents ship reliable code

Run your AI side-project on zahid.host