I built a Claude skill that keeps your AI coding tools from contradicting each other — and I need beta testers

[ARTICLE · art-3149] src=dev.to ↗ pub=2026-05-20T15:02Z topic=artificial-intelligence verified=true sentiment=↑ positive

This article introduces a Claude skill called "spec-driven-development" that creates standardized specification files (requirements.md, design.md, tasks.md) and matching configuration files for multiple AI coding tools (Claude Code, Cursor, Windsurf, Copilot, Aider) so that the tools do not contradict each other. The skill includes a CONTEXT.md session journal for continuity across sessions and a retrofit workflow for existing codebases, and it ships with a test suite of 135 assertions, all passing. The developer is seeking five beta testers to use the tool on real projects and report issues via GitHub.

3 min read · 22 views · published May 20, 2026

If you use more than one AI coding tool (Claude Code, Cursor, Copilot, Windsurf), you've probably hit this:

You ask one to build a feature. It does something reasonable. You ask another to extend it. It contradicts the first. You ask a third to clean up. Now you have three different interpretations of what the system should do.

This isn't a bug in any of the tools. It's a missing source of truth.

I built a Claude skill called spec-driven-development that generates three files before any code is written:

requirements.md: what the system must do (REQ-xxx IDs, acceptance criteria)
design.md: how it will be built (data models, endpoints, file structure)
tasks.md: atomic ordered steps, each linked to a requirement

Then it generates matching AI config files for every tool you use:

CLAUDE.md ← Claude Code reads this automatically
.cursorrules ← Cursor
.windsurfrules ← Windsurf
.github/copilot-instructions.md ← GitHub Copilot
.aider.conf.yml ← Aider

Each config file contains the same Universal Instruction Block: identical constraint rules pointing every agent at the same spec files. They can't drift because they all defer to the same authority.

There's a fourth file: CONTEXT.md. It's a session journal. When your context window fills and you start a fresh Claude Code session, Claude reads CONTEXT.md first and announces:

"Session 4 resuming. Last session we completed TASK-005 (JWT middleware). Active task is TASK-007 (POST /tasks implementation). Ready to continue."

No re-explaining. No lost context. Just continuation.

If you already have code but no specs, the retrofit workflow reverse-engineers them from what you describe. Fields that weren't explicitly confirmed get marked [TO VERIFY]. The first phase of tasks.md is always "Spec Verification": tasks that confirm the spec actually matches the live code before any new work starts.

I didn't just ship it and hope.
I built a proper test suite:

Phase 2A: Static assertions (67 checks). A Python script that checks SKILL.md and reference files for structural correctness. Runs in GitHub Actions CI on every push.

Phase 2B: Behavioral tests (15 prompts). Run in a live Claude Code session. For each prompt, Claude simulates a full response before looking at the assertions (blind evaluation). Tests include "continue where we left off" (CONTEXT.md present) and "what are we working on?" (CONTEXT.md absent).

Phase 2C: Generation quality (53 checks). Three full end-to-end flows: greenfield project, retrofit codebase, cross-AI configuration. Claude Code generates real files, and a Python checker validates every file. These run in CI against committed fixtures.

Total: 135 assertions. All passing. CI is green.

The test suite ships with the skill. Every future change must pass before merging.

The 135 assertions were written by me, so they test what I anticipated. What they don't test: a stranger saying "help me get organised" or "scaffold me a project", phrasing I didn't think of. That's the beta.

I'm looking for 5 testers. All you do is use it naturally on your real work and file GitHub Issues when something doesn't work. One issue per problem. Include the exact phrase you used; that's the most valuable data.

Repo (MIT): https://github.com/FredAntB/Spec-Driven-Development

Open an issue titled "[Beta] I'd like to test" and describe which profile fits you. I'll get back to you within 24 hours.
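As a rough illustration of the no-drift idea behind the Universal Instruction Block, here is a minimal Python sketch of a static check in the spirit of Phase 2A. It is not taken from the skill's repository. The marker strings, the function names `extract_block` and `find_drift`, and the whitespace normalization are all assumptions made for this example, since the article does not say how the block is delimited. It works on in-memory text and skips format-specific handling, such as a YAML file like .aider.conf.yml would need.

```python
from typing import Dict, List, Optional

# Hypothetical delimiters: the article does not say how the skill marks
# the Universal Instruction Block inside each config file.
BLOCK_START = "BEGIN UNIVERSAL INSTRUCTION BLOCK"
BLOCK_END = "END UNIVERSAL INSTRUCTION BLOCK"


def extract_block(text: str) -> Optional[str]:
    """Return the text between the markers, or None if a marker is missing.

    Lines are stripped and blank lines dropped so that indentation or
    spacing differences alone are not reported as drift.
    """
    start = text.find(BLOCK_START)
    if start == -1:
        return None
    body_start = start + len(BLOCK_START)
    end = text.find(BLOCK_END, body_start)
    if end == -1:
        return None
    lines = [line.strip() for line in text[body_start:end].splitlines()]
    return "\n".join(line for line in lines if line)


def find_drift(configs: Dict[str, str]) -> List[str]:
    """Compare the instruction block across config files.

    `configs` maps a file name to its full text. The first file that
    contains a block is used as the reference. An empty result means
    every file carries the same block.
    """
    problems: List[str] = []
    blocks: Dict[str, str] = {}
    for name, text in configs.items():
        block = extract_block(text)
        if block is None:
            problems.append(f"{name}: instruction block missing")
        else:
            blocks[name] = block
    if blocks:
        reference_name = next(iter(blocks))
        reference = blocks[reference_name]
        for name, block in blocks.items():
            if block != reference:
                problems.append(f"{name}: block differs from {reference_name}")
    return problems


if __name__ == "__main__":
    shared = (
        f"{BLOCK_START}\n"
        "Follow requirements.md, design.md and tasks.md.\n"
        f"{BLOCK_END}\n"
    )
    configs = {"CLAUDE.md": "Project notes\n" + shared, ".cursorrules": shared}
    print(find_drift(configs) or "no drift")
```

A check like this could run in CI next to the other static assertions, so that a hand edit to one tool's config file is flagged before the tools start to disagree.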


Read original on dev.to → dev.to/lmntrix/i-built-a-claude-skill-that-keeps…
