Two tiny Claude Code skills that fixed my two biggest agent problems

wpnews.pro

cd /news/developer-tools/two-tiny-claude-code-skills-that-fix… · home › topics › developer-tools › article

[ARTICLE · art-26724] src=dev.to ↗ pub=2026-06-14T03:57Z topic=developer-tools verified=true sentiment=↑ positive

Two tiny Claude Code skills that fixed my two biggest agent problems

A developer at dualform labs released two open-source skills for Claude Code that address common agent failures: spec, which forces upfront clarification of ambiguous tasks, and review-audit, which performs a read-only audit across six axes with evidence. Both are single prompt files under Apache-2.0 with no dependencies.

read2 min views22 publishedJun 14, 2026

Two open-source skills for Claude Code. Each is a single prompt file, Apache-2.0, no dependencies. Repos at the bottom.

Working with a coding agent, I kept hitting the same two failure modes. Not "the model can't write code" — it writes code fine. The failures were upstream and downstream of the code: the agent guessing on an ambiguous task, and me trusting a review that hadn't actually checked anything.

So I built one small skill for each. Here's what they do and why they're shaped the way they are.

Hand a vague task to an agent and you watch the same thing happen. It guesses. It drifts. It quietly makes a call you'd have made differently — and you find out after the code is written, when changing your mind is expensive. The cost isn't the typing. It's the rework.

** spec** moves the decisions to the front. You run

/spec <one-line idea>

, it reads your repo and the conversation, then asks only what it Two things keep it honest:

done

only after it ran on a real case and the output was shown.The whole skill is one prompt file (SKILL.md

). No build, no dependencies.

Ask an AI to "review this change" and you can get a confident, plausible PASS — that skipped half the checks, cited no evidence, and never ran the tests. A green light you can't trust is worse than no review.

** review-audit** is a read-only, single-pass audit over your change across six axes: correctness, wiring (built-but-never-called / dead code), security, test efficacy, spec compliance, and regression. The discipline is simple and strict:

file:line

grep/run evidence. "I didn't check this" is a first-class output, not a silent gap.file:line

. "Looks fine" isn't evidence.It runs in the calling agent's own context — no sub-agent fan-out — so it stays cheap enough to run on every change. When one pass genuinely isn't enough (a release gate, a high-risk change), it tells you, in its own output, to escalate.

Both are one prompt file each.

git clone https://github.com/dualform-labs/spec-skill.git
cp -r spec-skill/skills/spec ~/.claude/skills/

git clone https://github.com/dualform-labs/review-audit.git
cp -r review-audit/skills/review-audit ~/.claude/skills/

Then in Claude Code: /spec a menu-bar app that warns me when my Mac is thermally throttled

, or /review-audit

on a change before you call it done. Output language is auto

/ ja

/ en

No network calls (Claude Code only), no telemetry, no bypass-permissions.

These are prompt-file skills, not magic. Single-pass detection in review-audit

is model-dependent; if you need per-run proof of detection power or fresh-context adversarial verification, that's the heavier review-audit-pro

tier (coming soon). And spec

won't make a bad idea good — it just makes the decisions explicit before code is written, where they're cheap to change.

If you try them, I'd genuinely like to hear where they break or annoy you.

— a dualform project

source & further reading

dev.to — original article `finish_reason=length` Returned Empty Content — and the Error Message Lied to Me Combined Offense + Defense (Engineering Edition) — Cross-Project Reuse Matrix and When Not to Use What actually belongs in CLAUDE.md — and what to move to skills, hooks, or docs

~/api · this article 200

$curl api.wpnews.pro/v1/news/two-tiny-claude-code-ski…

Read original on dev.to → dev.to/dualform/two-tiny-claude-code-skills-that…

mentioned entities

dualform labs

Claude Code

spec

review-audit

metadata

slugtwo-tiny-claude-code-skills-that-fixed-my-two-biggest-agent-problems

topic#developer-tools

secondary2 topics

sentimentpositive

canonicaldev.to

navigation

← prevThe future of Siri, or: why priv…

next →Speeding Up JumpReLU SAE Inferen…

── more in #developer-tools 4 stories · sorted by recency

dev.to · 30 Jul · #developer-tools

What actually belongs in CLAUDE.md — and what to move to skills, hooks, or docs

dev.to · 30 Jul · #developer-tools

Combined Offense + Defense (Engineering Edition) — Cross-Project Reuse Matrix and When Not to Use

github.com · 30 Jul · #developer-tools

Show HN: A local merge queue for parallel Claude Code agents

dev.to · 30 Jul · #developer-tools

I generated 207 MCP tools from an OpenAPI spec. Generating them was the easy part.

── more on @dualform labs 3 stories trending now

wpnews · 29 Jul · #ai-safety

News Summary for July 29, 2026

wpnews · 29 Jul · #artificial-intelligence

Investors are selling Meta as it heads to its earnings report

wpnews · 28 Jul · #large-language-models

How to Download and Run Kimi K3 Open Weights

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required