Anthropic's open-source framework for AI-powered vulnerability discovery Anthropic released an open-source reference implementation for autonomous vulnerability discovery and remediation using its Claude AI model, based on learnings from security team partnerships. The framework provides a configurable pipeline for source code scanning, triage, and patching, and is designed to help organizations build their own vulnerability-finding systems using Claude APIs across platforms including Bedrock, Vertex, or Azure. A reference implementation for autonomous vulnerability discovery and remediation with Claude, based on our learnings from partnering with security teams at several organizations https://www.anthropic.com/glasswing since launching Claude Mythos Preview. For a write up of these learnings along with best practices, see the accompanying blog post https://claude.com/blog/using-llms-to-secure-source-code also available in blog-post.md /anthropics/defending-code-reference-harness/blob/main/docs/blog-post.md . For a lightweight SDK-only walkthrough of the same recon → find → triage → report → patch loop, see the companion cookbook https://platform.claude.com/cookbook/claude-agent-sdk-06-the-vulnerability-detection-agent . This repo is not maintained and is not accepting contributions. 🔒 Want a managed option?Anthropic offers Claude Security , a hosted product that finds and fixes vulnerabilities in your source code across multiple projects. Claude Security scans your repository for vulnerabilities, applies a multi-stage verification pipeline to reduce false positives, and lets you manage findings through their lifecycle: triage, fix validation, and rapid fix generation.This repository is an open-source reference implementation based on general best practices for finding vulnerabilities using Claude. You can use it to build your own vulnerability finding pipeline, customize the logic, and it can be used with whatever access you have to Claude APIs including Bedrock, Vertex, or Azure . Claude Code skills : /quickstart , /threat-model , /vuln-scan , /triage , /patch , /customize : interactive scoping, scanning, triage, and patching. Open this repo in Claude Code and run /quickstart to get oriented.: the autonomous reference pipeline recon → find → verify → report → patch , configured for finding C/C++ memory vulnerabilities using Docker and ASAN. This harness is a harness/ reference, not a product . The general shape, prompts, and sandboxing are reusable, but the harness will not work on every codebase out of the box. Run /customize to port it to your language, detector, or vuln class. ⚠️ Security: /quickstart , /threat-model , /vuln-scan , and /triage only read and write files. Running /patch on static findings TRIAGE.json or VULN-FINDINGS.json is likewise read- and write-only. /customize edits the harness code and runs validation commands. Any of these skills are safe to run unsandboxed, as long as you review and approve each tool use in Claude Code. The autonomous reference pipeline including /patch on pipeline results executes target code, so it refuses to run outside of a gVisor sandbox unless explicitly overridden. To get set up, run scripts/setup sandbox.sh once, then invoke the pipeline via bin/vp-sandboxed . See docs/security.md and docs/agent-sandbox.md for more details. git clone https://github.com/anthropics/defending-code-reference-harness cd defending-code-reference-harness claude 30-sec intro + guided first run on the canary target /quickstart /quickstart how do I port the pipeline to Java? /quickstart how do I triage all these bugs? · The accompanying blog post with learnings + best practices Blog Post · How it works: diagram, stages, CLI flags Pipeline · Sandboxing, what not to mount Security · gVisor isolation + egress allowlist for every agent Agent sandbox · Port to my stack; which files change and why Customize · Generate and verify fixes for verified crashes Patching · Duplicates, rate limits, subagent model pinning Troubleshooting · Block for dangerous cyber work Safeguards The most successful security teams we've partnered with are those that have gotten hands-on the fastest. Though it's tempting to spend months designing the perfect pipeline, we recommend starting small on Day 1 and building from there as learnings come. The steps below follow that pattern and set an ambitious but reasonable pace based on what we've seen. | Day 1 Step 2 step-2-day-2-run-the-reference-pipeline-on-a-cc-library Day 2 Step 3 step-3-days-3-5-customize-the-pipeline-for-your-target Days 3-5 Step 4 step-4-week-2-start-autonomous-scanning-triage-and-patching Week 2 Day 1 is focused on seeing the whole loop end-to-end. Using only the interactive skills, you'll build a threat model, run a static scan scoped by it, triage what comes back, and draft candidate fixes. You'll finish the day with a threat model, a ranked list of static findings, and candidate patches. The relevant skills only read and write files in your repo. As long as you run Claude Code interactively and approve each tool use, no sandbox is needed. Pin every subagent to the model you want export CLAUDE CODE SUBAGENT MODEL=