I Packed DeepSeek V4 + Claude Code Into a Starter Kit. Clone It and Ship. A developer has released an open-source starter kit that integrates DeepSeek V4 into Claude Code, featuring nine custom agents, seven behavior rules, security hooks, local OCR, and auto-backup. The kit, available via a single git clone command, optimizes performance by assigning the Pro model to the main agent and Flash to sub-agents, and includes a six-level decision ladder to enforce YAGNI principles. TL;DR:I spent weeks tuning DeepSeek V4 to feel native inside Claude Code. The result: a one-command setup with 9 agents, 7 rules, security hooks, OCR, and auto-backup. Clone → ./init.sh → you're shipping. I wanted DeepSeek's 1M context window inside Claude Code's interface. What I got was two weeks of fighting API configs, debugging token limits, and discovering that most "just swap the model" advice is missing half the puzzle. So I packaged everything into a single repo. git clone https://github.com/YuhaoLin2005/deepseek-claude-code-starter.git cd deepseek-claude-code-starter ./init.sh Three commands. Here's what lands in your ~/.claude/ : | Component | What It Does | |---|---| 9 Custom Agents | Code review, security audit, TDD guide, architecture, build-fix — each tuned for DeepSeek's reasoning style | 7 Behavior Rules | Code quality, security, testing discipline, YAGNI enforcement, commit standards | Security Hook | PreToolUse guard — blocks sensitive file access and dangerous commands before they execute | Auto-Backup | Pre-edit snapshots keeps 5 + session-start git commit. You'll thank me the first time you roll back | Local OCR | RapidOCR on ONNX — lets DeepSeek "see" screenshots without sending them to a cloud API | Status Line | Compaction counter — warns you at 5+ compactions to start fresh | The single biggest quality-of-life improvement: main agent gets the Pro model, sub-agents get Flash. DeepSeek V4 Pro handles architecture, debugging, and complex reasoning. But reading files? Searching? Running tests? Those don't need a 1M-context reasoning beast. Flash is faster, cheaper, and completely adequate for the grunt work. This one decision doubled my effective throughput. The main agent never waits behind a queue of file-reads. One rule I'm unreasonably proud of: the 6-level decision ladder. Level 0: stdlib can do it → don't write code Level 1: one-liner → don't write fifty lines Level 2: existing tool → don't build a replacement Level 3: simple script → don't build a framework Level 4: library → don't build from scratch Level 5: only then, build It's YAGNI compiled into a decision tree. When every token costs money, "just in case" code is a bill you pay every session. My self-model protocol https://dev.to/yuhaolin2005/i-open-sourced-the-protocol-that-keeps-my-ai-from-forgetting-who-i-am-4pp handles identity across sessions. But identity is useless if the tools aren't right. This starter kit is the companion piece — the "body" to the protocol's "mind." Both are MIT licensed. Both took weeks of trial and error to get right. Both are now one git clone away. Have you tried running non-Claude models inside Claude Code? What broke first — the API, the prompts, or your patience? Drop your war stories in the comments. Related: LLM compaction isn't linear • Self-model protocol for AI identity persistence