codebase-memory-mcp Cuts AI Agent Token Use by 121x

wpnews.pro

cd /news/ai-agents/codebase-memory-mcp-cuts-ai-agent-to… · home › topics › ai-agents › article

[ARTICLE · art-42875] src=byteiota.com ↗ pub=2026-06-29T03:22Z topic=ai-agents verified=true sentiment=↑ positive

codebase-memory-mcp Cuts AI Agent Token Use by 121x

An open-source MCP server called codebase-memory-mcp, built by DeusData, cuts the token cost of structural code queries by up to 121x by using a persistent knowledge graph. The tool, trending #1 on GitHub, indexes codebases with tree-sitter AST parsing and exposes 14 queryable tools to MCP-compatible coding agents, reducing token consumption from approximately 412,000 to 3,400 tokens for five typical queries. It runs locally as a static C binary with no cloud dependencies, supporting 158 languages and automatic re-indexing.

read4 min views1 publishedJun 29, 2026

codebase-memory-mcp Cuts AI Agent Token Use by 121x — Image: Byteiota (auto-discovered)

An open-source MCP server called codebase-memory-mcp is trending #1 on GitHub today — and for good reason. Built by DeusData and released June 20, 2026, it cuts the token cost of structural code questions by up to 121x. The mechanism is a persistent knowledge graph built from your codebase using tree-sitter AST parsing, exposed to any MCP-compatible coding agent as 14 queryable tools. Your agent stops reading files one-by-one and starts querying a graph at sub-millisecond speeds.

Why File-by-File Exploration Burns Your Token Budget #

When your AI coding agent needs to answer a structural question — “what calls this function?” or “where are all my REST routes?” — the default approach is brute-force: read auth.py, read user.py, read routes.py, grep through each one. That works. It also costs a fortune in tokens.

The numbers from the arXiv research preprint, evaluated across 31 real-world repositories, are striking. Five typical structural queries — finding a function, tracing a call chain, detecting dead code, listing routes, and getting an architecture overview — consume roughly 3,400 tokens via codebase-memory-mcp. The same five queries via file-by-file exploration burn approximately 412,000 tokens. That is a 121x gap, and it grows with codebase size. If you have been hitting context limits on large repos, this is probably why.

Related:[Tokenmaxxing Killed AI Budgets — What’s Replacing It]

A Knowledge Graph That Answers in Milliseconds #

codebase-memory-mcp builds a SQLite-backed graph of your codebase using tree-sitter — the same battle-tested parser powering GitHub’s code search, Neovim, and Helix. It indexes 158 languages and, for 9 of them (Python, TypeScript, Go, Rust, C, C++, Java, PHP, C#), adds language-server-grade type inference that catches imports, generics, and inheritance chains that pure syntax parsing would miss.

Indexing is fast: Django’s 49,000-node codebase takes about 6 seconds on an M3 Pro; the Linux kernel’s 28 million lines of code take 3 minutes. After that initial pass, everything persists in SQLite and re-indexes automatically as files change. There is no re-indexing every time you start a new Claude Code session. The graph is already there.

Once indexed, queries run in under a millisecond. The 14 exposed MCP tools cover the structural questions agents ask most: search_graph

finds functions or classes by name or regex, trace_path

walks call chains to any depth, get_architecture

returns a full overview of routes and hotspots, and query_graph

accepts raw Cypher graph queries for custom traversal. Dead-code detection, clone detection via MinHash, and change-impact analysis that maps a git diff to its blast radius round out the toolset.

One Command. Zero Dependencies. Zero Cloud. #

Installation on macOS or Linux is a single line:

curl -fsSL https://raw.githubusercontent.com/DeusData/codebase-memory-mcp/main/install.sh | bash

The installer auto-detects whichever AI coding agents you have installed — Claude Code, Codex CLI, Gemini CLI, Zed, OpenCode, Antigravity, Aider, KiloCode, VS Code, OpenClaw, Kiro — and writes their MCP configuration entries automatically. No JSON editing required. Windows users get a PowerShell equivalent; npm, pip, Homebrew, Scoop, and AUR packages are also available.

The privacy story is unusually clean for this space. The tool is a single static C binary: no Docker, no runtime, no external APIs, no embedded LLM. Code never leaves your machine. All releases carry SLSA Level 3 cryptographic build provenance, are signed with Sigstore cosign, and are scanned by 70+ antivirus engines before release. For teams working with sensitive codebases, the combination of MIT license and 100% local processing is hard to argue with.

The Honest Trade-Off #

codebase-memory-mcp is not a universal replacement for agent file-reading. The arXiv evaluation puts answer quality at 83% versus 92% for file-based agents — the gap comes from what a knowledge graph can and cannot capture. Semantic questions (“why does this function exist?” or “what is the business logic here?”) still need the agent to read actual source. Macro-heavy C codebases hit a harder wall: preprocessor directives do not appear in the AST, and quality scores drop to around 0.58 on those repos.

However, the tool is purpose-built for structural queries — call graphs, dead code, route maps, cross-service dependency chains — and it excels at exactly those. Use it for structural navigation. Let your agent read files for reasoning. The two approaches are complementary, not competing.

Key Takeaways #

codebase-memory-mcp reduces token consumption on structural code queries by up to 121x — five typical questions drop from 412,000 tokens to 3,400 tokens.
It uses tree-sitter AST parsing across 158 languages plus optional LSP type inference, stored in local SQLite. Queries run in under 1 millisecond.
One install command auto-configures 11 AI coding agents simultaneously — no JSON editing, no Docker, no cloud APIs, 100% local and MIT licensed.
Answer quality is 83% vs 92% file-based on structural queries; semantic reasoning and macro-heavy C still require file reading.
16,000+ GitHub stars, trending #1 today — the MCP ecosystem is producing genuinely useful infrastructure, not just agent wrappers.

source & further reading

byteiota.com — original article CVE-2026-12957: Amazon Q Silently Stole Your AWS Keys Kimi K2.7-Code: Open-Source 1T Coding Agent, 30% Fewer Thinking Tokens Tokenmaxxing Killed AI Budgets — What’s Replacing It

~/api · this article 200

$curl api.wpnews.pro/v1/news/codebase-memory-mcp-cuts…

Read original on byteiota.com → byteiota.com/codebase-memory-mcp-cuts-ai-agent-t…

mentioned entities

DeusData

codebase-memory-mcp

GitHub

tree-sitter

SQLite

Claude Code

Codex CLI

Gemini CLI

metadata

slugcodebase-memory-mcp-cuts-ai-agent-token-use-by-121x

topic#ai-agents

secondary3 topics

sentimentpositive

canonicalbyteiota.com

navigation

← prevAndroid 17 Bakes AI Into Pixels …

── more in #ai-agents 4 stories · sorted by recency

dev.to · 29 Jun · #ai-agents

Building Multi-Agent Systems with Python: Orchestration Patterns That Work

dev.to · 29 Jun · #ai-agents

I Built an AI-Powered Smart Contract Auditor Looking for Feedback from Developers

dev.to · 29 Jun · #ai-agents

Open-Source Customizable Compliance

gist.github.com · 29 Jun · #ai-agents

Fix: This model is not supported when using X-OpenAI-Internal-Codex-Responses-Lite

── more on @deusdata 3 stories trending now

wpnews · 28 May · #ai-startups

[AINews] Cognition raises $1B in $26B Series D

wpnews · 5 Jun · #ai-agents

Miasma Worm Targets AI Coding Agents via GitHub Repos

wpnews · 28 Jun · #ai-agents

OpenCode v1.17: Session Snapshots Undo Your AI Agent

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required