Your LLM reads the whole file. It doesn't have to.

wpnews.pro

cd /news/developer-tools/your-llm-reads-the-whole-file-it-doe… · home › topics › developer-tools › article

[ARTICLE · art-33868] src=dev.to ↗ pub=2026-06-19T12:05Z topic=developer-tools verified=true sentiment=↑ positive

Your LLM reads the whole file. It doesn't have to.

A developer created md2idx, a CLI tool that splits Markdown files at heading boundaries into a JSON index and sections, enabling LLM coding agents to read only relevant parts instead of entire files. This reduces token consumption by 80-98% and improves answer quality by avoiding context window bloat. The tool is available on GitHub and includes a skill for autonomous agent use.

read4 min views1 publishedJun 19, 2026

Coding agents read specs, design docs, and long READMEs every day. Most of the time, they only need a few sections. Yet they load the entire file into context.

Here's a scenario that plays out constantly. You ask your agent to check the error handling section of a 5,000-line API spec. The agent opens the file, reads all 5,000 lines into its context window, finds the 80 lines it needs, and answers your question.

The result is correct. But the agent also consumed a large number of tokens on the 4,920 lines it didn't need. Repeat this for every file read in a session, and the waste compounds fast.

The cost isn't just tokens. A context window stuffed with irrelevant content makes the agent's answers worse.

When a human picks up a 300-page technical book, they don't read cover to cover to find the chapter on authentication. They flip to the table of contents, scan the chapter titles, and jump to page 47. LLMs can do the same thing.

Markdown documents have a built-in structure: headings. A # Title

followed by ## Section A

followed by ### Subsection A.1

creates a hierarchy that mirrors a book's table of contents.

Split a Markdown file at heading boundaries, and you get a natural "table of contents + sections" structure. Each heading starts a new section, the heading text becomes the index entry, and the section number becomes the address.

This is the idea behind md2idx, a CLI tool.

md2idx converts a Markdown file into JSON with two fields:

index

<# markers for depth> <serial>. <heading text>

sections

$ npx md2idx spec.md | jq -r '.index'
## 1. Authentication
## 2. Endpoints
### 3. GET /users
### 4. POST /users
## 5. Error Handling
## 6. Rate Limiting

The serial numbers match the array indices. To read the Error Handling section:

$ npx md2idx spec.md | jq -r '.sections[5]'
## Error Handling

When a request fails, the API returns a JSON error object with...
(just the content of that one section)

To read a heading and all its children together:

$ npx md2idx spec.md | jq -r '.sections[2:5][]'

For a 5,000-line spec where the agent needs 2 sections, context usage goes from ~5,000 lines to ~100 lines (20-line index + 80 lines of content). Depending on the document and which sections are needed, the reduction is typically 80–98%.

The output is designed to work with jq

. One-line JSON by default (pipe-friendly), --pretty

for formatted output. Reads from a file argument or stdin.

grep -nE '#{1,6} ' spec.md

gives you a list of headings. For simple cases, that works. But md2idx covers problems that grep can't solve:

jq '.sections[N]'

is all it takes===

/ ---

): invisible to grep's #

pattern#

inside code fences[link](url)

etc. as-is. md2idx strips markup in the index while preserving it in section contentWith the md2idx-read

skill, the agent autonomously handles everything from fetching the index to selecting sections.

jq

slicing

gh skill install oubakiou/md2idx md2idx-read --agent claude-code --scope project

npx skills add oubakiou/md2idx --skill md2idx-read --agent claude-code --yes

Once installed, the agent uses the skill proactively whenever it encounters a large Markdown file. No manual invocation needed — it reads the index first, picks sections, and skips the rest.

A fallback is included. If md2idx isn't available (network-restricted environments, permission issues), it falls back to grep

for headings and Read tool with offset/limit. Less accurate, but functional.

npx md2idx README.md | jq -r '.index'

npx md2idx README.md | jq -r '.sections[2]'

npx md2idx data.md | jq -r '.sections[4]' | grep Tokyo

npm install -g md2idx

md2idx has zero external dependencies — a self-contained line scanner, not a Markdown AST parser. It handles ATX headings (#

style), setext headings (===

/ ---

underlines), code fence skipping, and inline markup stripping.

md2idx is MIT-licensed and fully open source. If your LLM agents are reading entire large Markdown files, give it a try:

npx md2idx your-file.md | jq -r '.index'

gh skill install oubakiou/md2idx md2idx-read

If you've tried it in your agent workflow, I'd love to hear how it went — drop a comment below or open an issue on GitHub.

source & further reading

dev.to — original article NeevCloud unveils AI native sovereign SuperCloud at KubeCon India 2026 I Cut My AI Agent's Token Bill by 62% in One Weekend. Here's the Receipts. Finisma

~/api · this article 200

$curl api.wpnews.pro/v1/news/your-llm-reads-the-whole…

Read original on dev.to → dev.to/kiou_ouba_afbd120335456f3/your-llm-reads-…

mentioned entities

md2idx

GitHub

Claude Code

metadata

slugyour-llm-reads-the-whole-file-it-doesn-t-have-to

topic#developer-tools

secondary2 topics

sentimentpositive

canonicaldev.to

navigation

← prevThe GitHub Clone Farm That Beat …

next →EBA Warns AI Models Increase Cyb…

── more in #developer-tools 4 stories · sorted by recency

dev.to · 19 Jun · #developer-tools

A Few Months Ago, Agentic Development Felt Overwhelming

devclubhouse.com · 19 Jun · #developer-tools

The GitHub Clone Farm That Beat VirusTotal

dev.to · 19 Jun · #developer-tools

I Built an Open-Source Prompt Library for Developers, Creators, and AI Power Users

wired.com · 19 Jun · #developer-tools

Try One of macOS 27’s Best Features Right Now

── more on @md2idx 3 stories trending now

wpnews · 18 Jun · #large-language-models

ICYMI: ZAI launches GLM-5.2 open model with 1M context

wpnews · 18 Jun · #ai-chips

Apple and Intel join forces in Trump’s push to bring chipmaking home

wpnews · 18 Jun · #ai-agents

How to Automate Business Reports With an AI Agent Instead of Dashboards

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required