Making your docs site agent-readable: llms.txt, MCP, and the .well-known files that actually matter

wpnews.pro

cd /news/ai-agents/making-your-docs-site-agent-readable… · home › topics › ai-agents › article

[ARTICLE · art-27114] src=dev.to ↗ pub=2026-06-14T17:13Z topic=ai-agents verified=true sentiment=↑ positive

Making your docs site agent-readable: llms.txt, MCP, and the .well-known files that actually matter

OrchestKit's documentation site implements a stack of standard files—llms.txt, OpenAPI specs, MCP endpoints, and .well-known identity files—to make its docs machine-readable for AI agents. The project, a free MIT-licensed plugin for Claude Code, publishes these resources at predictable paths so agents can fetch structured data without scraping HTML. The approach includes agent-card.json, a schema.org graph, and explicit robot.txt allowances for AI crawlers.

read2 min views23 publishedJun 14, 2026

AI agents increasingly read your docs instead of a human. If your documentation site only emits HTML for a browser, an agent has to scrape and guess. There's a better surface — and most of it is a handful of small, standard files. Here's the full stack we ship on the OrchestKit docs site, why each piece exists, and how to verify it.

llms.txt

— the agent's table of contents A plain-text index at /llms.txt

: what the product is, its constraints, and a link map to every machine-readable resource. Keep it under ~30k chars; put the exhaustive page list in /docs/llms.txt

and the full corpus in /llms-full.txt

. The win: an agent gets oriented in one fetch instead of crawling.

Append .md

to any page URL (or send Accept: text/markdown

) and return the raw Markdown. Agents get clean tokens; humans still get the rendered page.

Even a docs site has an API surface (search, page fetch). Publish an OpenAPI document at a predictable path so an agent can call it without reverse-engineering. Pair it with RFC 9727 — a /.well-known/api-catalog

linkset that enumerates every API entry point.

The Model Context Protocol lets agents call your tools natively. We expose a read-only MCP server over Streamable HTTP at /api/mcp

plus a discovery server-card.json

. Two tools — search docs, get a doc by id — are enough to be useful.

.well-known

identity files agent-card.json

(A2A): declares your agent skills.agent-skills/index.json : the Agent Skills Discovery RFC, with a SHA-256 digest per skill so a consumer can verify it.oauth-protected-resource

(RFC 9728): if your API is anonymous, authorization_servers

is a positive signal, not an omission.Emit a schema.org

graph (Organization

, SoftwareApplication

, WebSite

) linked by @id

, with sameAs

pointing at the registries that already verify you (GitHub, your package registry, Wikidata). One canonical Organization block, reused everywhere, so the graph never sees conflicting identifiers. Never fabricate an aggregateRating

— surface real signals (e.g. GitHub stars as an InteractionCounter

) instead.

robots.txt

Explicitly allow the named AI crawlers you want (GPTBot, ClaudeBot, OAI-SearchBot, Google-Extended…), and emit a Content-Signal

directive. Link your sitemap and a schema-map.

curl -s https://yoursite/llms.txt , fetch each .well-known

path, and run your JSON-LD through a structured-data validator. If you build on Claude Code, the open-source OrchestKit docs site implements every item above — the source is on GitHub, MIT-licensed, and you can read the route handlers directly.

I maintain OrchestKit (a free, MIT plugin for Claude Code, 111 skills/37 agents/210 hooks). The agent-discovery surface described here is what its docs site ships today.

source & further reading

dev.to — original article Testing Non-Deterministic LLM Pipelines in CI: A Contract-Based Approach 🌱 MyZubster: The Decentralized Ecosystem to Map the World with Monero and AI Building Production AI Systems(Part 4)

~/api · this article 200

$curl api.wpnews.pro/v1/news/making-your-docs-site-ag…

Read original on dev.to → dev.to/yonyonai/making-your-docs-site-agent-read…

mentioned entities

OrchestKit

Claude Code

GitHub

OpenAPI

Model Context Protocol

GPTBot

ClaudeBot

OAI-SearchBot

metadata

slugmaking-your-docs-site-agent-readable-llms-txt-mcp-and-the-well-known-files-that

topic#ai-agents

secondary4 topics

sentimentpositive

canonicaldev.to

navigation

← prevZerostack v1.5 – Lightweight Uni…

next →Multistack – Lightweight TUI for…

── more in #ai-agents 4 stories · sorted by recency

dev.to · 30 Jul · #ai-agents

SEO Tools in Claude Code: Comparing Hosted and Local MCP

runtimewire.com · 30 Jul · #ai-agents

X launches encrypted Chat API for bots and AI agents

dev.to · 30 Jul · #ai-agents

Building Production AI Systems(Part 4)

arxiv.org · 30 Jul · #ai-agents

(Im)Paired Programming: Coding Agents Improve Productivity but Harm Understanding

── more on @orchestkit 3 stories trending now

wpnews · 28 Jul · #large-language-models

How to Download and Run Kimi K3 Open Weights

wpnews · 29 Jul · #ai-safety

News Summary for July 29, 2026

wpnews · 30 Jul · #artificial-intelligence

Apple to join Samsung in AI glasses race against Meta

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required