The Core of a Coding Agent Is 128 Lines of Python. So I Built One From Scratch.

wpnews.pro

cd /news/artificial-intelligence/the-core-of-a-coding-agent-is-128-li… · home › topics › artificial-intelligence › article

[ARTICLE · art-35537] src=dev.to ↗ pub=2026-06-21T12:04Z topic=artificial-intelligence verified=true sentiment=↑ positive

The Core of a Coding Agent Is 128 Lines of Python. So I Built One From Scratch.

A developer built a coding agent from scratch in 128 lines of Python, demonstrating that the core loop powering tools like Claude Code and Cursor is surprisingly simple. The agent autonomously reads files, runs tests, diagnoses failures, fixes code, and re-runs tests without hard-coded steps. The project is open source under MIT license.

read4 min views1 publishedJun 21, 2026

128 lines of Python.

That's the entire core of a coding agent — the loop that powers tools like Claude Code and Cursor. I didn't believe it either, so I built one from scratch. Then I pointed it at a failing test, and it read the file, ran the test, saw the traceback, fixed the code, and re-ran it — choosing every step itself. No one hard-coded that.

It's open source (MIT), with a phased roadmap you can follow:

👉 github.com/osama96gh/coding-agent-from-scratch

I use coding agents every day. As an AI engineer, I think they're the breakout use case for LLMs right now. But using something and understanding it are different things.

Reading a production agent's source to learn the core is a trap — the essential logic is buried under prompt caching, retries, telemetry, and elaborate scaffolding. You can't see the engine for the bodywork.

So I built just the engine. No optimizations. Just the essence.

These surprised me enough that I re-counted:

Piece	Size
Entire REPL + agent loop + permission gate (`main.py` )
128 lines
The system prompt that steers all behavior (`prompts.py` )
19 lines
Tools — read, list, grep, edit, write, run_bash
6 files, smallest is 35
Whole project, incl. 2 swappable providers + streaming
~1,300 lines

The thing that feels like magic — an agent autonomously reading files, running your tests, fixing the failure, re-running — comes out of about a hundred lines of orchestration. The intelligence lives in the model. Your job is plumbing.

Strip away the streaming, the permission gate, and the UI, and the heartbeat of the whole thing is this:

conversation.append({"role": "user", "content": user_input})

while True:  # keep going until the model stops asking for tools
    turn = llm.call(conversation, tools=TOOL_SCHEMAS, system=SYSTEM_PROMPT)
    conversation.append(turn.to_message())

    if not turn.tool_calls:        # plain text → the model is done
        break

    for call in turn.tool_calls:   # otherwise, run each tool it asked for…
        result = run_tool(call.name, call.args)
        conversation.append({
            "role": "tool", "id": call.id,
            "name": call.name, "content": result,
        })

That's it. That's the agent.

main.py

", "run pytest

").The model decides which tool and in what order; the loop just keeps turning until the model stops asking.

An agent is just an LLM, a loop, and some tools. Everything else in this repo is refinement on top of those three.

This is also where "it can debug itself" comes from — for free. When the shell tool feeds exit codes and stderr back into the conversation, the model sees the failure on the next turn and proposes a fix. Nobody wrote if tests fail, edit the code

. It falls out of the loop.

One file each: read_file

, list_files

, grep

, edit_file

, write_file

, run_bash

Each is just a function plus a JSON schema describing its arguments — and that schema is all the model needs to know the tool exists and how to call it. "Tool calling" sounds advanced; it's really "here's a function signature, fill in the arguments."

run_bash

alone is almost a superpower — with a shell you can stand in for most of the others — which is exactly why an agent needs a permission gate.

These refinements sit on top of the core, and they're where most of the line count goes:

git status

runs unprompted while git push

still stops to ask. The difference between an assistant and rm -rf

roulette.That failing-test run from the top? I never scripted it. The model chose to read, run, diagnose, fix, and re-run entirely on its own — the same shape of behavior I pay for in Claude Code every day, out of ~128 lines I could read in a single sitting.

The gap between "toy" and "real" is smaller than the hype suggests. The production polish — caching, retries, sandboxing, a thousand handled edge cases — is genuine, hard engineering. But the core that makes an agent an agent is within any engineer's reach in an afternoon.

The repo is a phased roadmap — each phase runs on its own and teaches one concept, so you always have a working agent:

read_file

list_files

, grep

)edit_file

, write_file

)run_bash

) — where it gets powerful (and dangerous)A learning project: build a simple but real coding agent (think a tiny Claude Code / Cursor / Codex), step by step, from nothing — to understand how complex AI agents are actually structured under the hood.

The one-sentence mental model:An agent is just an LLM, a loop, and some tools.Everything else is refinement. ([source])

This repository is an educational, from-scratch Python implementation of a terminal coding agent. It shows the core mechanics behind modern AI coding tools: a model-driven agent loop, tool calling, file exploration, targeted code edits, shell command execution, permission checks, streaming responses, usage reporting, context compaction, and pluggable OpenAI/Gemini providers.

It is meant to be read, modified, and learned from. It is not a production coding agent, but a small reference implementation for understanding how production coding agents are structured under the hood.

Build it, break it, extend it (a new tool, a web UI, a third provider) — and tell me how it goes. The fastest way to stop an AI tool from feeling like magic is to build a small one yourself.

source & further reading

dev.to — original article Real World Tailwind CSS: Controlling the Special Cases (Part 2/2) Solstice Turing Simulation: An Interactive 3D Imitation Game Powered by Google Gemini Coding Agents: Moving From "Bash Mimics" to "AST Manipulators"

~/api · this article 200

$curl api.wpnews.pro/v1/news/the-core-of-a-coding-age…

Read original on dev.to → dev.to/osama_ghazal_96/the-core-of-a-coding-agen…

mentioned entities

Claude Code

Cursor

Osama

GitHub

Python

MIT

metadata

slugthe-core-of-a-coding-agent-is-128-lines-of-python-so-i-built-one-from-scratch

topic#artificial-intelligence

secondary4 topics

sentimentpositive

canonicaldev.to

navigation

← prevAutopia: Lincoln Nautilus is a p…

next →Tech Workers Invest Nights Learn…

── more in #artificial-intelligence 4 stories · sorted by recency

github.com · 21 Jun · #artificial-intelligence

Shall We? – A pre-engineering sanity check for AI-era feature bloat

dev.to · 21 Jun · #artificial-intelligence

CodeGraph Review 2026: This MCP Server Cut My AI Agent's Token Waste by 47%

grammar-of-coding-agents.pages.dev · 21 Jun · #artificial-intelligence

The Grammar of Coding Agents

dev.to · 21 Jun · #artificial-intelligence

How Graphify Stopped My Team from Burning Thousands of Tokens Per Query

── more on @claude code 3 stories trending now

wpnews · 20 Jun · #ai-agents

Amazon Bedrock AgentCore Memory: Build AI Agents That Remember

wpnews · 20 Jun · #artificial-intelligence

Microsoft is rewriting the economics of enterprise AI and the bill shock is just getting started

wpnews · 20 Jun · #artificial-intelligence

Big Tech redirects buybacks into AI capital spending

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required