cd /news/ai-agents/rise-of-the-software-factory · home topics ai-agents article
[ARTICLE · art-45986] src=terezatizkova.com ↗ pub= topic=ai-agents verified=true sentiment=↑ positive

Rise of the Software Factory

Factory co-founder and CEO Matan Cohen presented the concept of a software factory at the AI Engineer World Fair in San Francisco, defining it as the full autonomous cycle of software development including specification, build, validation, deploy, and learning. He outlined three essential properties—agnostic, autonomous, and always improving—and noted that the infrastructure to build such factories only became viable with advances from 2021 to 2025.

read4 min views15 publishedJun 30, 2026
Rise of the Software Factory
Image: Terezatizkova (auto-discovered)

Back to blog Everyone is suddenly talking about the software factory. Few have built one. Here's what it actually means, and the three properties every software factory must have.

I gave this talk at AI Engineer World Fair in San Francisco, June 2026. Everyone is suddenly talking about the software factory. Few have built one. Fewer still show the process. This post walks through what we've learned building Factory over the past two years.

No one knows what software factory means #

The term gets thrown around loosely. Some people mean "coding agent." Others mean "a consulting project with AI bolted on." Neither is right.

A software factory is the whole cycle of developing software that runs autonomously. Not just writing code. The spec, the build, the validation, the deploy, the learning. All of it.

The numbers back this up. Firms that rewire around AI (not just adopt it) see 2x revenue according to a16z's study of 515 startups. Accenture's cash-flow multiple compressed from 30x to 10x as the "AI implementor" model lost its sheen. You can't bolt AI onto a broken process and call it a factory.

The software factory wasn't possible two years ago #

You needed Copilot (2021), ChatGPT (2022), GPT-4 and bigger context windows (2023), better reasoning (2024), and persistent environments with long-running missions (2025). Every layer had to exist before the factory could.

Three properties every software factory must have #

We think about it as three properties. The same properties you'd want from the best engineering team you ever managed: agnostic, autonomous, always improving.

1. Agnostic #

Token optimization is winning over just cost saving. Better default models, cache hit rates going from 5% to 60%, and routing. The goal isn't fewer tokens used, it's fewer tokens wasted.

Routing isn't just about cost. It gives you reliability (failover between providers), speed (pick the fastest model for simple tasks), and cost efficiency without sacrificing quality.

The magic is in the classifier. Five signals (message content, recent tools, repo size, language mix, difficulty) produce a single quality score per model. That's it. No magic, just a well-tuned classifier.

We pass caching discounts directly to end users. The mechanism is prefix caching: your system prompt, tools, and skills stay the same every turn. The transformer already computed the KV cache for that prefix. Why make you pay again? Turn 2 is 10x cheaper than turn 1.

2. Autonomous #

Loops have always been here, they just moved a level up. Mathematical induction, code loops, training loops, ReAct tool loops, and now software factory loops. The core is always the same: iterator, exits, entry conditions. Even for a nondeterministic agent, the scaffolding is still the same primitives.

Task length doubles roughly every 7 months, but only at 50% reliability. The real challenge isn't making agents do more. It's making them do more reliably. That's why validation is a separate concern from generation.

Neither validator wrote the code it's judging. The Scrutiny Validator reads the code and trajectory: tests, typecheck, lint, and code review. It sees the code but didn't write it. The User-Testing Validator is a true black box: it never reads the source. It drives the running app via computer-use and checks behavior against the contract.

Context is the elephant in the room. You can't feed an agent your entire codebase. The deferred context engine keeps a compact capability index and loads full schemas on demand, saving 15-50% of tokens depending on tool count.

3. Always improving #

The gap between AI leaders and laggards has been 12x in the last three years, and it continues growing. Structure matters. Without it, AI makes code worse.

Agent readiness is measurable. We score repositories on how well they support autonomous work: linting, type safety, test coverage, documentation. These aren't vanity metrics. They directly predict how well an agent performs on your codebase.

Stop repeating to agents how you want things done. Plugins bundle skills, commands, and tools into one install. AutoWiki generates always-fresh docs from your codebase. The packaging layer ships droids, hooks, and MCP servers as versioned packages. The context engine uses deferred context, scoped tools, and structured user preferences. All of this compounds on your own data.

Where we're heading #

We move up, towards deciding what to build, instead of how. The factory handles the how.

Go touch some grass.

── more in #ai-agents 4 stories · sorted by recency
── more on @factory 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/rise-of-the-software…] indexed:0 read:4min 2026-06-30 ·