Ultracoding: The Next Frontier


Source: jay.ai · Published 2026-06-15T23:53Z


Ultracoding, a new paradigm where AI agents programmatically spawn copies of themselves via code execution, is enabling massive multi-agent hierarchies that scale software development to unprecedented levels. Recent demonstrations include large-scale code refactors by Bun, Monty, and Cursor, achieved through swarms of agents working in parallel. The approach, which leverages code mode for dynamic tool composition, promises exponential productivity gains but requires better UX for human oversight.


The successor to vibe coding is ultracoding: let agents programmatically spawn copies of themselves via code execution. Dynamically spin up multi-agent hierarchies in a task-dependent manner and, in doing so, scale up to previously unheard-of tasks. Ride the dragon of exponential productivity.

It feels like the jump from single-threaded scripts to MapReduce and Spark: fan out across many workers, add reduce/verify steps, and get orders of magnitude higher throughput.

This is what the future of building software is going to look like. Meta-harnesses like Claude workflows are the path to scaling up to massive multi-agent hierarchies capable of a fundamentally new category of tasks, in software and beyond.

# Operating at a new scale

This pattern of LLMs recursively invoking themselves has previously demonstrated impressive results on academic benchmarks - see RLMs.

Recently, however, we've seen several impressive demonstrations in the wild in rapid succession, specifically for large code refactors and 0-1 projects.

Recent massive refactors demonstrated in the wild:

- Bun's refactor from [Zig to Rust](https://github.com/oven-sh/bun/pull/30412)
- [Monty refactor](https://github.com/pydantic/monty/pull/500) to subprocess pool
- Cursor [building a browser from scratch](https://cursor.com/blog/scaling-agents) with a swarm of agents

Exact implementation details for the above are light, but we can infer that each was accomplished via a swarm of agents working in parallel, managed by a small number of humans in a custom harness. The commonality is that each task has high test coverage and therefore lends itself to horizontally-scalable "ralph-loops" (now a first-class primitive in tools like Codex's /goal) and human verification.
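The loop described above can be sketched as follows, assuming a "ralph-loop" means re-running the same task until the test suite accepts the result. This is an illustration only: `stub_agent`, `stub_tests`, and `run_swarm` are hypothetical names, and the stub agent simply succeeds on its third try rather than calling a model.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Optional, Tuple


def ralph_loop(
    attempt: Callable[[int], str],
    tests_pass: Callable[[str], bool],
    max_iterations: int = 5,
) -> Tuple[Optional[str], int]:
    """Retry the same task until the tests accept the candidate."""
    for iteration in range(1, max_iterations + 1):
        candidate = attempt(iteration)
        if tests_pass(candidate):
            return candidate, iteration
    return None, max_iterations


def stub_agent(task: str, iteration: int) -> str:
    # Hypothetical stand-in for an agent call: succeeds on the third try.
    return f"{task}:ok" if iteration >= 3 else f"{task}:broken"


def stub_tests(candidate: str) -> bool:
    # Hypothetical stand-in for running the project's test suite.
    return candidate.endswith(":ok")


def run_swarm(tasks: List[str], max_workers: int = 4) -> Dict[str, Tuple[Optional[str], int]]:
    """Scale horizontally: one independent loop per task, run in parallel."""

    def run_one(task: str) -> Tuple[Optional[str], int]:
        return ralph_loop(lambda i: stub_agent(task, i), stub_tests)

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(zip(tasks, pool.map(run_one, tasks)))
```

The tests are what make this safe to parallelize: each loop has an objective stopping condition, so a human only needs to review the candidates that passed (or the tasks that exhausted their budget).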

# Code Mode as a Multi-agent Substrate

A key enabler for this emerging pattern is agent proficiency at "code mode" - programmatically invoking tools via code execution.

The latest generation of LLMs is RL'd to operate specifically in this manner. It's a more efficient way to act on the world: an agent can compose bespoke bulk actions at runtime instead of issuing one tool call at a time, which lets agents effectively assemble their own tools.
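To make the contrast concrete, here is a minimal sketch of code mode, assuming tools are exposed to the model as plain Python functions. Everything in it is hypothetical: `FAKE_REPO`, `list_files`, `read_file`, and `AGENT_SCRIPT` are illustrative stand-ins, and the bare `exec` call is a placeholder for what would be a sandboxed runtime in practice.

```python
from typing import Callable, Dict, List

# Hypothetical in-memory "repository" so the example is self-contained.
FAKE_REPO: Dict[str, str] = {
    "a.py": "import os\nprint('a')\n",
    "b.py": "print('b')\n",
    "c.py": "import os, sys\n",
}


def list_files() -> List[str]:
    """Tool: list every file path in the repository."""
    return sorted(FAKE_REPO)


def read_file(path: str) -> str:
    """Tool: return the contents of one file."""
    return FAKE_REPO[path]


TOOLS: Dict[str, Callable] = {"list_files": list_files, "read_file": read_file}

# What a model might emit in code mode: one script that composes both tools
# into a bulk action, instead of one tool call per file.
AGENT_SCRIPT = """
result = [p for p in list_files() if "import os" in read_file(p)]
"""


def run_code_mode(script: str, tools: Dict[str, Callable]) -> object:
    """Run an agent-written script with the tools in scope; return `result`."""
    namespace = dict(tools)
    exec(script, namespace)  # Assumption: a real harness would sandbox this.
    return namespace.get("result")
```

With classic tool calling, the same job would take one `list_files` call plus one `read_file` call per file, each a separate model round trip. Here the agent writes the loop once and only the filtered result comes back.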

This pattern was introduced by Voyager, and Perplexity/Cloudflare/many others have since introduced code mode-oriented interfaces. OpenAI and Anthropic even expose this tool calling method in their APIs via simple config (1, 2).

Historically, multi-agent harnesses have been hard-coded and established an explicit hierarchy of agents with different roles and communication patterns. Ultracoding, like workflows, cedes this territory to the bitter lesson and acknowledges that agents can dynamically determine the best meta-harness at runtime. Infra-wise, this only requires the addition of a "spawn agent" tool within an existing (persistent) code mode execution environment.
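A rough sketch of what that addition could look like, under stated assumptions: `CodeModeEnv`, `spawn_agent`, `spawn_many`, and `stub_model` are hypothetical names, the stub model just uppercases its prompt instead of calling an LLM, and the file names in the orchestrator script are made up. The point is only that the hierarchy lives in the agent-written script, not in the harness.

```python
import threading
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


def stub_model(prompt: str) -> str:
    """Hypothetical stand-in for an LLM call."""
    return prompt.upper()


class CodeModeEnv:
    """Persistent namespace that agent-written scripts execute in."""

    def __init__(self, model: Callable[[str], str], max_agents: int = 16):
        self.model = model
        self.max_agents = max_agents
        self.spawned = 0
        self._lock = threading.Lock()
        # The only multi-agent infrastructure: two tools in the namespace.
        self.namespace = {
            "spawn_agent": self.spawn_agent,
            "spawn_many": self.spawn_many,
        }

    def spawn_agent(self, prompt: str) -> str:
        """Tool: run one sub-agent, subject to a global agent budget."""
        with self._lock:
            if self.spawned >= self.max_agents:
                raise RuntimeError("agent budget exhausted")
            self.spawned += 1
        return self.model(prompt)

    def spawn_many(self, prompts: List[str]) -> List[str]:
        """Tool: fan out sub-agents in parallel, preserving prompt order."""
        with ThreadPoolExecutor(max_workers=8) as pool:
            return list(pool.map(self.spawn_agent, prompts))

    def run(self, script: str) -> object:
        """Execute a script; variables persist across calls."""
        exec(script, self.namespace)  # Assumption: sandboxed in practice.
        return self.namespace.get("result")


# What an orchestrating agent might write: fan out, then reduce.
ORCHESTRATOR_SCRIPT = """
files = ["parser.zig", "lexer.zig", "vm.zig"]
drafts = spawn_many([f"port {f}" for f in files])
result = " | ".join(drafts)
"""
```

Because the namespace persists, a follow-up script can reuse `drafts` without re-running the sub-agents, which is what lets the orchestrator build up a verify step after the fan-out. The budget check is one simple way to keep a recursive spawn from running away.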

This ability, to spin up a harness in a task-dependent manner at runtime, has radically reduced the barrier to entry and means you can realistically chat your way to a massive refactor or ambitious 0-1 project.

# Scaling Human-Agent Hierarchies: The UX

Massive multi-agent hierarchies are unlocked from a capabilities perspective. The major barrier to widespread adoption is now the lack of good UX for humans in the loop.

As Swyx has noted, the UX patterns for ultracoding are nascent. There's no established way to view or triage incremental outputs. The two patterns that have dominated thus far are agent lists and Kanban boards, but this is clearly not a terminal state.

I think we will imminently move towards a model where the agent expresses a UI for human oversight as part of the meta-harness. This may look like hooking into an existing UI like ClickUp or Linear, or alternatively writing bare HTML in a completely bespoke workflow in case bulk approvals or triage is necessary for the human.
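As one illustration of the "bare HTML" option, the sketch below renders a bulk-approval page from a list of agent outputs. It is a guess at the shape of such a UI, not a described implementation: `render_review_page`, the item fields (`id`, `status`, `summary`), and the `/approve` endpoint are all hypothetical.

```python
from html import escape
from typing import Dict, List


def render_review_page(title: str, items: List[Dict[str, str]]) -> str:
    """Build a bare HTML form with one approval checkbox per agent output."""
    rows = []
    for item in items:
        item_id = escape(item["id"], quote=True)
        rows.append(
            "<tr>"
            f'<td><input type="checkbox" name="approve" value="{item_id}"></td>'
            f"<td>{item_id}</td>"
            f"<td>{escape(item['status'])}</td>"
            f"<td>{escape(item['summary'])}</td>"
            "</tr>"
        )
    return (
        "<!doctype html>\n"
        f"<title>{escape(title)}</title>\n"
        f"<h1>{escape(title)}</h1>\n"
        # Hypothetical endpoint that the harness would listen on.
        '<form method="post" action="/approve">\n'
        "<table>\n"
        "<tr><th>Approve</th><th>Task</th><th>Tests</th><th>Summary</th></tr>\n"
        + "\n".join(rows)
        + "\n</table>\n"
        '<button type="submit">Approve selected</button>\n'
        "</form>\n"
    )
```

Agent-written summaries are escaped before being embedded, since the page displays untrusted model output to the person approving it.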

In the fullness of time, agents will effectively dynamically code oversight applications for human orchestrators, directly hooked up to "workflows" and with bespoke approval and triage flows baked in.

# Ultrawork

I think about agents for general-purpose knowledge work and the analogies to code. From what I know of our customers at ClickUp it's obvious: this same pattern applies to many workstreams that emerge in recruiting, sales, project management, legal services, accounting, etc.

This pattern of dynamic multi-agent hierarchies will wash out over knowledge work more generally. Instead of babysitting chat loops, you spin up a bespoke app for the task on the spot, with a UI built to verify the task in aggregate. The stuff that lives in a spreadsheet today becomes an application the agent assembles for you.

Knowledge work time-lags code, so adoption will be incremental for everyone rather than a sudden flip. But the payoff is steep: the efficiencies are large, and once it works at scale you can take on a fundamentally new scope of work, not just the same tasks done faster.

The unlock is that UI. Give a human the right way to participate in and verify a large-scale job and the horizon of what an LLM can take on extends dramatically, not just inside code where the review flow is obvious, but everywhere. Excited to see conventions established for this in the back half of 2026.

source & further reading


Read original on jay.ai → www.jay.ai/blog/ultracoding-the-next-frontier
