More parallel subagents made my pipeline slower. Here's the data.

wpnews.pro

cd /news/ai-agents/more-parallel-subagents-made-my-pipe… · home › topics › ai-agents › article

[ARTICLE · art-30621] src=dev.to ↗ pub=2026-06-17T06:16Z topic=ai-agents verified=true sentiment=· neutral

More parallel subagents made my pipeline slower. Here's the data.

A developer found that adding more parallel subagents to an ad-creative analysis pipeline increased latency rather than decreasing it, because context assembly before the LLM call became the bottleneck. With 8 subagents, aggregation consumed 61% of wall-clock time, and serializing JSON blobs took over 4 seconds. The fix was to have subagents write summaries to R2, reducing aggregation context from ~6,400 to ~1,100 tokens and cutting monthly costs from $207 to $38.

read2 min views30 publishedJun 17, 2026

Adding a 7th subagent pushed my orchestrator latency from 22s to 31s — the opposite of what I expected.

I'd been running a fanout pattern in my ad-creative analysis SaaS: spawn N subagents in parallel, collect results, merge into one verdict. The parallel part worked fine. Individual subagents finished in 9–12 seconds regardless of how many I spawned. The problem was everything after that.

With 8 subagents, each returning ~800 tokens of analysis, the orchestrator was assembling a 6,400-token context before it could even call the LLM once. On Cloudflare Workers, serializing 8 JSON blobs into a single prompt string was taking 4+ seconds of pure CPU time before the first API call fired. The log entry that made it obvious:

[worker:orchestrator] WARN
  aggregate_context_size=52480 bytes
  serialize_duration=4312ms
  reason="context_assembly_backpressure"

Measured across 3 weeks of production data:

Subagents	Total latency	Aggregation share
2	14.2s	18%
4	16.8s	31%
6	22.4s	47%
8	31.1s	61%

At 6+ subagents, aggregation consumed more than half the wall-clock time. The fanout was fast. The funnel was the bottleneck.

The fix wasn't reducing parallelism — it was changing what the orchestrator actually reads. Instead of passing full results to the aggregation LLM call, each subagent now writes to R2 on completion. The orchestrator pulls only a three-field summary struct per agent (verdict

, confidence

, top_signal

). Eight agents still produce eight files, but the aggregation context dropped from ~6,400 tokens to ~1,100. Monthly cost for that one pipeline step: $207 → $38.

The counterintuitive part: the bottleneck wasn't the LLM. It was the context assembly happening before the LLM even got called.

I wrote up the full breakdown — including the R2 chunking pattern, the D1 counter approach for tracking partial completions without polling, and the KV-based loop guard for failed aggregation retries — over on riversealab.com.

source & further reading

dev.to — original article From Software Engineer to AI Engineer - Part 3: Giving it a hand The trillion-dollar AI hole: where is the revenue? AI collapsed my job into three roles and I had to relearn all of them

~/api · this article 200

$curl api.wpnews.pro/v1/news/more-parallel-subagents-…

Read original on dev.to → dev.to/riversea/more-parallel-subagents-made-my-…

mentioned entities

Cloudflare Workers

LLM

metadata

slugmore-parallel-subagents-made-my-pipeline-slower-here-s-the-data

topic#ai-agents

secondary2 topics

sentimentneutral

canonicaldev.to

navigation

← prevShow HN: Fig0 – 0-Manual AI Scie…

next →Disbatch – turn a PowerShell/bat…

── more in #ai-agents 4 stories · sorted by recency

pub.towardsai.net · 1 Aug · #ai-agents

Your AI Agent Keeps Retrying. It’s Costing You $5,000 a Year.

dev.to · 1 Aug · #ai-agents

Why Your Web Scrapers Keep Breaking (And How to Build Self-Healing TypeScript Agents Using LLMs and Playwright)

byteiota.com · 1 Aug · #ai-agents

OpenAI Atlas Shuts Down August 9: Migration Guide

github.com · 1 Aug · #ai-agents

Rcarmo/umcp: A micro MCP core (asyncio and synchronous)

── more on @cloudflare workers 3 stories trending now

wpnews · 30 Jul · #artificial-intelligence

Microsoft and Meta Earnings Show Different AI Spending Pressures

wpnews · 1 Aug · #ai-agents

Quality Isn't Accidental — Maker/Checker Separation and Automated Validation

wpnews · 1 Aug · #developer-tools

I Built a Portable AI Skill That Safely Upgrades .NET Applications

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required