Ask HN: What were your worst war stories bringing agentic applications into prod

wpnews.pro

cd /news/ai-agents/ask-hn-what-were-your-worst-war-stor… · home › topics › ai-agents › article

[ARTICLE · art-18905] src=news.ycombinator.com ↗ pub=2026-05-31T02:07Z topic=ai-agents verified=true sentiment=· neutral

Ask HN: What were your worst war stories bringing agentic applications into prod

A developer building a team of AI agents to generate reports from transcript data spent a month rewriting jobs as durable executions after cascading errors from failed API calls and memory issues broke the entire generation process with little visibility. The developer is now seeking advice on handling agent failures mid-process, the typical ratio of engineer-weeks spent on agent infrastructure versus logic, and whether to buy or build tools for durability, monitoring, and user progress reflection.

read1 min views21 publishedMay 31, 2026

For a bit of context, I’m currently creating a team of AI agents at work to generate reports by fanning out into a large amount of subagents to process a large amount of transcript data. When the analysis fails mid-way because of some individual step like an API call returns an error or the machine is out of memory, it would create cascading errors that break the entire generation with almost no visibility. I’ve just spent the past month rewriting the individual jobs as durable execution jobs on DBOS but just wondering if there are better solutions out there and if others encountered similar issues? And then there is the issue to reflect back the progress to the users which I’ve just been coding ad-hoc honestly… When an agent fails at step 9 of 12, how do you handle that?

Roughly how many engineer-weeks have you sunk into agent infrastructure (durability, monitoring, human-in-the-loop, live UI) vs. the actual agent logic? Curious if my ratio is normal.

For those who built this stuff in-house: was it ever a build-vs-buy conversation? What would a tool have had to do for you to buy instead of build? Do you currently pay for anything in your agent stack (LangSmith, Temporal, Braintrust, etc.)? What made that one worth a line item when others weren't and should I look into it too?

Comments URL: [https://news.ycombinator.com/item?id=48342441](https://news.ycombinator.com/item?id=48342441)

Points: 1

source & further reading

news.ycombinator.com — original article Ask HN: How do you do marketing in the age of slop? Ask HN: How are you productive with GPT 5.6 Sol? Ask HN: What Are You Building with AI?

~/api · this article 200

$curl api.wpnews.pro/v1/news/ask-hn-what-were-your-wo…

Read original on news.ycombinator.com → news.ycombinator.com/item?id=48342441

mentioned entities

DBOS

LangSmith

Temporal

Braintrust

Hacker News

metadata

slugask-hn-what-were-your-worst-war-stories-bringing-agentic-applications-into-prod

topic#ai-agents

secondary4 topics

sentimentneutral

canonicalnews.ycombinator.com

navigation

← prevLitterbox – Somewhat Isolated De…

next →Reviving Nudge: Building an AI-P…

── more in #ai-agents 4 stories · sorted by recency

news.ycombinator.com · 15 Jul · #ai-agents

Ask HN: What Are You Building with AI?

machinelearningmastery.com · 14 Jul · #ai-agents

LLM Evaluation Frameworks Compared: How to Actually Measure What Your Model Does

dev.to · 13 Jul · #ai-agents

Your AI agent says "done." Who checks that from outside the agent?

dev.to · 13 Jul · #ai-agents

The Evaluation Debt You Don't Know You Have: Why Agent Evals Fail in Production

── more on @dbos 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 23 May · #artificial-intelligence

AccessLens — a blind person's lanyard, powered by Gemma 4 on-device

wpnews · 21 May · #developer-tools

Antigravity CLI: A Hands-On Guide to Google's Terminal Coding Agent

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required