{"slug": "six-sessions-at-qcon-ai-boston-2026-that-take-productionizing-ai-seriously", "title": "Six Sessions at QCon AI Boston 2026 That Take Productionizing AI Seriously", "summary": "Six sessions from the upcoming QCon AI Boston 2026 conference that focus on the practical challenges of moving AI prototypes into reliable production systems. Key topics include managing latency and performance regressions in agentic systems, building organizational context layers for coding agents, and implementing control planes and evaluation frameworks to ensure reliability and auditability in production AI agents. The sessions also cover consolidating shared infrastructure, such as LLM gateways and batch inference platforms, to avoid redundant work across teams.", "body_md": "QCon AI Boston 2026 (June 1–2 at Boston University's George Sherman Union) is two weeks away and nearly sold out. The full schedule contains 40+ sessions. Six sessions are highlighted below that anchor on the same pressing challenge: what AI engineering looks like after the demo, when teams have spent two years finding out which parts of the prototype don't hold up.\nKeeping ChatGPT Fast in the Agentic Era\nMartin Spier, ChatGPT Performance @OpenAI\nIn this day-one keynote, Martin Spier takes on a common misconception: AI application latency is not simply a GPU problem.\nA single user request can pass through client work, conversation loading, context assembly, tokenization, routing, inference, streaming, and observability. Any one of those layers can become the bottleneck.\nThe second half of the problem is newer. Agentic coding lets teams ship faster, which also means performance regressions can accumulate faster. Martin will cover how OpenAI is moving performance engineering toward agent-operated investigation, with telemetry and tooling that agents can read directly.\nContext Engineering at LinkedIn: How We Built an Organizational Context Layer for AI Agents with MCP\nAjay Prakash, Senior Staff Software Engineer @LinkedIn\nCoding agents work well out of the box until they have to do real work inside a specific company.\nThey do not know your services. They do not know your internal frameworks. They do not know which data systems matter, which workflows are standard, or which conventions have built up over years of engineering practice.\nAjay Prakash’s session looks at how LinkedIn approached that problem with CAPT, an MCP-based context layer for AI agents. The architecture matters, but the more useful part may be the organizational deployment story: what happened when LinkedIn tried to roll MCP out across engineering, what did not work first, and how the system evolved. Reported results include 70% faster issue triage and 500+ community-authored skills.\nThe Agent Harness: Control Planes, Invariants, and Approval Boundaries for Production AI Agents\nVinoth Govindarajan, Member of Technical Staff @OpenAI\nVinoth Govindarajan’s talk has one of the clearest positions in the program: agents may appear autonomous, but reliability comes from the harness around the model.\nThat harness includes control planes, session state, single-writer execution, throttling, tool boundaries, approval paths, and auditability. These are not model features. They are systems concerns.\nThe session uses OpenClaw as a case study, but the underlying mental model is portable: events enter the system, state is rehydrated per session, execution is constrained, tools are bounded, and every important action leaves an audit trail.\nBuilding Reusable Evaluation Frameworks for Agentic AI Products\nSusan Chang, Principal Data Scientist @Elastic\nSusan's team has been running a user-facing AI agent in production for almost two years. That duration matters because most production agent stories in circulation right now are no older than six months.\nThe talk covers her team's eval methods, the centralized eval framework they built to reuse across other GenAI products, and the feedback loop from evals back into product improvement. Worth attending if you are still figuring out which eval pattern matches which failure mode in your own system.\nBuilding GenAI Platform at DoorDash\nSiddharth Kodwani, Tech Lead, AI Infrastructure @DoorDash, and Swaroop Chitlur, Staff Engineer / Engineering Manager, Machine Learning Platform @DoorDash\nDoorDash’s platform story begins with a familiar failure mode: every team started rebuilding the same LLM plumbing.\nRetry logic. Fallback mechanisms. Cost tracking. Prompt versioning. Batch processing. None of it was the product, but all of it was necessary.\nThis talk explores how DoorDash consolidated that work into shared platform components, including an LLM Gateway, a Batch Inference platform, and an Agentic Gateway. The interesting question is not only how those systems were built, but when shared infrastructure helps and when it becomes another layer of overhead.\nPrompt to Prod: Engineering an Autonomous SDLC at Scale\nAndrew Swerdlow, Sr. Director of Software @Roblox\nMore generated code does not automatically mean faster software delivery.\nThat is the problem this session takes on. AI coding tools can increase output, but production software still has to be reviewed, migrated, maintained, tested, and shipped safely. Roblox’s approach is to treat the SDLC itself as the system to redesign, with autonomous agents handling codebase migrations and maintenance work.\nThe session introduces Exemplar Alignment, an approach for grounding agents in expert engineering judgment. The harder question is one many teams will face soon: how do you measure quality when an agent generated the code, but a human still owns the production outcome?\nQCon AI Boston 2026 runs June 1–2 at Boston University, George Sherman Union. The full schedule and registration are available at boston.qcon.ai.", "url": "https://wpnews.pro/news/six-sessions-at-qcon-ai-boston-2026-that-take-productionizing-ai-seriously", "canonical_source": "https://www.infoq.com/news/2026/05/qconai-boston-2026-talks/?utm_campaign=infoq_content&utm_source=infoq&utm_medium=feed&utm_term=global", "published_at": "2026-05-21 15:15:00+00:00", "updated_at": "2026-05-21 15:33:25.045360+00:00", "lang": "en", "topics": ["artificial-intelligence", "large-language-models", "developer-tools", "enterprise-software", "data"], "entities": ["QCon AI Boston 2026", "OpenAI", "Martin Spier", "LinkedIn", "Ajay Prakash", "ChatGPT", "MCP"], "alternates": {"html": "https://wpnews.pro/news/six-sessions-at-qcon-ai-boston-2026-that-take-productionizing-ai-seriously", "markdown": "https://wpnews.pro/news/six-sessions-at-qcon-ai-boston-2026-that-take-productionizing-ai-seriously.md", "text": "https://wpnews.pro/news/six-sessions-at-qcon-ai-boston-2026-that-take-productionizing-ai-seriously.txt", "jsonld": "https://wpnews.pro/news/six-sessions-at-qcon-ai-boston-2026-that-take-productionizing-ai-seriously.jsonld"}}