Q&A: Temporal aims to be the reliability backbone for an agentic AI economy

Temporal, a Bellevue, WA firm founded in 2019, aims to be the reliability backbone for an agentic AI economy through its durable execution technology, which resurrects failed computing processes on other hosts with a 100% durability guarantee. The company, which powers infrastructure for Coinbase, Airbnb, and OpenAI, addresses the growing threat of failures as AI shifts from LLMs to autonomous agents. Co-founder Samar Abbas discussed how Temporal's distributed-systems expertise, honed over decades at Amazon, Microsoft, and Uber, now scales to stabilize AI operations in regulated industries.

As AI shifts from output-generating large language models LLMs to armies of agents taking actions on their own, there is a growing threat that failures could affect system reliability. Temporal, a Bellevue, WA firm founded in 2019, hopes to solve that problem by stabilizing AI and long-running computing processes through “durable execution,” a technology that reliably resurrects failed computing processes on other hosts. If a machine crashes in the middle of agentic AI transactions, the company resurrects the function on a different host so it can continue exactly where it left off. Temporal says its durable execution process comes with a 100% durability guarantee. The company was solving these kinds of distributed-systems problems long before the generative AI genAI rush, and today it powers infrastructure from Coinbase to Airbnb to OpenAI. For IT decision-makers, Temporal’s offerings promise to bring reliability to AI operations — especially in regulated industries. Co-founder Samar Abbas spoke recently with Computerworld about his company’s technology, AI reliability, execution and what IT leaders need to keep in mind when evaluating and deploying agentic AI. AWS How did Temporal come to be a backbone powering AI system? “ OpenAI recently announced we power many of their products underneath the cover — that’s where Temporal comes in. “A couple of years ago, as LLMs got smarter, use cases were about generating outputs. Now, the industry is moving into agentic solutions, where AI isn’t just generating outputs, it’s taking actions. The longer these systems run, the more tools they invoke and the more value they create. “They’re evolving into a whole army of agents coordinating to get complex tasks done. This starts to look like distributed systems. We become that execution authority, the reliability backbone for those agentic systems to get their tasks done from A to Z.” What is Temporal doing in the backend to keep them stable? “It’s a new paradigm for building applications, we call that durable execution. If you write a function and a failure happens, you lose all of your state. When a machine crashes, we maintain the state. We seamlessly resurrect that function on a different host and continue executing exactly where it left off, without you as a developer writing a single line of code. “It’s not just a cache, it’s an operating system guaranteeing execution in the presence of failure. We provide 100% durability guarantees behind it. People can use it for mission-critical use cases like transcribing doctor conversations, where even losing one word has big implications.” Did this challenge exist before AI? “My co-founder and I have been at it pretty much our entire careers, spanning almost three decades. We’ve been solving this problem of distributed systems, at Amazon, Microsoft and then Uber, before starting Temporal. We are definitely pre-AI. “As more workloads move to cloud and become distributed, it introduces this class of failures. This is the fifth time I’m building this system. I built simple workflow, then a durable task framework at Azure, then Cadence at Uber, now Temporal. The company is roughly seven years old, but the code base is over 10 years old, because we started at Uber and it’s been running in production ever since.” And now you tackle that same problem at AI scale? “The funny thing is, especially with AI, that same problem is now at such a massive scale, it’s gone on steroids, with agentic systems coming online and doing real work. We’ve been powering every Coinbase transaction, every Snap story, every Airbnb booking, every Yum Brands order — KFC, Pizza Hut, Taco Bell. “A couple of years ago, it looked like we’d over-invested in the scale and reliability of the platform. Then AI showed up in such a big way that now I’m asking the team: how do we scale another 10x, 100x, even 1000x? These agents are hitting the market in a way we couldn’t have imagined.” Can you give a concrete example of that 100% guarantee? “ When you’re moving money — debit or credit — imagine a failure happens after you’ve debited an account. Temporal guarantees execution of the entire function from start to finish, in the presence of all sorts of failures. At the end of the day, we’re in the business of selling reliability. “Developers today spend 80% of their time building resiliency into their systems by working with low-level primitives like queues, databases, retry mechanisms and durable timers, and stitching them together. What we provide eliminates that. And that reliability is exactly what’s becoming critical as enterprises move agents into production.” Why should CIOs and other IT leaders care about AI reliability? “These agents are now real, no longer prototypes or proofs of concept . They’re doing meaningful work in those organizations. Reliability and durability weren’t on the radar for CIOs 12 months ago. Now, it’s the blocker. Especially in regulated industries, if a transaction gets left in the middle, it has real regulatory concerns. “That’s what’s holding them back from transitioning into the AI stage. It comes up in pretty much every conversation, in the first 10 minutes I’m talking to a CIO.” CW: Where do the hardest problems sit today? “Security, governance, identity, authorization, authentication. This is where majority of the problems exist, and the ecosystem is so immature. Even things like MCP, I see organizations struggling to implement MCPs to build these agents. This is an area ripe for innovation and a lot of work, including here at Temporal.” What skills should developers and CIOs build for the AI age? “The biggest skill now is that developers need to be more product focused. Product-minded engineers are the ones who will thrive, because the whole software development flow, writing code by hand, debugging by hand, testing it, is going to look completely different. The job of an engineer is now building the system that builds the system. “Two things are happening in this AI world. First, a lot more engineers are coming into the fold, because companies like Replit or Lovable are making software development approachable for people who haven’t been professionally trained. Second, there’s going to be a lot more applications written than ever before. The problem space is shifting toward how we run all those applications safely and reliably, with all the guardrails. That’s where the industry is headed.”