Backpressure is all you need A new approach called "backpressure" aims to make AI coding agents safer by forcing them to validate their own work before requiring human review. The method, inspired by systems engineering, uses automated tests, type checking, and other guardrails to prevent low-quality code from reaching human reviewers. This technique addresses the fundamental tension between letting AI agents run unattended—which risks bugs and unmanageable pull requests—and requiring humans to review every step, which defeats the purpose of automation. Backpressure is all you need There are two obvious ways to use coding agents. Both are bad. The first is to let the LLM run unattended and hope the repository survives. This is fast, exciting, and stupid. It leads to bugs, confused changes, and a flood of PRs that humans cannot review quickly enough, at least not without eventually lowering their standards and merging things they do not really understand. The second approach is to treat the agent like glorified autocomplete and force a human to review every tiny step. This is safer, but slow enough to partially defeat the purpose of using an agent in the first place. If you still have to steer every minor decision, you have not delegated much. In this post, I’ll cover a third, not-so-obvious approach: building ways for the agent to validate more of its own work before a human has to step in. The goal to make longer unattended sessions safe enough to be useful without fully removing the human from the loop. It should also reduce the number of low-quality PRs your teammates have to review for details the agent should have caught itself. What backpressure is and how it can help In systems engineering, backpressure is the mechanism by which a downstream component signals upstream that it can't accept more work, forcing the producer to slow down, buffer, or shed load. Whenever there's no backpressure, the producer is free to generate work at will, and the consumer is forced to absorb the mismatch. Then, the consumer either falls behind, breaks under the load, or speeds up by cutting corners. In our work, backpressure usually takes the form of a machine refusing work the producer hasn't cleaned up yet. The simplest version of that is an automated test: you don't usually submit a PR with failing tests. Ideally, your colleagues shouldn't even review a PR until all tests are green. In that case, the test suite is the backpressure mechanism for a human to clean up their code before asking for a review. In addition to automated testing, types can also be a powerful form of backpressure. Remember writing plain JavaScript, for example. Back in those days, it was easy to wire a component with the wrong prop shape and only find out much later, when someone clicked a button and got hit in the face with props.onSubmit is not a function . Before TypeScript, the only way to catch the bug before production was for a reviewer to follow the prop, follow the callback, check the caller, check the caller's caller, and hope the mismatch was visible in the diff. Some of us learned a lesson from these difficult times and started using types to make impossible states impossible https://www.youtube.com/watch?v=IcgmSRJHu 8 . Others looked at the same lesson, nodded solemnly, and kept passing dictionaries around, but I guess that's a story for another day. Anyway, the point here is that TypeScript added backpressure and made the producer confront the consumer's expectations before moving the code forward. Now, if a component needs a function, you can't casually hand it a string, an object, or nothing at all and hope the reviewer catches the bug. Instead, the machine will refuse the work at the boundary where you introduced the type mismatch, with no need for an expensive human review. As time passed, we kept adding more automated guardrails to the process, like linters, end-to-end tests, canary releases, and so on. We then bundled a bunch of those guardrails into CI pipelines. That way, we could stop reviewing code that wasn't even close to being ready, and we could focus our human attention on the things that machines can't check, like readability, complexity, and overall design. Today, this lesson is easy to recognize when we're talking about compilers, automated tests, CI, and, for the true believers among us, types. However, it seems much harder to recognize when the producer is an LLM writing code faster than anyone can read it. That's why, most of the time, the LLM's backpressure is still us . We look at the code in the editor, ask the model to fix the parts that smell wrong multiple times , open the PR, fix any failing checks, and then someone looks at the same code again with a more serious face. Often, for extra safety, we install a review bot to check the first AI's code. Then, we copy the bot's feedback back into the coding agent. That way, we have ironically promoted ourselves to an expensive clipboard doing the mechanical work between two machines. The next step for AI-aided software development is to stop making humans the default backpressure in the AI loop . We need tests that fail early, types that push back, benchmarks that catch regressions, and review agents that send bad patches back before they become a human's problem. That machinery is what makes delegation possible, and frees up our time to focus on higher-level feedback and design decisions instead of low-level correctness and quality issues. Next, I'll explain how I've been building that machinery in my work, how you can do the same, and interesting approaches I've yet to explore. You can install this post's backpressure skills by running npx @lucasfcosta/backpressured in your terminal. Then, run /backpressured