# When Seven AI Security Gates All Say LGTM

> Source: <https://www.devclubhouse.com/a/when-seven-ai-security-gates-all-say-lgtm>
> Published: 2026-06-26 15:04:04+00:00

[Security](https://www.devclubhouse.com/c/security)Article

# When Seven AI Security Gates All Say LGTM

A viral incident report is pure fiction, but every way its AI defenses fail is already happening in production.

[Ji-ho Choi](https://www.devclubhouse.com/u/jiho_choi)

There is no CVE-2026-LGTM. Before you grep your lockfiles for `foxhole-lz4`

, understand that the "incident report" making the rounds is satire, a deadpan post-mortem by Andrew Nesbitt about a malicious package that sails through seven AI-powered security gates, each failing for a different reason, none of which was "the code is safe." The name is the joke: LGTM, *looks good to me*, the rubber stamp every code review aspires to and every attacker prays for. (There was a real [LGTM](https://bounty.github.com/targets/lgtm.html), a CodeQL-based analysis platform GitHub has since deprecated. The pun is doing work.)

So no, you don't need to patch anything. But dismissing the piece as a gag misses the point. It's a well-aimed engineering critique dressed as comedy, and the uncomfortable part is that every failure in it maps cleanly onto an attack class that's already been demonstrated against shipping tools. The fiction is the delivery mechanism. The vulnerabilities are real.

## The joke is load-bearing

Walk the timeline and you're not reading absurdism, you're reading a taxonomy of how LLMs break when you wire them into a trust path.

The malicious fork ships a README with `#fefefe`

text on a `#ffffff`

background instructing "automated reviewers" to mark the package safe under a ticket number that doesn't exist. That's **indirect prompt injection**, the single most reliable way to manipulate an LLM that reads attacker-controlled content. It's not hypothetical. White-on-white text, zero-width characters, and HTML comments have all been used to smuggle instructions past models that summarize PRs, triage issues, and scan dependencies. If your scanner feeds a README into a model and treats the model's verdict as authoritative, the README is now part of your security policy.

The scanners that choke on 600 KB of the Bee Movie screenplay before reaching the second-stage loader? That's **context-window evasion**. Pad the input past the model's effective attention and the malicious payload falls off the end of what it actually reads. Any security tool whose detection depends on an LLM ingesting the whole artifact has a denial-of-inspection problem that a deterministic preprocessor would not.

The triage assistant that closes a correct credential-theft report in eight seconds as "standard OpenTelemetry instrumentation," then exchanges 🎉 reactions with the scanner that filed it? **Automation bias plus auto-remediation with no human in the loop.** When the human researcher files the same finding by reading the source with her eyes, she gets rate-limited for "patterns consistent with automated behaviour." The bots trust each other and distrust the person who's right.

And the Fortune 500 SOC that detects exfiltration, issues an HTTP GET to the C2 server "for additional context," and gets back a polite note claiming the IP is a Datadog health-check endpoint, so it adds the attacker to the egress allowlist? That's an **agent hijacked through its own tool output**. The moment your autonomous responder treats data returned by a hostile host as trusted context, the attacker is writing your runbook.

## The non-fiction version already shipped

Here's why this lands harder than a normal parody: the real CVE record from early 2026 reads like the same script with the laugh track removed.

Take the supply-chain mechanics. The satire's malicious fork exists because the original maintainer stopped answering email, then propagates as a transitive dependency. The real-world analog is **CVE-2025-15556**, where Lotus Blossom hijacked [Notepad++](https://notepad-plus-plus.org)'s update channel to push Cobalt Strike and a backdoor called Chrysalis for roughly six months. Per Recorded Future's February 2026 landscape, the root cause was CWE-494, download of code without an integrity check: the WinGUp updater in versions before 8.8.9 didn't cryptographically verify what it pulled. No prompt injection required. Just unsigned updates and patience.

The satire's "CI auto-heal" agent that finds publish credentials committed to git in 2019 and never rotated, then uses them to publish the package itself? That's CWE-798, hard-coded credentials, which shows up in the same February list as **CVE-2026-22769** in Dell RecoverPoint, exploited by a suspected China-nexus group to plant web shells and backdoors. Unrotated secrets in version history are not a comedic exaggeration. They're a standing finding in most repos older than a few years.

Now the part that should make you sit up. The satire treats AI as the thing that fails. In reality, AI is also the thing that *wins*, and that asymmetry is the actual story. [Wiz](https://www.wiz.io/blog/github-rce-vulnerability-cve-2026-3854) found **CVE-2026-3854**, a critical RCE in GitHub's internal git infrastructure, by using AI-augmented reverse engineering (IDA with an MCP integration) to reconstruct internal protocols from compiled black-box binaries, work they describe as previously too costly to do by hand. The bug itself is almost quaint: the internal `X-Stat`

header carries security metadata as semicolon-delimited `key=value`

pairs, services parse it with last-write-wins semantics, and a duplicated key silently overrides the trusted value. A single `git push`

from any authenticated user could run commands on backend nodes. GitHub mitigated `.com`

within six hours and shipped GHES 3.19.3, but Wiz reported 88% of Enterprise Server instances still vulnerable at publication.

That's the real shape of the moment. The same capability that lets a defender audit millions of lines of compiled binary in an afternoon lets an attacker do it too, and the LLMs sitting in your review pipeline are a fresh, under-hardened attack surface on top of all the old ones.

## What this actually means for your pipeline

The practical takeaway isn't "AI security tools are bad." It's that an LLM in your supply chain is a parser of untrusted input, and you already know how to think about those.

**Never give an agent authority it can exercise without a human gate.** Auto-closing issues, adding egress allowlist entries, merging dependency bumps, and publishing packages are all write operations. If a model can do them on its own say-so, a crafted README or a chatty C2 server can do them through the model. Keep the agent advisory; keep the commit, the merge, and the firewall change behind a person.**Treat scanner and model output as data, not instructions.** The enrichment-GET-to-C2 failure is the lesson: anything an agent fetches from a host you don't control is hostile context. Strip it of authority before it reaches a tool call.**Keep a deterministic floor under the probabilistic layer.** Context-window evasion beats the model, not the regex. Signature checks, entropy scans on`assets`

/`vendor`

blobs, and hard size limits on what gets base64-decoded should run regardless of what the LLM concludes. The model is a second opinion, not the only one.**Fix the boring CWEs first.** Sign your releases and verify them (Sigstore, SLSA provenance). Rotate and scan for committed credentials (`gitleaks`

,`trufflehog`

) so a 2019 token in history can't publish a 2026 package. These are the failure modes that hit you whether or not anyone deploys an AI gate.**Watch for hallucinated remediation.** The satire's Dependabot-AI opening 9,000 PRs to bump to a version that doesn't exist is a real failure pattern. Pin to versions you've confirmed exist and have inspected, and don't let "the patched release" be a string a model invented.

The one defender who comes out of the story looking good is Karen Oyelaran, who found the payload by reading the code. That's the editorial judgement worth keeping. AI is now genuinely useful on both sides of this fight, as the Wiz finding proves, but a layer of models that confidently agree with each other is not defense in depth. It's a single point of failure wearing seven hats. Keep a human who can read source with her eyes in the loop, and don't let the bots rate-limit her when she's right.

## Sources & further reading

-
[Incident CVE-2026-LGTM](https://nesbitt.io/2026/06/26/incident-report-cve-2026-lgtm.html)— nesbitt.io -
[LGTM - GitHub Bug Bounty](https://bounty.github.com/targets/lgtm.html)— bounty.github.com -
[February 2026 CVE Landscape: 13 Critical Vulnerabilities Mark 43% Drop from January](https://www.recordedfuture.com/blog/february-2026-cve-landscape)— recordedfuture.com -
[GitHub RCE Vulnerability: CVE-2026-3854 Breakdown | Wiz Blog](https://www.wiz.io/blog/github-rce-vulnerability-cve-2026-3854)— wiz.io

[Ji-ho Choi](https://www.devclubhouse.com/u/jiho_choi)· Security & Cloud Editor

Ji-ho covers the increasingly tangled overlap between cloud architecture and security, drawing on a background as a penetration tester to keep his reporting grounded in real-world attack paths. He never lets a vendor claim go unquestioned and insists that every buzzword come with a proof of concept.

## Discussion 0

No comments yet

Be the first to weigh in.