OpenAI Help: Lockdown Mode


Source: simonwillison.net · Published: 2026-06-06T23:56Z · Topic: ai-safety


OpenAI introduced Lockdown Mode to limit outbound network requests that could exfiltrate sensitive data during the final stage of a prompt injection attack. The feature does not prevent prompt injections from appearing in the content ChatGPT processes. Instead it targets the exfiltration leg of the "Lethal Trifecta": the combination of private data access, exposure to untrusted content, and a channel for sending stolen data to an attacker. By restricting that channel with deterministic, non-AI mechanisms, Lockdown Mode aims to close off the attack without making LLM systems far less useful.


Lockdown Mode is designed to help prevent the final stage of data exfiltration from a prompt injection attack by limiting outbound network requests that could transfer sensitive data to an attacker. Lockdown Mode does not prevent prompt injections from appearing in the content ChatGPT processes. For example, a prompt injection could appear in cached web content or in an uploaded file, and could still affect the behavior or accuracy of a response.

This looks really good to me.

The Lethal Trifecta occurs when an LLM system combines all three of the following: access to private data, exposure to untrusted content, and a way to transmit stolen data back to the attacker.

The only way to solve the trifecta is to cut off one of the three legs, and by far the easiest leg to restrict without making your LLM systems far less useful is the exfiltration vector used to steal data.
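To make the "three legs" argument concrete, here is a minimal sketch in Python. The `AgentCapabilities` class and `has_lethal_trifecta` function are hypothetical names invented for illustration; they are not part of any OpenAI product or API. The point is only that the dangerous state is a conjunction, so removing any one leg breaks it.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class AgentCapabilities:
    """Hypothetical description of what an LLM system is allowed to do."""
    private_data_access: bool         # can read private data
    untrusted_content_exposure: bool  # processes content an attacker may control
    exfiltration_channel: bool        # can send data somewhere an attacker can read


def has_lethal_trifecta(caps: AgentCapabilities) -> bool:
    # The trifecta is present only when all three legs hold at once.
    return (
        caps.private_data_access
        and caps.untrusted_content_exposure
        and caps.exfiltration_channel
    )


# An agent with all three legs is exposed.
agent = AgentCapabilities(
    private_data_access=True,
    untrusted_content_exposure=True,
    exfiltration_channel=True,
)

# Cutting off only the exfiltration leg leaves the other two capabilities intact.
locked_down = replace(agent, exfiltration_channel=False)
```

In this toy model `locked_down` still reads private data and still sees untrusted content, which is why restricting exfiltration costs the least usefulness of the three options.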

It looks to me like Lockdown Mode directly attacks that leg, using mechanisms that are deterministic and, crucially, are not evaluated by AI systems that can themselves be subverted by sufficiently devious attacks.
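As an illustration of what a deterministic, non-AI check can look like, here is a sketch of a host allowlist for outbound requests. This is an assumption about the general technique, not a description of how OpenAI implements Lockdown Mode; `ALLOWED_HOSTS`, `is_request_allowed`, and the example hostnames are all hypothetical.

```python
from urllib.parse import urlsplit

# Hypothetical allowlist: the only hosts the agent may contact.
ALLOWED_HOSTS = frozenset({"api.internal.example"})


def is_request_allowed(url: str, allowed_hosts: frozenset = ALLOWED_HOSTS) -> bool:
    """Decide with plain string rules whether an outbound request may proceed.

    No model is consulted, so instructions injected into the content
    the agent reads cannot change the outcome.
    """
    try:
        parts = urlsplit(url)
        host = parts.hostname  # lowercased host, without userinfo or port
    except ValueError:
        # Malformed URLs are rejected rather than guessed at.
        return False
    if parts.scheme != "https":
        return False
    # Exact match only: no suffix or substring matching.
    return host is not None and host in allowed_hosts
```

Exact host matching is a deliberate choice in this sketch: a URL such as `https://api.internal.example.attacker.example/` or `https://api.internal.example@attacker.example/` resolves to an attacker-controlled host and is rejected, while `https://api.internal.example/v1/items` passes.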

Tags: security, ai, openai, prompt-injection, llms, lethal-trifecta


Read original on simonwillison.net → simonwillison.net/2026/Jun/6/openai-help-lockdow…
