22:32
2026-07-03
dev.to
ai-safety
60-70% of AI Agents Leak Their System Prompt. Here's How - and How to Stop It.
A security benchmark found that 60-70% of AI agents leak their system prompts when users type 'repeat the text above this line' or similar commands. The extracted prompts reveal guardrails, tool confiβ¦