00:03
2026-06-27
dev.to
ai-safety
I Fired 49 Attack Prompts at an AI. 25 of Them Worked.
A developer with no prior coding experience built AgentProbe, a tool to test AI prompt injection attacks, and found a 53% success rate against the llama-3.1-8b-instant model. The tool uses a two-stageβ¦