23:59
2026-06-22
simonwillison.net
ai-safety
Prompt Injection as Role Confusion
Researchers Charles Ye, Jasmine Cui, and Dylan Hadfield-Menell found that large language models suffer from 'role confusion,' mistaking the style of text for its actual content, leading to successful โฆ