gpt-oss-20b

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

23:59

2026-06-22

simonwillison.net

ai-safety

Prompt Injection as Role Confusion

Researchers Charles Ye, Jasmine Cui, and Dylan Hadfield-Menell found that large language models suffer from 'role confusion,' mistaking the style of text for its actual content, leading to successful …

// co-occurs with top 4 entities

Charles Ye 1 Jasmine Cui 1 Dylan Hadfield-Menell 1 Hacker News 1