20:15
2026-06-24
gilesthomas.com
large-language-models
Thoughts on Role Confusion
Researchers Charles Ye, Jasmine Cui, and Dylan Hadfield-Menell found that large language models often ignore explicit role tags like <system> or <user> and instead infer roles from text tone, enablingβ¦