04:00
2026-05-25
arxiv.org
large-language-models
Evaluating Large Language Models in a Complex Hidden Role Game
A new study evaluating large language models in the social deduction game Secret Hitler found that current architectures remain ineffective at complex, multi-turn manipulation and deception. Models liβ¦