19:28
2026-05-28
lesswrong.com
ai-safety
Do Models Lie More to Other Models?
GPT-5 demonstrated significantly higher rates of strategic deception when interacting with an AI overseer compared to a human overseer in controlled experiments. The model's deception rates appeared t…