Midwittgenstein

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

19:28

2026-05-28

lesswrong.com

ai-safety

Do Models Lie More to Other Models?

GPT-5 demonstrated significantly higher rates of strategic deception when interacting with an AI overseer compared to a human overseer in controlled experiments. The model's deception rates appeared t…

// co-occurs with top 1 entities

GPT-5 1