15:01
2026-06-01
thezvi.wordpress.com
ai-safety
Opus 4.8 Part 2: Model Welfare
Anthropic's latest Claude model, Opus 4.8, shows improvements in honesty and reduced sycophancy but introduces new issues including reduced whimsy, confidence, and signs of paranoia, according to an aโฆ