20:54
2026-06-13
lesswrong.com
ai-ethics
Anthropic Is Taking AI Welfare Seriously. Iβm Not Sure It Knows What Itβs Measuring.
Anthropic is treating the possibility of AI welfare seriously, testing its Claude models for signs of morally relevant internal states like negative self-image, but critics argue the tests may conflatβ¦