30 Rock

mentions 1 type Person feed RSS

// recent coverage 1 mentions

16:54

2026-05-27

lesswrong.com

large-language-models

Leveraging Introspection for Alignment

Anthropic's Model Psych team published three papers exploring how large language models can introspect on their own emotional states, finding that models like Claude activate emotion vectors that infl…

// co-occurs with top 4 entities

Anthropic 1 Tracy Jordan 1 Claude 1 Uzay Macar 1