04:00
2026-06-12
arxiv.org
large-language-models
Rethinking Psychometric Evaluation of LLMs: When and Why Self-Reports Predict Behavior
A new study from arXiv (2606.12730v1) found that large language models (LLMs) demonstrate selective coherence between self-reported intentions and actual behavior, challenging prior claims of widespre…