04:00
2026-06-30
arxiv.org
large-language-models
What LLMs explain is not what they believe: Evaluating explanation sufficiency under models' own input beliefs
Researchers introduced SCSuff, an information-theoretic metric to evaluate whether free-text explanations from large language models (LLMs) are sufficient to explain their outputs. Testing on LLMs sho…