04:00
2026-05-27
arxiv.org
large-language-models
OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling
Researchers have introduced OmniToM, a benchmark that evaluates large language models' theory of mind by requiring explicit modeling of belief structures for all actors in a narrative. The benchmark, โฆ