04:00
2026-06-17
arxiv.org
computer-vision
Visuals Lie, Consistency Speaks: Disentangling Spatial Attention from Reliability in Vision-Language Models
Researchers at arXiv challenge the common assumption that visual attention correlates with reliability in vision-language models. Their VLM Reliability Probe study across multiple models finds that spโฆ