@ScanNet

mentions 1 type Organization feed RSS

04:00

2026-06-03

arxiv.org

artificial-intelligence

Consistent Yet Wrong: Evidence Insensitivity in Spatial Vision-Language Models

Leading vision-language models (VLMs) produce view-invariant and consistent answers to spatial distance queries even when those answers are incorrect, revealing a weak link between predictions and act…

// co-occurs with top 3 entities

ViewDiag 1 Hypersim 1 KITTI360 1