04:00
2026-05-29
arxiv.org
computer-vision
Embodied3DBench: Benchmarking Low-Level Embodied Spatial Intelligence of Vision Language Models
Researchers have introduced Embodied3DBench, a benchmark designed to evaluate low-level spatial intelligence in Vision Language Models (VLMs) within embodied 3D environments. The benchmark includes ovโฆ