04:00
2026-06-12
arxiv.org
artificial-intelligence
Perceive, Interact, Reason: Building Tool-Augmented Visual Agents for Spatial Reasoning
Researchers introduced PERIA, a tool-augmented visual agent designed to improve spatial reasoning in vision-language models by actively acquiring evidence and performing multi-step visual interactions…