04:00
2026-06-16
arxiv.org
artificial-intelligence
Visual-Seeker: Towards Visual-Native Multimodal Agentic Search via Active Visual Reasoning
Researchers propose Visual-Seeker, a multimodal deep search agent that actively reasons over visual details to perform multi-hop, cross-modal search. The agent achieves state-of-the-art results on fivโฆ