Zhengbo Zhang

mentions 1 type Person feed RSS

// recent coverage 1 mentions

04:00

2026-06-16

arxiv.org

artificial-intelligence

Visual-Seeker: Towards Visual-Native Multimodal Agentic Search via Active Visual Reasoning

Researchers propose Visual-Seeker, a multimodal deep search agent that actively reasons over visual details to perform multi-hop, cross-modal search. The agent achieves state-of-the-art results on fiv…

// co-occurs with top 2 entities

Visual-Seeker 1 arXiv 1