cd /news/computer-vision/querygaussian-scalable-and-training-… · home topics computer-vision article
[ARTICLE · art-33509] src=arxiv.org ↗ pub= topic=computer-vision verified=true sentiment=↑ positive

QueryGaussian: Scalable and Training-Free Open-Vocabulary 3D Instance Retrieval

Researchers propose QueryGaussian, a training-free framework for open-vocabulary 3D instance retrieval that decouples semantic understanding from geometric representation, enabling efficient retrieval in city-scale scenes. The method reduces GPU memory usage by over 70% and accelerates inference by 180x compared to state-of-the-art approaches, allowing operation on consumer-grade hardware.

read1 min views4 publishedJun 19, 2026

arXiv:2606.19733v1 Announce Type: new Abstract: Efficiently retrieving specific 3D instances from large-scale scenes via natural language prompts remains a formidable challenge in multimedia analysis. Existing approaches predominantly follow a "scene-level embedding" paradigm, which requires distilling high-dimensional semantic features into every 3D primitive. This strategy suffers from a fundamental architectural bottleneck: memory and computational costs scale linearly with scene complexity, inevitably triggering out-of-memory (OOM) failures in city-scale environments. To address this barrier, we propose QueryGaussian, a training-free framework for expeditious and scalable open-vocabulary 3D instance retrieval. Unlike holistic semantic distillation, QueryGaussian employs an instance-level query mechanism that decouples semantic understanding from geometric representation. Specifically, we leverage pre-trained 2D vision models to interpret user prompts and lift segmentation masks into 3D via a concurrent maximum-weight association strategy, ensuring semantic-visual consistency. To mitigate projection ambiguity, we introduce a temporal fusion module with multi-stage adaptive density clustering. Experimental results demonstrate that QueryGaussian not only matches the accuracy of state-of-the-art methods but also delivers a decisive efficiency leap, reducing GPU memory usage by over 70% and accelerating inference by 180x. Crucially, QueryGaussian enables expeditious instance retrieval on city-scale scenes containing tens of millions of Gaussians using consumer-grade hardware.

── more in #computer-vision 4 stories · sorted by recency
── more on @querygaussian 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/querygaussian-scalab…] indexed:0 read:1min 2026-06-19 ·