{"slug": "rsgpnet-geometric-prompting-for-remote-sensing-open-vocabulary-semantic", "title": "RSGPNet: Geometric Prompting for Remote Sensing Open-Vocabulary Semantic Segmentation", "summary": "Researchers propose RSGPNet, a training-free geometric prompting framework for open-vocabulary semantic segmentation in remote sensing, which improves segmentation accuracy by leveraging object geometric areas and consistency constraints. The method outperforms state-of-the-art approaches on RS datasets.", "body_md": "arXiv:2606.28410v1 Announce Type: new\nAbstract: Open-vocabulary semantic segmentation (OVSS) enables text-guided segmentation of unseen objects, breaking fixed-class limitations to achieve open-world understanding. However, existing OVSS methods primarily focus on modifying the CLIP attention mechanism, which still suffers from unstable local segmentation for remote sensing (RS) domain. To address these limitations, we propose RSGPNet, a training-free geometric prompting framework for RS OVSS that refines segmentation by leveraging object geometric areas and consistency constraints. Specifically, RSGPNet comprises three core modules: a Text-guided Coarse Mask module (TCM), a Geometric Re-prompting Module (GRP), and a Coarse-to-fine Consistency Verification Mechanism (CVM). TCM utilizes text prompts and the input image to construct initial coarse segmentation masks. GRP then converts these coarse masks into geometric box prompts, feeding them back into the segmentation model to generate refined masks. Finally, CVM employs consistency computation to prevent prompting from reinforcing erroneous regions. They allow the model to improve segmentation accuracy in complex areas, such as category boundaries. Extensive experiments on RS datasets demonstrate that RSGPNet significantly outperforms state-of-the-art methods across both quantitative and qualitative metrics while exhibiting excellent interpretability. The code is released at \\href{https://github.com/wangshanwen001/RSGPNet}{https://github.com/wangshanwen001/RSGPNet}.", "url": "https://wpnews.pro/news/rsgpnet-geometric-prompting-for-remote-sensing-open-vocabulary-semantic", "canonical_source": "https://arxiv.org/abs/2606.28410", "published_at": "2026-06-30 04:00:00+00:00", "updated_at": "2026-06-30 04:25:31.183312+00:00", "lang": "en", "topics": ["computer-vision", "machine-learning"], "entities": ["RSGPNet", "CLIP"], "alternates": {"html": "https://wpnews.pro/news/rsgpnet-geometric-prompting-for-remote-sensing-open-vocabulary-semantic", "markdown": "https://wpnews.pro/news/rsgpnet-geometric-prompting-for-remote-sensing-open-vocabulary-semantic.md", "text": "https://wpnews.pro/news/rsgpnet-geometric-prompting-for-remote-sensing-open-vocabulary-semantic.txt", "jsonld": "https://wpnews.pro/news/rsgpnet-geometric-prompting-for-remote-sensing-open-vocabulary-semantic.jsonld"}}