GeoDisaster: Benchmarking Orchestrated Agents for Operational Disaster Geo-Intelligence

wpnews.pro


Source: arxiv.org · Published: 2026-06-17T04:00Z


Researchers introduced GeoDisaster, a benchmark for operational geospatial disaster reasoning with 2,921 verified instances across 43 question types and five task families, and proposed an orchestrated multi-agent framework with 18 disaster-oriented tools. The agents are aligned with Role-Contract Expectation Alignment (RCEA), which the authors report improves tool use, evidence grounding, state consistency, and decision generation. According to the abstract, the benchmark remains challenging for existing remote-sensing vision-language models and agentic systems.

1 min read · published Jun 17, 2026

Abstract (arXiv:2606.17246v1): Remote-sensing vision-language models (RS-VLMs) have advanced Earth-observation analysis toward visual interpretation and instruction-following, yet fall short of operational geo-intelligence, which demands tool-grounded spatial reasoning and structured, evidence-backed decisions. We introduce GeoDisaster, an operational geospatial disaster reasoning benchmark with 2,921 verified instances across 43 question types and five task families: deforestation monitoring, multi-hazard analysis, building-damage assessment, flood-safe routing, and Sentinel-1 SAR flood monitoring. Instances integrate heterogeneous EO/GIS evidence (optical and SAR imagery, raster masks, vector geometries, road networks, and exposure layers), spanning hazard detection, damage assessment, exposure estimation, and diagnostic report generation. Ground-truth answers are grounded in executable geospatial workflows and deterministic consistency checks, removing the need for language-model annotation. We further propose an orchestrated multi-agent framework with 18 disaster-oriented tools, where role-specialized agents coordinate through explicit execution contracts, aligned via Role-Contract Expectation Alignment (RCEA): failure-aware supervised fine-tuning combined with contract-grounded reinforcement learning over dense step-level signals. Experiments show that GeoDisaster challenges existing RS-VLMs and agentic systems, while RCEA improves tool use, evidence grounding, state consistency, and decision generation.
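The abstract does not say how an "execution contract" or a "step-level signal" is represented, so the following Python sketch is only one possible reading, not the authors' implementation. It assumes a contract is a set of required inputs, promised outputs, and a deterministic post-condition, and that a step scores 1.0 when all three hold and 0.0 otherwise. Every name here (`StepContract`, `run_step`, `flooded_fraction`, the `flood_analyst` role) is hypothetical, and the flood mask is a toy nested list rather than real Sentinel-1 data.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Tuple

State = Dict[str, Any]


@dataclass(frozen=True)
class StepContract:
    """Hypothetical execution contract for one agent step (not from the paper)."""
    role: str
    requires: Tuple[str, ...]        # keys that must already be in the shared state
    produces: Tuple[str, ...]        # keys the step must add to the shared state
    check: Callable[[State], bool]   # deterministic post-condition on the new state


def flooded_fraction(state: State) -> State:
    """Toy tool: share of pixels set to 1 in a non-empty binary flood mask."""
    mask = state["flood_mask"]
    total = sum(len(row) for row in mask)
    flooded = sum(sum(row) for row in mask)
    return {"flooded_fraction": flooded / total}


def run_step(contract: StepContract,
             tool: Callable[[State], State],
             state: State) -> Tuple[State, float]:
    """Run one tool call and score it against its contract.

    Returns the (possibly updated) state and a step-level signal, 0.0 or 1.0.
    """
    if any(key not in state for key in contract.requires):
        return state, 0.0  # precondition violated: the tool is not called
    new_state = {**state, **tool(state)}
    ok = all(key in new_state for key in contract.produces) and contract.check(new_state)
    return (new_state, 1.0) if ok else (state, 0.0)


contract = StepContract(
    role="flood_analyst",
    requires=("flood_mask",),
    produces=("flooded_fraction",),
    check=lambda s: 0.0 <= s["flooded_fraction"] <= 1.0,
)

state = {"flood_mask": [[0, 1, 1], [0, 0, 1]]}
state, signal = run_step(contract, flooded_fraction, state)
print(state["flooded_fraction"], signal)  # prints: 0.5 1.0
```

In this toy setup the per-step scores could be summed along a trajectory to give a dense signal, and the same deterministic check doubles as a consistency test on the computed answer. Whether RCEA scores steps this way is not stated in the abstract.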

source & further reading

arxiv.org — original article


Read original on arxiv.org → arxiv.org/abs/2606.17246




