Progressive Pixel-Neighborhood Deformable Cross-Attention for Multispectral Object Detection

wpnews.pro

Source: arxiv.org · Published: 2026-06-24T04:00Z · Topic: computer-vision

Researchers propose Progressive Pixel-Neighborhood Deformable Cross-Attention (PNAFusion) for multispectral object detection, achieving state-of-the-art results on FLIR, M3FD, and DroneVehicle datasets while reducing GPU memory by 33% compared to prior methods.

1 min read · published Jun 24, 2026

arXiv:2606.24092v1

Abstract: Effective cross-modal feature alignment and interaction are central challenges in multispectral object detection. Although global cross-attention provides strong long-range modeling ability, its quadratic complexity with respect to feature size limits deployment on resource-constrained platforms. We therefore propose Progressive Pixel-Neighborhood Deformable Cross-Attention for multispectral feature fusion, termed PNAFusion.

The proposed framework is motivated by two observations: weak misalignment between visible and thermal images is usually concentrated around local neighborhoods, and semantic correspondence across modalities often follows non-linear spatial mappings that fixed receptive fields cannot model well. To address these issues, PNAFusion incorporates local spatial priors into its architectural design to concentrate feature interaction and alignment on the most relevant neighborhoods. Specifically, a Pixel-Neighborhood Cross-Attention (PNCA) module is introduced to avoid redundant global feature matching and suppress background noise. Meanwhile, an Adaptive Deformable Alignment (ADA) module captures non-linear spatial correspondences through learned pixel-wise offsets. These components are further integrated through an iterative feedback mechanism to progressively refine cross-modal feature alignment.

Experiments on FLIR, M3FD, and DroneVehicle show that PNAFusion achieves 84.2, 90.5, and 85.5 mAP@0.5, respectively, under the YOLOv5 detector, and further reaches 86.8 mAP@0.5 on FLIR and 90.8 mAP@0.5 on M3FD when transferred to Co-DETR. Efficiency analysis indicates that PNAFusion reduces allocated GPU memory by 33.0% compared with ICAFusion and reduces theoretical FLOPs from 194.8 G to 156.4 G, although the deformable sampling and iterative refinement introduce additional latency. Our code will be available at https://github.com/DanielQiuTian/PNAFusion.
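To make the contrast between global and neighborhood-restricted cross-attention concrete, the following is a minimal pure-Python sketch, not the authors' PNCA module. It assumes each query pixel from one modality attends only to the keys and values of the other modality inside a (2r+1) x (2r+1) window; the function names, the `radius` parameter, and the absence of learned projections, multiple heads, and deformable offsets are all simplifications introduced here for illustration.

```python
import math


def neighborhood_cross_attention(query_map, key_map, value_map, radius=1):
    """Hypothetical sketch: each query pixel attends to a local window only.

    All maps are nested lists shaped [height][width][dim]. The query map
    stands in for one modality (e.g. visible) and the key/value maps for
    the other (e.g. thermal). This is an illustration, not the paper's code.
    """
    height, width = len(query_map), len(query_map[0])
    dim = len(query_map[0][0])
    scale = 1.0 / math.sqrt(dim)
    output = []
    for y in range(height):
        row = []
        for x in range(width):
            query = query_map[y][x]
            # In-bounds positions of the window centred on (y, x).
            neighbors = [
                (ny, nx)
                for ny in range(max(0, y - radius), min(height, y + radius + 1))
                for nx in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            # Scaled dot-product scores against the neighbouring keys.
            scores = [
                scale * sum(q * k for q, k in zip(query, key_map[ny][nx]))
                for ny, nx in neighbors
            ]
            # Numerically stable softmax over the window.
            peak = max(scores)
            weights = [math.exp(s - peak) for s in scores]
            total = sum(weights)
            fused = [
                sum(w * value_map[ny][nx][c] for w, (ny, nx) in zip(weights, neighbors)) / total
                for c in range(dim)
            ]
            row.append(fused)
        output.append(row)
    return output


def pair_counts(height, width, radius):
    """Query-key pairs scored by global vs. windowed attention.

    The windowed count is an upper bound because border pixels have
    fewer in-bounds neighbours.
    """
    num_pixels = height * width
    global_pairs = num_pixels * num_pixels
    local_pairs = num_pixels * (2 * radius + 1) ** 2
    return global_pairs, local_pairs


# An 80x80 feature map with a 3x3 window (sizes chosen only for illustration).
print(pair_counts(80, 80, 1))  # (40960000, 57600)
```

The pair count grows quadratically with the number of pixels for global attention but only linearly for the windowed variant, which is the general motivation the abstract gives for restricting interaction to local neighborhoods. The efficiency figures reported in the abstract come from the authors' own measurements and are not reproduced by this sketch.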

Read original on arxiv.org → arxiv.org/abs/2606.24092
