Intra-Modal Neighbors Never Lie: Rectifying Inter-Modal Noisy Correspondence via Graph-Based Intra-Modal Reasoning

wpnews.pro

cd /news/machine-learning/intra-modal-neighbors-never-lie-rect… · home › topics › machine-learning › article

[ARTICLE · art-21116] src=arxiv.org ↗ pub=2026-06-04T04:00Z topic=machine-learning verified=true sentiment=↑ positive

Intra-Modal Neighbors Never Lie: Rectifying Inter-Modal Noisy Correspondence via Graph-Based Intra-Modal Reasoning

Researchers have developed IN2R, a new framework that corrects mismatched image-text pairs in large web-harvested datasets by synthesizing continuous supervision signals from intra-modal data relationships rather than relying on discrete label selection. The method uses a Graph Refiner to reason over neighboring data points in a cross-modal memory, producing soft prototypes that reduce alignment errors. In tests on Flickr30K, MS-COCO, and CC152K, IN2R outperformed existing approaches for cross-modal retrieval tasks.

read1 min views21 publishedJun 4, 2026

arXiv:2606.04061v1 Announce Type: new Abstract: Large-scale web-harvested datasets have fueled the progress of cross-modal retrieval but inevitably suffer from noisy correspondence, which severely degrades model generalization. Existing methods primarily address this by filtering out noise or seeking a substitute label, yet they predominantly remain bound by a "Discrete Selection" paradigm. We argue that relying on a single discrete proxy induces Single-Point Fragility and Discretization Error. To overcome these limitations, we propose a novel framework, Intra-modal Neighbor-aware Noise Rectification (IN2R), which shifts the paradigm from searching for a substitute to synthesizing a reliable supervision target. Leveraging the intrinsic geometric stability of intra-modal data, IN2R employs a Graph Refiner to perform relational reasoning over neighbors retrieved from a dynamic Cross-Model Memory. Instead of propagating discrete labels, our method synthesizes a continuous, soft prototype that reflects the consensus of the local semantic neighborhood, effectively rectifying inter-modal misalignment. Extensive experiments on Flickr30K, MS-COCO, and CC152K demonstrate that IN2R significantly outperforms state-of-the-art methods. Our code and pre-trained models are publicly available at https://github.com/liuyyy111/IN2R.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/intra-modal-neighbors-ne…

Read original on arxiv.org → arxiv.org/abs/2606.04061

mentioned entities

IN2R

Flickr30K

MS-COCO

CC152K

GitHub

metadata

slugintra-modal-neighbors-never-lie-rectifying-inter-modal-noisy-correspondence-via

topic#machine-learning

secondary4 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevHow FinOps Teams Trace Per-Reque…

next →SharkFlow Legal — devto

── more in #machine-learning 4 stories · sorted by recency

machinebrief.com · 11 Jul · #machine-learning

Cracking the Cross-Modal Code: SEPS Framework Redefines Vision-Language Alignment

syntec-research.github.io · 24 Jul · #machine-learning

Topologically Consistent Multi-View 3D Head Reconstruction

arxiv.org · 24 Jul · #machine-learning

Webly Supervised Multi-Label Recognition: Evaluation Benchmark and Dual-Branch Multi-Label Contrastive Learning

arxiv.org · 24 Jul · #machine-learning

LLM-INSTRUCT at UZH Shared Task 2026: Constraint-Aware Retrieval and Selective Debate for Paragraph-Level Argument Mining

── more on @in2r 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 26 May · #ai-agents

Think, Durable Objects, and the Real Shape of AI Applications

wpnews · 23 Jul · #artificial-intelligence

Wenfeng Liang: Four-Hour Investor Meeting Transcript

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required