
XSSR: Cross-Domain Self-Supervised Representative Selection for Efficient Annotation in Medical Image Segmentation

Researchers introduced XSSR, a cross-domain self-supervised framework that selects representative medical images for annotation, reducing labeling effort while maintaining segmentation accuracy. Evaluated on three public benchmarks under a fixed 5% annotation budget, the method reached 99.3% of full-data performance on Chest X-ray using only 22 labeled samples and outperformed random selection by up to 2.5 Dice points on Prostate MRI. XSSR targets the domain shifts common in medical imaging: it automatically calibrates its novelty-diversity trade-off parameter and surpassed the CoreSet baseline by 0.4 to 1.2 Dice points on every dataset.

1 min read · published Jun 4, 2026

arXiv:2606.04301v1. Abstract: Acquiring labeled medical image data is resource-intensive, a challenge further exacerbated in cross-domain scenarios where source and target datasets differ in imaging equipment, population, or clinical site. This study introduces XSSR (Cross-Domain Self-Supervised Representative Selection), a framework designed to minimize annotation effort in the target domain while maintaining robust segmentation performance. XSSR comprises three stages: first, a Masked Autoencoder (MAE) is trained on unlabeled source data to establish a shared embedding space without requiring target labels; second, a greedy selection algorithm scores unlabeled target samples based on a composite density, novelty, and diversity criterion; and third, a U-Net segmentation model is trained exclusively on the selected subset. The novelty-diversity trade-off parameter, alpha, is automatically calibrated by minimizing embedding-space coverage, eliminating manual tuning. We evaluate XSSR on three public benchmarks: Chest X-ray, RIGA+ retinal fundus imaging, and multi-site Prostate MRI, each under a fixed 5% annotation budget. XSSR achieves 99.3% of full-data performance on Chest X-ray using only 22 labeled samples, surpasses random selection by up to 2.5 Dice points on Prostate MRI, and consistently outperforms the CoreSet baseline by 0.4 to 1.2 Dice points across all datasets. Ablation studies indicate that diversity is the most influential scoring component, and per-site analysis shows that performance correlates with scanner similarity to the source domain.
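
To make the second stage more concrete, here is a minimal Python sketch of a greedy selector with a composite density, novelty, and diversity score. The abstract does not give the scoring formulas, so everything below is an assumption for illustration rather than the authors' method: density is taken as the inverse mean distance to the k nearest target neighbours, novelty as the distance to the nearest source embedding, diversity as the distance to the nearest already-selected sample, and "coverage" as the k-center coverage radius (the largest distance from any target sample to its nearest selected sample). The function names `greedy_select`, `coverage_radius`, and `calibrate_alpha` are hypothetical, and the embeddings are assumed to come from an encoder such as the MAE described above.

```python
import numpy as np


def _norm01(x):
    """Rescale a vector to [0, 1]; return zeros if it is constant."""
    span = x.max() - x.min()
    return (x - x.min()) / span if span > 0 else np.zeros_like(x)


def greedy_select(target_emb, source_emb, budget, alpha=0.5, k=5):
    """Greedily pick `budget` target indices with a composite score.

    ASSUMED scoring (not taken from the paper):
      score = density + alpha * novelty + (1 - alpha) * diversity
    """
    n = target_emb.shape[0]
    # Pairwise Euclidean distances: target-target and target-source.
    tt = np.linalg.norm(target_emb[:, None, :] - target_emb[None, :, :], axis=-1)
    ts = np.linalg.norm(target_emb[:, None, :] - source_emb[None, :, :], axis=-1)

    # Density: inverse mean distance to the k nearest neighbours (column 0 is self).
    k = max(1, min(k, n - 1))
    knn = np.sort(tt, axis=1)[:, 1:k + 1]
    density = _norm01(1.0 / (knn.mean(axis=1) + 1e-8))
    # Novelty: how far a target sample lies from the source domain.
    novelty = _norm01(ts.min(axis=1))

    chosen = np.zeros(n, dtype=bool)
    min_dist = np.full(n, np.inf)  # distance to the nearest selected sample
    selected = []
    for _ in range(min(budget, n)):
        # Diversity is undefined before the first pick, so it starts at zero.
        diversity = _norm01(min_dist) if selected else np.zeros(n)
        score = density + alpha * novelty + (1.0 - alpha) * diversity
        score[chosen] = -np.inf  # never pick the same sample twice
        pick = int(np.argmax(score))
        selected.append(pick)
        chosen[pick] = True
        min_dist = np.minimum(min_dist, tt[pick])
    return selected


def coverage_radius(target_emb, selected):
    """Largest distance from any target sample to its nearest selected sample."""
    d = np.linalg.norm(
        target_emb[:, None, :] - target_emb[selected][None, :, :], axis=-1
    )
    return float(d.min(axis=1).max())


def calibrate_alpha(target_emb, source_emb, budget, grid=(0.0, 0.25, 0.5, 0.75, 1.0)):
    """Grid-search alpha, keeping the value with the smallest coverage radius."""
    best = None
    for alpha in grid:
        sel = greedy_select(target_emb, source_emb, budget, alpha=alpha)
        radius = coverage_radius(target_emb, sel)
        if best is None or radius < best[0]:
            best = (radius, alpha, sel)
    return best[1], best[2]
```

Under these assumptions, calling `calibrate_alpha(target_emb, source_emb, budget)` with two NumPy arrays of shape `(n_samples, dim)` returns the chosen alpha and the indices to send for annotation; a segmentation model would then be trained only on those samples. The dense pairwise distance matrices keep the sketch short but scale quadratically, so a nearest-neighbour index would be a more practical choice for large target sets.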
