
Not All NVFP4 QAT Recipes Are Equal: How Architecture and Scale Shape Model Quality for Anomaly Segmentation

A study of FP4 quantization-aware training (QAT) for real-time anomaly segmentation, evaluated on a recall-critical brain tumor segmentation task, found that architecture choice has a larger effect on quantization robustness than the QAT recipe. The Swin Transformer stayed robust to recipe choice across all scales, while CNN quality degraded under gradient-quantizing recipes at larger scales. Five-fold patient-level cross-validation confirmed the findings, and the authors recommend the Swin Transformer for FP4-quantized anomaly segmentation.

1 min read · published May 28, 2026

arXiv:2605.27616v1 · Abstract: Real-time anomaly segmentation demands both high recall and efficient low-precision inference. We study the three-way interaction of model architecture, model scale, and FP4 quantization-aware training (QAT) recipe on a recall-critical brain tumor segmentation task, evaluating multiple architectures, scales, and QAT recipes under a unified protocol. We find that architecture choice has the largest impact on quantization robustness, with attention-based architectures showing remarkable resilience to recipe choice while CNN degrades under gradient-quantizing recipes at larger scales. At low capacity, FP4 can discretize softmax attention, but advanced QAT recipes prevent this collapse. At larger scales, advanced recipes mitigate gradient quantization noise that degrades CNN quality. Five-fold patient-level cross-validation confirms these findings are robust to data partition. Our results show that the Swin Transformer is robust to QAT recipe choice across all scales, making it the recommended architecture for FP4-quantized anomaly segmentation.
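
To make the "FP4 can discretize softmax attention" remark concrete, here is a minimal pure-Python sketch of simulated ("fake") FP4 quantization, the kind of quantize-then-dequantize step a QAT forward pass typically applies. It is not the authors' recipe, which the abstract does not specify. The E2M1 value grid and the 16-element block size are assumptions based on how NVFP4 is commonly described, the per-block scale is kept as an ordinary float rather than an FP8-encoded one, and the names `round_to_grid`, `fake_quantize`, and the example logits are hypothetical.

```python
# Illustrative sketch only: simulated ("fake") FP4 quantization in pure Python.
# Assumptions (not from the article): E2M1 value grid, 16-element blocks,
# and a plain float scale per block instead of an FP8-encoded scale.
import math

# Non-negative magnitudes representable by a 4-bit E2M1 float.
E2M1_GRID = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)
FP4_MAX = E2M1_GRID[-1]


def round_to_grid(x):
    """Round one value to the nearest E2M1 magnitude, keeping its sign.

    Exact ties resolve to the smaller magnitude (a simplification).
    """
    mag = min(abs(x), FP4_MAX)  # saturate values beyond the largest magnitude
    nearest = min(E2M1_GRID, key=lambda g: abs(g - mag))
    return math.copysign(nearest, x)


def fake_quantize(values, block_size=16):
    """Quantize then dequantize `values`, one shared scale per block."""
    out = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        amax = max(abs(v) for v in block)
        # Map the largest magnitude in the block onto FP4_MAX.
        scale = amax / FP4_MAX if amax > 0 else 1.0
        out.extend(round_to_grid(v / scale) * scale for v in block)
    return out


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    logits = [0.1 * i for i in range(16)]  # hypothetical attention logits
    probs = softmax(logits)
    q_probs = fake_quantize(probs)
    print("distinct attention weights before:", len(set(probs)))
    print("distinct attention weights after: ", len(set(q_probs)))
```

Within one block, the quantized attention weights can take at most eight distinct non-negative values, so nearby probabilities merge into the same level. In training, such a step is usually paired with a straight-through estimator that passes gradients through the rounding unchanged. Recipes that also quantize gradients add further noise, which is the effect the abstract associates with CNN degradation at larger scales.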
