04:00
2026-06-12
arxiv.org
computer-vision
SalArt-VQA: Diagnosing Whether VLMs Understand Salient Artifacts in Generated Images
Researchers have developed SalArt-VQA, a diagnostic benchmark to evaluate whether vision-language models (VLMs) truly understand salient artifacts in AI-generated images, rather than just correctly fl…