04:00
2026-05-27
arxiv.org
computer-vision
Benchmarking Convolutional, Transformer, Hybrid, and Vision Language Models for Multi Disease Retinal Screening
A new study benchmarked twelve deep learning architectures across four model families—convolutional neural networks, vision transformers, hybrid CNN-transformer backbones, and vision-language models—f…