CNNs, Transformers, Hybrid, and Vision Language Models for Skin Cancer Detection

wpnews.pro

cd /news/computer-vision/cnns-transformers-hybrid-and-vision-… · home › topics › computer-vision › article

[ARTICLE · art-14872] src=arxiv.org ↗ pub=2026-05-27T04:00Z topic=computer-vision verified=true sentiment=· neutral

CNNs, Transformers, Hybrid, and Vision Language Models for Skin Cancer Detection

A unified evaluation of twelve deep learning models for binary skin cancer detection on the PAD-UFES-20 dataset found that well-tuned CNNs provide strong baselines, but transformer-based families consistently improve discrimination. Hybrid models (MaxViT Tiny, CoAtNet0) and a SigLIP-based vision language model achieved the best overall trade-off between ranking performance and clinically relevant operating points, while a CLIP-based model offered high precision. The findings offer practical guidance on model family suitability for real-world skin cancer screening deployment and establish a reproducible reference point for future work on the dataset.

read1 min views16 publishedMay 27, 2026

arXiv:2605.26294v1 Announce Type: new Abstract: Skin cancer is a common and fast rising malignancy worldwide. Early detection is critical for improving outcomes. Deep learning models trained on dermoscopic and clinical images can support automated and fast triage. However, many studies evaluate only a limited set of architectures. Experimental setups also vary across studies. In this paper, we present a unified evaluation of twelve deep learning models for binary skin cancer detection on the PAD-UFES-20 dataset. The models span four families: convolutional neural networks (CNN), vision transformers (ViT), hybrid convolution transformer backbones, and vision language models (VLM). Performance is assessed using AUC, the maximum F1 score with its precision and recall, and sensitivity at 80% specificity, reflecting screening oriented requirements. Our results show that well tuned CNNs already provide strong baselines, but transformer based families consistently improve discrimination. Hybrid models (MaxViT Tiny, CoAtNet0) and a SigLIP based VLM achieve the best overall trade off between ranking performance and clinically relevant operating points, while CLIP based model offers high precision. The full codebase for all experiments is publicly released. Together, these findings offer practical guidance on which model families are most suitable for real world deployment in skin cancer screening and establish a reproducible reference point for future work on PAD-UFES-20.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/cnns-transformers-hybrid…

Read original on arxiv.org → arxiv.org/abs/2605.26294

mentioned entities

PAD-UFES-20

MaxViT Tiny

CoAtNet0

SigLIP

CLIP

metadata

slugcnns-transformers-hybrid-and-vision-language-models-for-skin-cancer-detection

topic#computer-vision

secondary4 topics

sentimentneutral

canonicalarxiv.org

navigation

← prevSejong University launches Asia’…

next →European AI adoption hits 99% wi…

── more in #computer-vision 4 stories · sorted by recency

machinebrief.com · 11 Jul · #computer-vision

Linear Paths to Compositional Brilliance

arxiv.org · 19 Jun · #computer-vision

Vortex: Multi-Modal Fusion System for Intelligent Video Retrieval

arxiv.org · 27 May · #computer-vision

Benchmarking Convolutional, Transformer, Hybrid, and Vision Language Models for Multi Disease Retinal Screening

machinebrief.com · 14 Jul · #computer-vision

GNNs: Decoupling Feature Transformation from Topology

── more on @pad-ufes-20 3 stories trending now

wpnews · 23 May · #artificial-intelligence

AccessLens — a blind person's lanyard, powered by Gemma 4 on-device

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 21 May · #developer-tools

Antigravity CLI: A Hands-On Guide to Google's Terminal Coding Agent

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required