UtVAA: Ultra-tiny Vision Transformer with Affix Attention for Mobile Image Classification

wpnews.pro


Source: arxiv.org · Published: 2026-06-16T04:00Z · Topic: computer-vision

Researchers have introduced UtVAA, an ultra-tiny Vision Transformer architecture with a novel Affix Attention block for mobile image classification. The smallest variant has 204.67K parameters and 53.95M FLOPs, and the authors report competitive accuracy on CIFAR-10, CIFAR-100, PlantVillage-Tomato, and SLIF-Tomato within a sub-million-parameter regime. They conclude that transformer-based vision models can be shrunk to this scale without significant loss in discriminative performance, which makes UtVAA a candidate for mobile and edge deployment.


arXiv:2606.14735v1. Abstract: Vision Transformers (ViTs) have demonstrated strong representation capability in image classification. However, their quadratic self-attention complexity and large parameter counts limit deployment on resource-constrained mobile and edge devices. This paper introduces UtVAA, an ultra-tiny Vision Transformer architecture designed for efficient visual recognition under strict computational budgets. It incorporates a novel Affix Attention block that combines depthwise-pointwise local feature extraction, linear self-attention, coordinate attention for spatial dependency modelling, and a lightweight ternary fusion strategy to integrate local and global representations. In addition, Dilated Bottleneck blocks expand the receptive field using dilated depthwise separable convolutions while maintaining low FLOPs and stable optimisation through residual connections. UtVAA is implemented in scalable Tiny, Medium, and Large variants, with the smallest model containing 204.67K parameters and 53.95M FLOPs. Experimental results on CIFAR-10, CIFAR-100, PlantVillage-Tomato and SLIF-Tomato datasets show that UtVAA achieves competitive accuracy within a sub-million-parameter regime. Overall, the results demonstrate that transformer-based vision models can be redesigned into ultra-tiny architectures without significant loss in discriminative performance, making UtVAA suitable for mobile and edge deployment. Code is available at https://github.com/romiyal/UtVAA
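
To give a rough sense of why the building blocks named in the abstract keep the model small, the sketch below does back-of-the-envelope arithmetic for three of them: depthwise separable convolution, dilation, and linear attention. It is not taken from the UtVAA code. The function names, the channel width of 64, and the 16 x 16 token grid are illustrative assumptions, and the abstract does not say which linear attention variant is used, so the cost shown assumes the common formulation that computes K^T V before multiplying by Q.

```python
# Illustrative arithmetic only; sizes and names are assumptions, not UtVAA's.

def standard_conv_params(c_in, c_out, k):
    """Weights in a standard k x k convolution (bias omitted)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in, c_out, k):
    """Depthwise k x k (one filter per input channel) plus a 1 x 1 pointwise conv."""
    return k * k * c_in + c_in * c_out


def effective_kernel(k, dilation):
    """Span, in pixels, covered by a k-tap kernel at the given dilation rate."""
    return k + (k - 1) * (dilation - 1)


def softmax_attention_cost(n_tokens, dim):
    """Approximate multiply-adds for Q K^T plus the weighted sum over V."""
    return 2 * n_tokens * n_tokens * dim


def linear_attention_cost(n_tokens, dim):
    """Approximate multiply-adds when the dim x dim matrix K^T V is formed first."""
    return 2 * n_tokens * dim * dim


if __name__ == "__main__":
    c, k = 64, 3          # assumed channel width and kernel size
    n, d = 16 * 16, 64    # assumed token count (16 x 16 grid) and embedding size

    print("standard conv weights:       ", standard_conv_params(c, c, k))
    print("depthwise separable weights: ", depthwise_separable_params(c, c, k))
    print("3x3 kernel span at dilation 2:", effective_kernel(k, 2))
    print("softmax attention mult-adds: ", softmax_attention_cost(n, d))
    print("linear attention mult-adds:  ", linear_attention_cost(n, d))
```

Under these assumed sizes the separable convolution needs roughly an eighth of the weights of a standard one, and dilation widens the span of a 3 x 3 kernel without adding any weights. The attention cost also stops growing with the square of the token count, which is the bottleneck the abstract attributes to standard ViTs.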

source & further reading

Read original on arxiv.org → arxiv.org/abs/2606.14735
