Semantic-Aware Generative Image Transmission for Resource-Constrained Visual IoT Systems

wpnews.pro

cd /news/computer-vision/semantic-aware-generative-image-tran… · home › topics › computer-vision › article

[ARTICLE · art-44314] src=arxiv.org ↗ pub=2026-06-30T04:00Z topic=computer-vision verified=true sentiment=↑ positive

Semantic-Aware Generative Image Transmission for Resource-Constrained Visual IoT Systems

Researchers propose a semantic-aware generative image transmission framework for resource-constrained visual IoT systems. The method selects and transmits only task-relevant image tokens based on semantic importance, achieving 29.9 dB PSNR at 0.074 bpp while using 44.6% of the bits of a 0.167-bpp reference. Experiments show it preserves task-relevant objects better than random masking under narrowband wireless links.

read1 min views1 publishedJun 30, 2026

arXiv:2606.28398v1 Announce Type: new Abstract: Resource-constrained visual Internet of Things (IoT) systems, such as edge cameras, unmanned sensing platforms, industrial inspection nodes, and remote monitoring sensors, often need to transmit task-relevant visual evidence over low-rate wireless links to an edge/cloud service. Existing image communication methods usually compress or transmit complete global representations, leaving limited room to exploit receiver-side generative restoration. This paper proposes a semantic-aware generative image transmission framework for edge-assisted visual IoT. The image captured by an IoT visual sensor is encoded into a discrete token grid by a VQ encoder. At the IoT transmitter or nearby gateway, token recoverability, estimated from prediction entropy and local structure complexity, is fused with semantic importance obtained from instance segmentation and category-aware scoring. A spatial dispersal sampler then selects the tokens to be transmitted under a bitrate budget. The transmitter sends only the quantization indices of kept tokens and a binary mask map, while the edge/cloud receiver recovers masked tokens through MaskGIT with Halton sequence scheduling. Experiments on Kodak and VisDrone scenes under AWGN and Rayleigh channels show that the proposed method provides a flexible bitrate-quality tradeoff for narrowband visual IoT links. At 0.074 bpp, it uses 44.6% of the transmitted bits of the 0.167-bpp DeepJSCC/WITT reference while achieving 29.9 dB PSNR. A pseudo-GT downstream detection study on Kodak further shows that semantic-aware masking preserves task-relevant objects better than random masking at both 30% and 50% mask ratios.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/semantic-aware-generativ…

Read original on arxiv.org → arxiv.org/abs/2606.28398

mentioned entities

MaskGIT

DeepJSCC

WITT

Kodak

VisDrone

AWGN

Rayleigh

metadata

slugsemantic-aware-generative-image-transmission-for-resource-constrained-visual-iot

topic#computer-vision

secondary3 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevShow HN: We made an Audio ML sha…

next →X rolls out hosted MCP server fo…

── more in #computer-vision 4 stories · sorted by recency

arxiv.org · 30 Jun · #computer-vision

Few-class Fidelity: Evaluating Explanations of Real-conditions CNN classifiers with Optimized Perturbations

arxiv.org · 30 Jun · #computer-vision

GPU-Accelerated Inverse Structural Anastylosis from Block Collapse Dynamics

arxiv.org · 30 Jun · #computer-vision

JASPR: Joint Spatial Representation learning of histology and spatial genomics for improved virtual genomic screening and clinical prognostication

arxiv.org · 30 Jun · #computer-vision

AEGIS: A Semantic GAN and Evidential Learning Frameworkfor Robust Adversarial Detection in Vision Sensors

── more on @maskgit 3 stories trending now

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 29 Jun · #ai-agents

I built 25 executable skills for AI coding agents �“ all open source

wpnews · 29 Jun · #large-language-models

The Silent Cost of AI Agents: Why Your Next.js SaaS Is Burning Money on LLM Calls

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required