PID: Fast and High-Resolution Latent Decoding with Pixel Diffusion

wpnews.pro

cd /news/generative-ai/pid-fast-and-high-resolution-latent-… · home › topics › generative-ai › article

[ARTICLE · art-13660] src=research.nvidia.com ↗ pub=2026-05-25T15:23Z topic=generative-ai verified=true sentiment=↑ positive

PID: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Researchers have developed PiD, a pixel diffusion decoder that directly transforms latent representations into high-resolution images, bypassing the traditional decode-then-super-resolve pipeline. The system decodes 512×512 latents into 2048×2048 pixel images in under one second on a consumer RTX 5090 GPU, achieving up to 5.9× faster processing than cascaded diffusion-based super-resolution methods while improving visual fidelity. PiD unifies decoding and upsampling into a single generative module, enabling 4× and 8× upscaling with low latency and compatibility with both conventional VAE and semantic latents.

read2 min views11 publishedMay 25, 2026

Fast and High-Resolution Latent Decoding

with Pixel Diffusion

TL;DR: PiD directly decodes latent representations into high-resolution images, replacing the decode–then–super-resolve cascade while achieving lower latency and higher visual quality. *

Abstract #

Most practical high-resolution text-to-image systems rely on latent diffusion models, where generation is performed in a compact latent space and a decoder maps latents back to pixels. Yet the latent-to-pixel decoder is reconstruction-oriented, optimized to invert the encoder rather than synthesize more details, and becomes increasingly costly at megapixel scale. This drawback calls for a more expressive and efficient decoding paradigm. Motivated by recent progress in scalable pixel-space diffusion, we introduce PiD, a Pi xel diffusion D ecoder that reformulates latent decoding as conditional pixel diffusion, unifying decoding and upsampling into one generative module. By denoising directly in high-resolution pixel space, PiD synthesizes 4× and even 8× upscaled images with low latency. For latent conditioning, a lightweight sigma-aware adapter injects noise-corrupted latents into the pixel diffusion backbone, enabling PiD to decode partially denoised latents and terminate the latent diffusion process early. To further improve efficiency, we distill the model using DMD2, reducing inference to just 4 steps. PiD applies to both conventional VAE latents and semantic latents (e.g., SigLIP, DINOv2) used in recent RAE-based models. PiD decodes latents of 512×512 images into 2048×2048 pixels in under 1 second with 13 GB peak memory on a consumer RTX 5090, and as fast as 210 ms on a GB200 GPU, about 6× faster than cascaded diffusion-based super-resolution pipelines with better visual fidelity.

Results #

From Latent to Pixels

4K Decode

Baseline Comparison

Quantitative Results (Decoding + Upsampling, 512² → 2048²)

PiD is up to 5.9× faster than SeedVR2 (211.2 ms vs 1237.5 ms)

% of evaluations where judges prefer PiD over each baseline

source & further reading

research.nvidia.com — original article CoFrGeNets replace the ‘bones’ of transformer-based models How training environments can teach AI models to misbehave Running AI on mixed hardware for speed and affordability

~/api · this article 200

$curl api.wpnews.pro/v1/news/pid-fast-and-high-resolu…

Read original on research.nvidia.com → research.nvidia.com/labs/sil/projects/pid/

mentioned entities

PiD

Pixel Diffusion Decoder

DMD2

SigLI

metadata

slugpid-fast-and-high-resolution-latent-decoding-with-pixel-diffusion

topic#generative-ai

secondary4 topics

sentimentpositive

canonicalresearch.nvidia.com

navigation

← prevThe AI Compute Extensions (Ace) …

next →The Stepford AI

── more in #generative-ai 4 stories · sorted by recency

runtimewire.com · 9 Jul · #generative-ai

Aleph's silent-speech prototype reads tongue motion at 15.6% word error rate

aiornot.vote · 9 Jul · #generative-ai

AI or Not: Animal

eos.org · 9 Jul · #generative-ai

Comparing Machine Learning Models of Raindrop Formation

dev.to · 9 Jul · #generative-ai

Top AI Papers on Hugging Face - 2026-07-09

── more on @pid 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 8 Jul · #artificial-intelligence

Anthropic's "J-lens" reveals workspace in Claude mirrors theory of consciousness

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required