NVIDIA's Nemotron Diffusion: One Model, Three Generation Modes, 6 Faster
NVIDIA has released the Nemotron-Labs Diffusion family of open-weight language models (3B, 8B, 14B, and an 8B VLM) that can operate in three generation modes—autoregressive, diffusion, or self-speculative—from a single c…