02:20
2026-06-20
inceptionlabs.ai
large-language-models
Diffusion‑based LLMs that generate many tokens in parallel rather than one by one
Inception launched Mercury, a family of diffusion-based large language models that generate tokens in parallel rather than sequentially, which yields faster generation and higher GPU efficiency. The models a…