ML-GSAI

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

04:00

2026-06-25

arxiv.org

large-language-models

Improved Large Language Diffusion Models

Researchers introduced iLLaDA, an 8B masked diffusion language model trained from scratch with fully bidirectional attention, scaling pre-training to 12T tokens and fine-tuning on a 25B-token instruct…

// co-occurs with top 4 entities

iLLaDA 1 LLaDA 1 Qwen2.5 7B 1 arXiv 1