04:00
2026-06-19
arxiv.org
large-language-models
PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models
Researchers introduced PerceptionDLM, a multimodal diffusion language model that enables parallel region perception for visual tasks, achieving significant speed improvements over sequential methods. โฆ