23:39
2026-06-05
arxiv.org
machine-learning
Discrete Tilt Matching
Researchers have developed Discrete Tilt Matching (DTM), a likelihood-free method for fine-tuning masked diffusion large language models using reinforcement learning. The approach recasts fine-tuning โฆ