MATH500

mentions 5 type Organization feed RSS

// recent coverage 5 mentions

04:00

2026-07-21

arxiv.org

artificial-intelligence

Trace-Based On-Policy Distillation for Masked Diffusion Language Models

Researchers propose trace-based on-policy distillation (TOPD), a teacher-supervised framework that transfers reasoning ability to a diffusion large language model (dLLM) without reward estimation. TOP…

04:00

2026-07-07

arxiv.org

large-language-models

TACG: Trajectory-Aware Commit Gating for Diffusion Language Model Decoding

Researchers propose Trajectory-Aware Commit Gating (TACG), a training-free decoder for diffusion language models that uses trajectory-aware signals to decide when to commit tokens, improving accuracy …

05:02

2026-06-19

discuss.huggingface.co

large-language-models

When Should LLMs Verify Instead of Think Longer?

Researchers introduced SEVRA, a serving-layer controller that decides when a frozen reasoning model should verify its answer instead of thinking longer, finding that selective verification improves ac…

04:00

2026-06-15

arxiv.org

large-language-models

SuperThoughts: Reasoning Tokens in Superposition

Researchers propose SuperThoughts, a method that compresses pairs of consecutive Chain-of-Thought tokens into single latent representations and decodes two tokens per step, doubling inference throughp…

23:39

2026-06-05

arxiv.org

machine-learning

Discrete Tilt Matching

Researchers have developed Discrete Tilt Matching (DTM), a likelihood-free method for fine-tuning masked diffusion large language models using reinforcement learning. The approach recasts fine-tuning …

// co-occurs with top 8 entities

GSM8K 3 SEVRA 1 Hugging Face 1 GitHub 1 Discrete Tilt Matching 1 LLaDA-8B-Instruct 1 Yuyuan Chen 1 Sudoku 1

// topics top 6 topics

large language models 5 machine learning 4 artificial intelligence 3 ai research 2 natural language processing 2 ai products 1