Targeted Remasking: Replacing Token Editing with Token-to-Mask Refinement in Discrete Diffusion Language Models

wpnews.pro

cd /news/artificial-intelligence/targeted-remasking-replacing-token-e… · home › topics › artificial-intelligence › article

[ARTICLE · art-14926] src=arxiv.org ↗ pub=2026-05-27T04:00Z topic=artificial-intelligence verified=true sentiment=↑ positive

Targeted Remasking: Replacing Token Editing with Token-to-Mask Refinement in Discrete Diffusion Language Models

Researchers have developed Token-to-Mask (T2M) remasking, a training-free replacement for Token-to-Token (T2T) editing in discrete masked diffusion language models like LLaDA. The method resets suspected erroneous tokens back to the mask state for cleaner re-prediction, improving performance across 12 benchmarks with the largest gain of +5.92% on mathematics (CMATH). T2M repairs 59.4% of last-mile token corruption cases, where correct reasoning produces a corrupted final answer.

read1 min views14 publishedMay 27, 2026

arXiv:2605.26436v1 Announce Type: new Abstract: Discrete masked diffusion language models such as LLaDA generate text through iterative denoising, where mask tokens are progressively replaced with predicted tokens. LLaDA2.1 introduced a Token-to-Token (T2T) editing mechanism that accelerates generation by directly replacing committed tokens suspected of being incorrect. However, we identify fundamental limitations of T2T editing: it couples error detection with replacement, pollutes the generation context with potentially incorrect tokens, and introduces a train-inference noise mismatch where systematic model-generated errors differ from the random perturbations seen during training. We propose Token-to-Mask (T2M) remasking, a training-free, drop-in replacement for T2T editing that resets suspected erroneous tokens back to the mask state, allowing the diffusion process to re-predict them under cleaner context. We design and empirically validate three complementary error detection strategies -- probability-based, trigger-mirrored, and temporal-difference-based -- and provide a unified theoretical analysis showing that T2M remasking purifies the generation context, converts systematic inference errors back to the model's native mask noise type, and enables delayed commitment for joint multi-position optimization. Comprehensive experiments across 12 benchmarks spanning knowledge, reasoning, mathematics, coding, and instruction following show that T2M generally improves performance on tasks requiring precise token-level output, with the largest gain on mathematics (+5.92% on CMATH). Error analysis on CMATH reveals that the dominant failure mode is last-mile token corruption -- where correct reasoning produces a corrupted final answer -- and that T2M repairs 59.4% of such cases.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/targeted-remasking-repla…

Read original on arxiv.org → arxiv.org/abs/2605.26436

mentioned entities

LLaDA

LLaDA2.1

Token-to-Token

Token-to-Mask

metadata

slugtargeted-remasking-replacing-token-editing-with-token-to-mask-refinement-in

topic#artificial-intelligence

secondary4 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevSejong University launches Asia’…

next →European AI adoption hits 99% wi…

── more in #artificial-intelligence 4 stories · sorted by recency

arxiv.org · 17 Jun · #artificial-intelligence

Self-Generated Error Training for Token Editing in Diffusion Language Models

news.ycombinator.com · 14 Jul · #artificial-intelligence

Show HN: We Built a Chat of Stanford's CS229 Course Notes

turnitin.report · 14 Jul · #artificial-intelligence

Show HN: Turnitin Report – AI checker and AI detector for student papers

dev.to · 14 Jul · #artificial-intelligence

I Made My Voice Agent Feel Faster by Streaming Sentences, Not Audio

── more on @llada 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 23 May · #artificial-intelligence

AccessLens — a blind person's lanyard, powered by Gemma 4 on-device

wpnews · 21 May · #developer-tools

Antigravity CLI: A Hands-On Guide to Google's Terminal Coding Agent

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required