04:00
2026-06-16
arxiv.org
machine-learning
QPILOTS: Efficient Test-Time Q-Steering for Flow Policies
Researchers propose QPILOTS, a method that steers flow-matching and diffusion policies at inference time by projecting noisy intermediate states to clean action estimates for critic gradient computatiβ¦