04:00
2026-06-24
arxiv.org
autonomous-vehicles
FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning
Researchers propose FlowR2A, a generative model that unifies scoring-based and anchor-based driving planning by learning reward-conditioned action distributions from dense trajectory-reward pairs. The…