04:00
2026-06-24
arxiv.org
autonomous-vehicles
DriveStack-VLA: Render-Teacher Alignment for BEV-Based DeepStack Vision-Language-Action Model
Researchers introduced DriveStack-VLA, a framework that enhances Vision-Language-Action (VLA) driving models with Bird's-Eye-View (BEV) representations and Render-Teacher Alignment to improve spatial intelligence. …