DriveStack-VLA: Render-Teacher Alignment for BEV-Based DeepStack Vision-Language-Action Model

wpnews.pro

cd /news/autonomous-vehicles/drivestack-vla-render-teacher-alignm… · home › topics › autonomous-vehicles › article

[ARTICLE · art-37210] src=arxiv.org ↗ pub=2026-06-24T04:00Z topic=autonomous-vehicles verified=true sentiment=↑ positive

DriveStack-VLA: Render-Teacher Alignment for BEV-Based DeepStack Vision-Language-Action Model

Researchers introduced DriveStack-VLA, a framework that enhances Vision-Language-Action driving models with Bird-Eye-View representations and Render-Teacher Alignment to improve spatial intelligence. The model achieved state-of-the-art results on NAVSIMv1, NAVSIMv2, and Bench2Drive benchmarks, demonstrating superior motion planning and safety-critical perception.

read1 min views5 publishedJun 24, 2026

arXiv:2606.24051v1 Announce Type: new Abstract: Vision-Language-Action driving models convert a pretrained Vision-Language Model into a driving policy, allowing them to use world knowledge and follow language guidances. However, existing VLA driving models still lack driving-oriented spatial intelligence: their policies are mainly grounded on perspective image tokens and language priors, while precise motion planning requires metric geometry, top-down scene structure, and attention to safety-critical perceptual cues. This limitation makes current models vulnerable to weak visual geometry modeling and perceptual coverage in expert demonstrations. In this paper, we present DriveStack-VLA, a framework built upon a large VLM backbone. To strengthen the spatial grounding of VLA driving, we develop dual visual modeling components. We inject a Bird-Eye-View representation into the Large Language Model decoder through a DeepStack-style connection, and propose Render-Teacher Alignment to align the perceptual focus of real images with that of rasterized images. Furthermore, to bridge the gap in multimodal trajectory selection, we introduce a head-based self-critique module that ranks sampled trajectories and conditionally refines the best one. DriveStack-VLA achieves 91.6 PDMS on NAVSIMv1, 91.0 EPDMS on NAVSIMv2 (with the human penalty filter enabled), and a driving score of 79.49 with a success rate of 56.36% on the closed-loop Bench2Drive. More visualizations are available on our project page: https://anonymous.4open.science/w/drivestack-vla/.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/drivestack-vla-render-te…

Read original on arxiv.org → arxiv.org/abs/2606.24051

mentioned entities

DriveStack-VLA

NAVSIMv1

NAVSIMv2

Bench2Drive

metadata

slugdrivestack-vla-render-teacher-alignment-for-bev-based-deepstack-vision-language

topic#autonomous-vehicles

secondary4 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevStop coding agents from writing …

next →Zhipu considers multibillion-dol…

── more in #autonomous-vehicles 4 stories · sorted by recency

arxiv.org · 25 Jun · #autonomous-vehicles

OrthoTrack: Continuous 6-DoF UAV Trajectory Estimation Anchored in Public Orthophotos

techpowerup.com · 25 Jun · #autonomous-vehicles

AMD FSR SDK v2.3 Arrives with Ray Regeneration 1.2 and FSR 4.1.1 for RDNA 3

pub.towardsai.net · 25 Jun · #autonomous-vehicles

You Do Not Need 50 Diffusion Steps. Here Is What Nvidia Proved at GTC.

dev.to · 25 Jun · #autonomous-vehicles

How AI and Tech Are Reshaping Geospatial Work

── more on @drivestack-vla 3 stories trending now

wpnews · 22 Jun · #generative-ai

Bain tests software takeover targets using vibecoding AI replicas

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 24 Jun · #ai-policy

An AI startup is suing the US government for taking away Anthropic's new model

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required