cd /news/autonomous-vehicles/personadrive-human-style-retrieval-a… · home topics autonomous-vehicles article
[ARTICLE · art-24796] src=arxiv.org ↗ pub= topic=autonomous-vehicles verified=true sentiment=· neutral

PersonaDrive: Human-Style Retrieval-Augmented VLA Agents for Closed-Loop Driving Simulation

Researchers have developed PersonaDrive, a pipeline that conditions vision-language-action driving agents on retrieved demonstrations from a style-instructed human driving dataset to produce aggressive, neutral, or conservative non-ego traffic agents in closed-loop simulations. The system, which requires no per-style retraining, improves driving scores by up to 4.6% over existing models on the Bench2Drive benchmark while enabling style-diverse behavior. Average speed and acceleration increase by 18% and 25% from conservative to aggressive instructions, demonstrating the pipeline's ability to generate human-like driving styles for more realistic simulation environments.

read1 min publishedJun 12, 2026

arXiv:2606.12616v1 Announce Type: new Abstract: Closed-loop driving simulators typically populate their environments with non-ego traffic agents that behave largely the same way, produced either by rule-based traffic managers or by learned models trained toward a single behavioral mode. Recent work introduces style variation through post-hoc labels on observational data or LLM-inferred reward weights, but these signals act as proxies for what a style should reward rather than demonstrations of humans explicitly asked to drive in that style. We introduce PersonaDrive, a pipeline that conditions a vision-language-action (VLA) driving agent on retrieved demonstrations from a style-instructed human driving dataset, in which participants drive CARLA leaderboard routes under aggressive, neutral, and conservative instructions on a driver-in-the-loop rig. The pipeline has three stages: (i) offline triplet mining over per-style human driving data using a combined image-text similarity score; (ii) training a lightweight retrieval head that fuses frozen visual features with a small control encoder over per-style databases; and (iii) fine-tuning a single VLA backbone to treat retrieved context points as in-context behavioral demonstrations during waypoint prediction. At inference, the same backbone is conditioned on any style by swapping which per-style database the retrieval head queries, so selecting a style requires no per-style retraining while enabling human-style, style-diverse non-ego agents for closed-loop simulation. On Bench2Drive, PersonaDrive (no style) improves the driving score by 4.6% over SimLingo and 2.5% over HiP-AD, and under style conditioning attains the highest driving score in every style within a roughly 2% band (its weakest style surpassing the strongest baseline, DMW, by 5.4%), while average speed and acceleration rise by 18% and 25% from the conservative to the aggressive instruction.

── more in #autonomous-vehicles 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/personadrive-human-s…] indexed:0 read:1min 2026-06-12 ·