ProfiLLM: Utility-Aligned Agentic User Profiling for Industrial Ride-Hailing Dispatch

wpnews.pro


[ARTICLE · art-32060] src=arxiv.org ↗ pub=2026-06-18T04:00Z topic=large-language-models verified=true sentiment=↑ positive


Researchers from DiDi introduced ProfiLLM, an agentic LLM data pipeline that generates utility-aligned user profiles for industrial ride-hailing dispatch. Deployed on DiDi's production dispatcher, ProfiLLM achieved up to +6.14% relative AUC improvement in outcome prediction and up to +4.35% GMV gain in dispatching simulation. A 14-day online A/B test showed consistent gains, including +0.47% GMV, +0.33% Completion Rate, and -0.82% Cancel-Before-Accept rate.

Published Jun 18, 2026

arXiv:2606.18803v1

Abstract: Bringing Large Language Models (LLMs) into industrial ride-hailing dispatch as semantic feature extractors over platform-scale behavioral logs is a compelling but under-explored data systems problem. Production matching pipelines remain dominated by structured numerical features, yet decisive behavioral signals (e.g., a driver's habitual aversion to certain regions) are inherently contextual and naturally expressible as LLM-generated user profiles. However, scaling such profiling to a live, millisecond-latency dispatcher faces three intertwined constraints rarely addressed together: on a platform with millions of daily orders, logs exceed any LLM's context window by orders of magnitude; most users are long-tail, with too few interactions for per-user profiling; and surface-fluent profiles do not necessarily improve downstream prediction utility.

We present ProfiLLM, an agentic LLM data pipeline that operationalizes utility-aligned user profiling for production matching systems through two modules. (1) Tool-Augmented Global Knowledge Mining equips an LLM agent with 27 analytical tools to mine platform-scale data, producing reusable global knowledge, adaptive user clustering rules, and region-level supply-demand priors. (2) Utility-Aligned Profile Exploration generates multiple candidate profiles per cluster, evaluates them via a lightweight downstream utility proxy, iteratively refines the best candidates and constructs preference pairs for DPO fine-tuning.

Deployed on DiDi's production dispatcher, ProfiLLM achieves up to +6.14% relative AUC improvement in outcome prediction, up to +4.35% GMV gain in dispatching simulation, and consistent improvements in a 14-day online A/B test including +0.47% GMV, +0.33% Completion Rate, and -0.82% Cancel-Before-Accept rate.
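The abstract does not say how preference pairs are built from scored candidates, so the following is only a minimal sketch of one plausible reading of the "evaluate, then construct preference pairs" step. Everything named here is hypothetical: the `ScoredProfile` type, the `build_preference_pairs` function, the `min_gap` threshold, the prompt wording, and the example scores are illustrative assumptions, not the authors' method. The output uses a prompt/chosen/rejected layout of the kind DPO training libraries commonly accept.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class ScoredProfile:
    """A candidate profile plus its score (hypothetical structure)."""
    cluster_id: str
    text: str
    utility: float  # assumed: output of some downstream utility proxy


def build_preference_pairs(candidates, min_gap=0.01):
    """Pair candidates within each cluster; the higher-utility text is 'chosen'.

    Pairs whose utility gap is below min_gap are skipped as near-ties,
    an assumption made here to avoid training on noisy preferences.
    """
    by_cluster = {}
    for cand in candidates:
        by_cluster.setdefault(cand.cluster_id, []).append(cand)

    pairs = []
    for cluster_id, group in by_cluster.items():
        for a, b in combinations(group, 2):
            if abs(a.utility - b.utility) < min_gap:
                continue  # near-tie: no clear preference signal
            chosen, rejected = (a, b) if a.utility > b.utility else (b, a)
            pairs.append({
                # Hypothetical prompt; the real prompt is not given in the source.
                "prompt": f"Write a dispatch-relevant profile for cluster {cluster_id}.",
                "chosen": chosen.text,
                "rejected": rejected.text,
            })
    return pairs


# Illustrative inputs only; the scores are made up for the example.
example_candidates = [
    ScoredProfile("c1", "Avoids long cross-city trips at night.", 0.71),
    ScoredProfile("c1", "A friendly and reliable driver.", 0.64),
    ScoredProfile("c1", "Prefers short urban trips.", 0.705),
]
example_pairs = build_preference_pairs(example_candidates)
```

Under these assumptions, a fluent but uninformative profile ends up on the rejected side whenever a more predictive candidate exists in the same cluster, which is one way to push a profile generator toward downstream utility rather than surface fluency.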

source & further reading


Read original on arxiv.org → arxiv.org/abs/2606.18803
