04:00
2026-06-19
arxiv.org
large-language-models
Cost-Optimal LLM Routing with Limited User Feedback under User Satisfaction Guarantees
Researchers introduced SLARouter, an online routing algorithm for large language model (LLM) applications that learns cost-optimal policies from sparse user feedback while guaranteeing Service Level A…