04:00
2026-06-25
arxiv.org
large-language-models
Efficient and Trainable Language Model Test-Time Scaling via Local Branch Routing
Researchers introduced Local Branch Routing (LBR), a token-level test-time scaling framework that expands a small local lookahead tree and uses a lightweight router to select the best branch, enabling…