04:00
2026-06-17
arxiv.org
large-language-models
MapSatisfyBench: Benchmarking Satisfaction-Aware Map Agents through Behavior-Grounded Implicit Decision Factors
Researchers introduced MapSatisfyBench, a benchmark to evaluate how well large language model agents in map services recover implicit user needs from available information before responding. The benchβ¦