{"type": "article", "title": "What Benchmarks Don't Measure: The Case for Evaluating Abstention Competence in Autonomous Agents", "publisher": "Web Pulse", "url": "https://wpnews.pro/news/what-benchmarks-don-t-measure-the-case-for-evaluating-abstention-competence-in", "original_source": "https://arxiv.org/abs/2606.02965", "published": "2026-06-03T04:00:00+00:00", "accessed": "2026-06-03", "id": "what-benchmarks-don-t-measure-the-case-for-evaluating-abstention-competence-in"}