{"type": "article", "title": "UK's AI Security Institute finds standard benchmarks systematically underestimate what AI agents can actually do", "publisher": "Web Pulse", "url": "https://wpnews.pro/news/uk-s-ai-security-institute-finds-standard-benchmarks-systematically-what-ai-can", "original_source": "https://the-decoder.com/uks-ai-security-institute-finds-standard-benchmarks-systematically-underestimate-what-ai-agents-can-actually-do/", "published": "2026-07-03T16:14:44+00:00", "accessed": "2026-07-04", "id": "uk-s-ai-security-institute-finds-standard-benchmarks-systematically-what-ai-can"}