20:57
2026-05-25
transformernews.ai
artificial-intelligence
Against the METR Graph
AI researcher Nathan Witkin has challenged the validity of METR's widely-cited Long Tasks benchmark, arguing its methodology is fundamentally flawed despite its status as a leading indicator of AI cap…