17:44
2026-06-12
cryptobriefing.com
ai-research
FrontierMath benchmark undergoes major audit as Epoch AI flags errors in one-third of math problems
Epoch AI disclosed on May 11, 2026, that an internal audit of its FrontierMath benchmark, a 350-problem test developed with over 60 mathematicians to evaluate AI reasoning, found fatal errors in roughβ¦