17:24
2026-06-26
the-decoder.com
artificial-intelligence
An AI model programmed nonstop for 19 days on a single MirrorCode task that cost $2,600 to run
Epoch AI's new MirrorCode benchmark tests AI models on recreating complete programs without access to original code. Claude Opus 4.7 leads with a 56 percent solve rate, rebuilding a 16,000-line toolkiβ¦