
An AI model programmed nonstop for 19 days on a single MirrorCode task that cost $2,600 to run

Published Jun 26, 2026

Epoch AI's new MirrorCode benchmark tests whether AI models can recreate complete programs without access to the original code. Claude Opus 4.7 leads with a 56 percent solve rate, rebuilding a 16,000-line toolkit in just 14 hours. But every model tested still fails on the most complex tasks. One model ran nonstop for 19 days on a single task, at a cost of $2,600.

The article "An AI model programmed nonstop for 19 days on a single MirrorCode task that cost $2,600 to run" first appeared on The Decoder.
