An AI model programmed nonstop for 19 days on a single MirrorCode task that cost $2,600 to run

wpnews.pro

cd /news/artificial-intelligence/an-ai-model-programmed-nonstop-for-1… · home › topics › artificial-intelligence › article

[ARTICLE · art-41091] src=the-decoder.com ↗ pub=2026-06-26T17:24Z topic=artificial-intelligence verified=true sentiment=· neutral

An AI model programmed nonstop for 19 days on a single MirrorCode task that cost $2,600 to run

Epoch AI's new MirrorCode benchmark tests AI models on recreating complete programs without access to original code. Claude Opus 4.7 leads with a 56 percent solve rate, rebuilding a 16,000-line toolkit in 14 hours, but all models fail on the most complex tasks. One model ran nonstop for 19 days on a single task, costing $2,600.

read1 min views1 publishedJun 26, 2026

Epoch AI's new MirrorCode benchmark tests whether AI models can recreate complete programs without access to the original code. Claude Opus 4.7 leads with a 56 percent solve rate, rebuilding a 16,000-line toolkit in just 14 hours. But every model tested still fails on the most complex tasks.

The article An AI model programmed nonstop for 19 days on a single MirrorCode task that cost $2,600 to run appeared first on The Decoder.

source & further reading

the-decoder.com — original article AI startup Lindy ditched Claude entirely for Deepseek, saving millions as cost pressure mounts on Anthropic Altman won't go public for less than $1 trillion, so OpenAI's IPO may slip to 2027 Anthropic doesn't need junior engineers anymore thanks to AI and warns of an economic shock when other industries follow

~/api · this article 200

$curl api.wpnews.pro/v1/news/an-ai-model-programmed-n…

Read original on the-decoder.com → the-decoder.com/an-ai-model-programmed-nonstop-f…

mentioned entities

Epoch AI

MirrorCode

Claude Opus 4.7

The Decoder

metadata

slugan-ai-model-programmed-nonstop-for-19-days-on-a-single-mirrorcode-task-that-cost

topic#artificial-intelligence

secondary4 topics

sentimentneutral

canonicalthe-decoder.com

navigation

← prevWhat Is a Nomogram and Why Would…

next →What’s next for Katie Porter aft…

── more in #artificial-intelligence 4 stories · sorted by recency

dev.to · 26 Jun · #artificial-intelligence

Beyond AWS Service Names: Understanding the Problems They Actually Solve

mercuryagent.sh · 26 Jun · #artificial-intelligence

Mercury Agent

theartnewspaper.com · 26 Jun · #artificial-intelligence

Comment | Art Basel’s Zero 10 grows up and outgrows the digital community that led to its inception

dev.to · 26 Jun · #artificial-intelligence

Vibe Coding Is Not Software Development — And It's Starting to Show

── more on @epoch ai 3 stories trending now

wpnews · 19 Oct · #developer-tools

Windows Script to clean up and remove all ASUS software

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 1 Nov · #developer-tools

Custom Zig Test Runner, better ouput, timing display, and support for special "tests:beforeAll" and "tests:afterAll" tests

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required