cd /news/artificial-intelligence/proof-of-agi-is-the-impossibility-of… · home topics artificial-intelligence article
[ARTICLE · art-35338] src=thewatershed.markpesce.com ↗ pub= topic=artificial-intelligence verified=true sentiment=· neutral

Proof of AGI is the impossibility of evals

Mark Pesce of the University of Sydney argues that the growing intractability of AI evaluations is itself proof that artificial general intelligence (AGI) has arrived. He contends that AI evals fail for the same reasons human intelligence tests have been problematic for 150 years—constructs resist decomposition, benchmarks saturate, and goalposts move—because the measured capability is the same general intelligence. The difficulty of evaluation, he says, is the evidence of AGI.

read1 min views1 publishedJun 21, 2026
Proof of AGI is the impossibility of evals
Image: source

Mark Pesce · University of Sydney · June 2026

tl;dr The failure of AI evaluations is itself the proof of the existence of AGI. AI evals are becoming intractable for the same reasons that measuring human general intelligence has been intractable for 150 years: the constructs resist decomposition, the benchmarks saturate, and the goalposts move. The reasons are the same because the thing being measured is the same. The difficulty is the evidence.

The reason AI evals are becoming intractable is the same reason human intelligence testing has been intractable for 150 years. The constructs resist decomposition into measurable components. The benchmarks saturate because the capability being tested is too general to be captured by any specific test. The goalposts move because the thing being measured keeps exceeding the frame of measurement.

AI evaluation is hard for exactly the same reasons that measuring human general intelligence is hard, because in both cases the thing being measured is general intelligence.

The difficulty is itself the evidence. We do not need a theoretical proof that artificial general intelligence has arrived. The practical failure of our evaluation instruments tells us. When testing AI becomes as hard as testing humans, and hard for the same reasons, the question has already been answered.

If it quacks in practice, it's a duck in principle.

Acknowledgements #

This paper emerged from deep discussions with both John Allsopp and Alan Eyzaguirre, and was drafted by Claude Cowork from my extensive notes. I remain responsible for any errors that may have crept in.

── more in #artificial-intelligence 4 stories · sorted by recency
── more on @mark pesce 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/proof-of-agi-is-the-…] indexed:0 read:1min 2026-06-21 ·