Proof of AGI is the impossibility of evals

Source: thewatershed.markpesce.com · Published 2026-06-21

Mark Pesce of the University of Sydney argues that the growing intractability of AI evaluations is itself proof that artificial general intelligence (AGI) has arrived. He contends that AI evals fail for the same reasons human intelligence tests have been problematic for 150 years—constructs resist decomposition, benchmarks saturate, and goalposts move—because the measured capability is the same general intelligence. The difficulty of evaluation, he says, is the evidence of AGI.

Mark Pesce · University of Sydney · June 2026

tl;dr The failure of AI evaluations is itself the proof of the existence of AGI. AI evals are becoming intractable for the same reasons that measuring human general intelligence has been intractable for 150 years: the constructs resist decomposition, the benchmarks saturate, and the goalposts move. The reasons are the same because the thing being measured is the same. The difficulty is the evidence.

The reason AI evals are becoming intractable is the same reason human intelligence testing has been intractable for 150 years. The constructs resist decomposition into measurable components. The benchmarks saturate because the capability being tested is too general to be captured by any specific test. The goalposts move because the thing being measured keeps exceeding the frame of measurement.

AI evaluation is hard for exactly the same reasons that measuring human general intelligence is hard, because in both cases the thing being measured is general intelligence.

The difficulty is itself the evidence. We do not need a theoretical proof that artificial general intelligence has arrived. The practical failure of our evaluation instruments tells us. When testing AI becomes as hard as testing humans, and hard for the same reasons, the question has already been answered.

If it quacks in practice, it's a duck in principle.

Acknowledgements

This paper emerged from deep discussions with both John Allsopp and Alan Eyzaguirre, and was drafted by Claude Cowork from my extensive notes. I remain responsible for any errors that may have crept in.

Read original on thewatershed.markpesce.com → thewatershed.markpesce.com/quacks-ergo-duck/
