Local AI detectors rated a 2013 Paul Graham essay as more AI than actual slop

wpnews.pro

cd /news/ai-safety/local-ai-detectors-rated-a-2013-paul… · home › topics › ai-safety › article

[ARTICLE · art-40857] src=ninjasandrobots.com ↗ pub=2026-06-26T13:59Z topic=ai-safety verified=true sentiment=↓ negative

Local AI detectors rated a 2013 Paul Graham essay as more AI than actual slop

A user submitted a fake AI-generated article to Hacker News, which was flagged after detection. Testing local AI detectors on a 2013 Paul Graham essay showed it was rated as more likely AI-generated than the actual fake article, highlighting the difficulty of automated AI detection.

read4 min views1 publishedJun 26, 2026

Local AI detectors rated a 2013 Paul Graham essay as more AI than actual slop — Image: source

#

[Paul Graham Flagged For AI Use](https://ninjasandrobots.com/paul-graham-flagged-for-ai-use)

Let me short-circuit the flames. He wasn’t using AI, but my attempts at trying to rid myself of AI slop in my feed reader flagged him as the worst offender.

Is he? No. It just points out how hard this is.

So just yesterday, I posted a neat article to Hacker News. It was from Kagi Small Web which I’ve been using in my feed reader a ton because hell yeah I want to support the small blogs out there like the one here. I’m sick of all the usual garbage. And here’s an article that has some interesting bits I’ve never heard before, considering I’ve been in the YC circle since late 2005. Like Airbnb almost went corporate housing to make ends meet!?

Immediately the submission jumped to the top of Hacker News. Lots of upvotes coming in. But then I started seeing “slop”, “AI;DR”, and then someone pointed out the author’s x handle doesn’t even exist.

Crap. I got fooled. This article is ridiculously AI generated. The whole site is: siliconopera.com. Of the authors I’ve clicked on, all their articles follow templates and regenerate the same themes day after day. They point to non-existent or completely not-them x.com accounts.

That sucked. My submission was [flagged] as it should be.

Of course I’m embarrassed. But I can at least try to use that as some fuel to fix the problem. I asked:

Does anyone use a decent “ai detection” algo/service/on device model that they are happy with?

No takers. I could just ask Claude Opus. When I put the silicon opera article through it, it confidently thinks it’s AI spam, but it also did the work of following links, digging into the fake author’s masthead, etc. I can’t do that every single article before I read it.

So I tried to get Claude to make something locally for me to test. Is there something that can run cheaply on my device that quickly goes through a whole feed?

Here’s what I tried.

Apple’s on-device model. I’ve been enjoying fooling with this one a lot since it’s fast and already easily lives on our modern macs. But it obviously isn’t that powerful. Also reminded of very guardrailsy old Claude. At one point Apple’s model refused to give me synonyms for Pimantle because it was convinced Pimantle was an adult website. Then it wouldn’t give me more synonyms because it already gave me 5, and more would be a “waste for resources” :) I gave it a couple chances. One prompt to just score how AI it is 0 to 100. And another to see if it understood how well-sourced it was.

GPT-2 perplexity. Basically, how surprised is the LLM by your next word. If you ask an LLM, hey did you predict I’d say: JumbleJuice? “Nah. You crazy human.” So the gist: low surprise is high AI likelihood.

RoBERTa (Robustly Optimized BERT Pre-Training Approach) is a model Facebook released a few years ago. Someone trained it with “this is human, this is ChatGPT” examples and it did decent in 2023. But 2026’s ChatGPT is an entirely different beast to predict now.

|---|---|---|---|
| Apple on-device model: “AI score” 0-100, higher = more AI | 75 | 90 | 70 |

| Apple on-device model: “how well sourced” 0-100, higher = better | 40 | 20 | 20 | | GPT-2 perplexity, lower = more AI in theory | 274 | 90 | 169 | | RoBERTa trained on ChatGPT: % AI | 0% | 0% | 0% |

Fack. Paul’s essay (from 2013!!!) reads as the spammiest AI slop to 3 of the 4. The RoBERTa thing is clearly useless.

But these detectors basically just tried to measure for overly good writing. Paul’s a great writer. Must be a robot.

Anyways, just wanted to apologize a bit for missing this. I hate causing even more noise in the community I lean on every day. I’m trying to make this stuff better. And clearly failing in some dimensions.

What does seem to work is digging into the article with humans or a very expensive agent. And I was hoping the Kagi Small Web had human verified this too. But this is tough at scale. At least I can contribute a PR to cleanup the Small Web list:

https://github.com/kagisearch/smallweb/pull/817 And I’d be an idiot not to mention, the feed reader I talked about above is something I’ve been working on myself: ** PageForth**. It’s been an awesome tool to summarize news I’d like to first vet if the subject matter is even interesting to me before diving in. It has an AI detector built into it ironically. It’s an improved version of one of the approaches I tested above. But clearly that part isn’t working yet :)

source & further reading

ninjasandrobots.com — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/local-ai-detectors-rated…

Read original on ninjasandrobots.com → ninjasandrobots.com/paul-graham-flagged-for-ai-u…

mentioned entities

Paul Graham

Hacker News

Kagi Small Web

Claude Opus

Apple

RoBERTa

GPT-2

siliconopera.com

metadata

sluglocal-ai-detectors-rated-a-2013-paul-graham-essay-as-more-ai-than-actual-slop

topic#ai-safety

secondary3 topics

sentimentnegative

canonicalninjasandrobots.com

navigation

← prevRetroid Pocket Nova pre order st…

next →Estonian court fines plaintiffs …

── more in #ai-safety 4 stories · sorted by recency

1chat.com · 26 Jun · #ai-safety

Was GLM-5.2 trained on Opus 4.5 outputs?

fernandoi.cl · 26 Jun · #ai-safety

What happened after 2k people tried to hack my AI assistant

nesbitt.io · 26 Jun · #ai-safety

Incident CVE-2026-LGTM

cio.com · 26 Jun · #ai-safety

Modernizar el IT heredado con IA sin disparar el riesgo regulatorio

── more on @paul graham 3 stories trending now

wpnews · 19 Oct · #developer-tools

Windows Script to clean up and remove all ASUS software

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 1 Nov · #developer-tools

Custom Zig Test Runner, better ouput, timing display, and support for special "tests:beforeAll" and "tests:afterAll" tests

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required