Still No Evidence Mythos Better at Security Than Self-hosted LLMs

wpnews.pro

cd /news/ai-safety/still-no-evidence-mythos-better-at-s… · home › topics › ai-safety › article

[ARTICLE · art-39291] src=flyingpenguin.com ↗ pub=2026-06-25T12:19Z topic=ai-safety verified=true sentiment=↓ negative

Still No Evidence Mythos Better at Security Than Self-hosted LLMs

Anthropic's Mythos AI security tool, touted as too dangerous to release, found only one low-severity vulnerability in the heavily audited curl codebase after a privileged deployment. The curl project lead dismissed the hype as marketing, while a separate AI tool, AISLE, discovered 18 vulnerabilities in the same codebase, including the oldest issue ever reported, at a fraction of the cost.

read1 min views1 publishedJun 25, 2026

Anthropic allegedly built Mythos so good at finding vulnerabilities that it was too dangerous to release. Then it was handed to only a few dozen very wealthy organizations under Project Glasswing. One of them ran it against curl and sent the project a report claiming five confirmed security vulnerabilities. The curl security team dug in. Three were false positives flagging behavior already documented in the API docs. The fourth was just a bug. One survived: a low-severity CVE shipping with 8.21.0. The most dangerous code-analysis model in the world, pointed at one of the most audited C codebases in existence, found… a single low.

Whomp whomp, sad trombone for Mythos.

The project lead publicly wrote that the Mythos hype was primarily marketing, given no evidence Mythos finds issues to a higher or more advanced degree than tools that came before it. He also said he is not anti-AI-SAST. He reiterated that AI-powered code analyzers are significantly better at finding flaws than traditional analyzers ever were.

I agree with all of that 100%.

curl is one of the most fuzzed and audited C codebases in existence (OSS-Fuzz, Coverity, CodeQL, multiple paid audits), and finding anything is a good challenge. That’s why what happened next is so interesting.

The curl blog post about Mythos unleashed a wave of non-Mythos AI hunting as researchers piled onto curl with their own tooling. AISLE was hunting curl in fall 2025, before Mythos. When the blog post stirred the field, they were already deep in the codebase and just claimed 6 of 18 discovered. Compare those 18 to the single low-severity one that Mythos was credited with. The AISLE blog post makes it clear their AI method has been the most successful and yet it’s the least cost model, opposite of Mythos marketing.

source & further reading

flyingpenguin.com — original article Climate Denial Pulls Down US Defense Sea Walls: Sensor Gaps Invite Foreign Attack FreeFable Says the Mythos Monster They Sold You Is a Mouse Why We Need a Separation of AI Church and State

~/api · this article 200

$curl api.wpnews.pro/v1/news/still-no-evidence-mythos…

Read original on flyingpenguin.com → www.flyingpenguin.com/still-no-evidence-mythos-b…

mentioned entities

Anthropic

Mythos

curl

Project Glasswing

AISLE

Daniel Stenberg

OSS-Fuzz

Coverity

metadata

slugstill-no-evidence-mythos-better-at-security-than-self-hosted-llms

topic#ai-safety

secondary4 topics

sentimentnegative

canonicalflyingpenguin.com

navigation

← prevI Built a Log Monitoring Script …

next →These Asus WiFi 7 routers just d…

── more in #ai-safety 4 stories · sorted by recency

dev.to · 25 Jun · #ai-safety

AI Dev Weekly #16: Mistral OCR 4, Claude Tag, Alibaba Caught Stealing, GPT-5.6 Delayed

technologyreview.com · 25 Jun · #ai-safety

The Download: Europe’s heat wave hits the grid, and IBM’s chip targets Moore’s Law

letsdatascience.com · 25 Jun · #ai-safety

AI Disrupts Workplaces, New Nonprofit Hopes To Aid Workers

promptql.io · 25 Jun · #ai-safety

A Teardown of Claude Tag's Agent Identity Concept

── more on @anthropic 3 stories trending now

wpnews · 22 Jun · #generative-ai

Bain tests software takeover targets using vibecoding AI replicas

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 24 Jun · #ai-policy

An AI startup is suing the US government for taking away Anthropic's new model

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required