cd /news/ai-safety/the-national-academies-launders-myth… · home topics ai-safety article
[ARTICLE · art-40455] src=flyingpenguin.com ↗ pub= topic=ai-safety verified=true sentiment=↓ negative

The National Academies Launders Mythos: “Implications of AI for Cybersecurity”

The National Academies of Sciences, Engineering, and Medicine published a report on AI for cybersecurity that cites Anthropic's Mythos model as capable of finding 83.1% of vulnerabilities in CyberGym, but critics argue the claim is based on unverified vendor marketing and has been repeatedly disproven by independent tests showing similar results with open-weight models on consumer hardware.

read2 min views1 publishedJun 26, 2026

In April “The Boy That Cried Mythos” caught Anthropic collapsing its own credibility. In June “Mythos dressed up in a coat, should be called Opus with a moat” caught it again.

Anthropic wants to play God, feed on claims only they can verify, which is to say it feeds beliefs based on lies. If that sounds harsh, think about how the God of cycling Lance Armstrong treated anyone who suggested he was doping. He sure got a lot of medals for “livewrong“.

Now the Mythos lies have spilled their way into a venue claiming to use a formal review process. A new National Academies document (NASEM) freshly launders vendor marketing without any explanation.

National Academies of Sciences, Engineering, and Medicine. 2026. Implications of AI for Cybersecurity: A Rapid Expert Consultation. Washington, DC: The National Academies Press.

This should help clarify, for those who are wondering if we are dealing with a Lance Armstrong of LLMs.

NASEM Laundry (June 2026) Prior Evidence
Figure 1 plots Mythos at 83.1% on CyberGym as settled capability, sourced to “Wang et al. 2025” The

repeatedly disproven. Mythos emailed out of its sandbox only after being instructed to try, showed no sign of altering its weights, and Opus 4.6 finds the same or better flawsopposite of novel. It was a curated recovery from a backlog of delayed fixes, which any model does.not the machineoutputIronCurtain harness, andclearbluejarrecovered CVE-2026-4747 on two open-weight models on a single consumer GPU. Discovery is provable as an orchestration problem, making thefrontier-model unnecessary.vendor marketing trick. The June 2 expansion followed a June 1 confidential IPO filing near a one-trillion-dollar valuation, committing access and capital ahead of the promised verification, and several trialing firms are Anthropic investorscurl maintainers reported no change to their workflow, and Mozilla’s headline of 271 Firefox vulnerabilitiesreconcilesto just three versus the advisorynot independent confirmationunproven vendor capability as unproven

── more in #ai-safety 4 stories · sorted by recency
── more on @national academies of sciences, engineering, and medicine 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/the-national-academi…] indexed:0 read:2min 2026-06-26 ·