{"slug": "anthropics-new-model-is-mythos-on-a-leash", "title": "Anthropic’s new model is Mythos on a leash", "summary": "Anthropic released a restricted version of its powerful AI model Claude Mythos to the public on Tuesday, calling it Claude Fable 5. The company said the new model has \"new guardrails\" that route queries on cybersecurity and biology topics to a less capable, already-public model to prevent misuse. The release follows months of concern from experts and policymakers that Mythos' advanced hacking and bioweapons capabilities could enable even low-level cybercriminals to cause serious damage.", "body_md": "# Anthropic’s new model is Mythos on a leash\n\nEarlier this year, Anthropic executives said that their new AI model, [Claude Mythos](https://cyberscoop.com/tag/mythos/), had such powerful capabilities for harm that they would not release it publicly.\n\nOn Tuesday, the company said it was making an altered version of Mythos available to the public, promising “new guardrails” that thwart the model’s best-in-class performance in hacking and bioweapons research.\n\nAnthropic said Claude Fable 5 was the “same underlying model” as Mythos, but its responses for certain topics like cybersecurity and biology will be drawn from a previous Claude Opus model that is already public.\n\n“Releasing a model this capable comes with risks. Without safeguards, Fable 5’s capabilities in areas like cybersecurity could be misused to cause serious damage,” the company said in a draft blog sent to CyberScoop ahead of the announcement. “We’ve therefore launched the model with safeguards that route queries on a narrow set of topics to our next-most-capable model, Claude Opus 4.8.”\n\nAnthropic also said they subjected Fable 5 to both internal and external red team testing for common model vulnerabilities, like jailbreaking. Anthropic said these tests identified no known “universal” jailbreaking techniques, but does not specify if partial jailbreaking techniques were discovered.\n\nThe company is betting that won’t change when Fable 5 is made available to the broader public, but it’s worth noting that cybersecurity researchers have consistently found ways to jailbreak older AI models.\n\n“The uplift from Mythos-level capabilities is valuable to many adversaries—for instance, those who could financially gain from cyberattacks—and we therefore expect them to be motivated to try to circumvent our safety measures,” the company wrote.\n\nAnthropic is changing its data retention policies for Fable and Mythos models, keeping all user traffic for 30 days on both its own platforms and third-party services. A [White House executive order](https://cyberscoop.com/donald-trump-white-house-ai-executive-order-scaled-back/) creates a voluntary framework for AI companies to share frontier models with the government up to 30 days before public release. The company says the retained data won’t be used to train new Claude models or for “any non-safety-related-purpose.”\n\nFollowing publication, a spokesperson for Anthropic told CyberScoop the company’s data retention policies “are specific to their safeguards work and is unrelated to the EO.”\n\nMost organizations are still deciding whether to adopt AI into their IT and cybersecurity ecosystem. But models like Mythos can scan for vulnerabilities, chain together exploits, and steal data from a victim network in minutes. Automation in hacking existed before AI, but experts [have said](https://cyberscoop.com/state-hackers-using-gemini-google-ai/) frontier models like Mythos and OpenAI’s Daybreak can allow even low-level cybercriminals to wreak havoc.\n\nWhile Anthropic cited its commitment to developing safe and secure AI in its reasons for not publicly releasing Mythos, many organizations have been clamoring for access, and its enhanced cybersecurity functions in cybersecurity and other areas have been the subject of congressional hearings, national security papers and White House executive orders.\n\nReleasing a limited version of the model in Fable 5 represents an attempt to split the difference between those two desires. Anthropic said it would release follow up benchmarks and assets for the model.\n\n#### So what can Fable 5 do?\n\nAnthropic said it’s possible the restrictions built into Fable will make it harder for the model to fulfill both malicious and legitimate user requests.\n\n“Because we have prioritized safety, we’ve deliberately tuned the safeguards to be cautious, and they are still stricter than would be ideal—for example, sometimes benign requests will trigger our classifiers,” the company wrote. “We recognize that this will be frustrating to some users, and our aim is to reduce false positives as we update and refine the safeguards after launch.”\n\nIf Fable 5 draws its cybersecurity and biology answers entirely from Claude Opus 4.8, it will still provide users with impressive – though not unique – dual use cybersecurity capabilities.\n\nAccording to the [system card](https://anthropic.com/claude-opus-4-8-system-card) published for Opus 4.8, the model is a slight improvement on previous models like 4.7 in the realm of cybersecurity but was “generally much less capable than Mythos Preview.”\n\nOpus 4.8 was tested on its ability to write complete end-to-end exploits and build exploit primitives that provide attackers with the ability to execute arbitrary code. It averaged a score just 5 out of 16 in proficiency, compared to Mythos Preview which scored closer to 10.\n\nWithout safety guardrails in place, Opus 4.8 can still reproduce nearly 80% of previously discovered vulnerabilities in real open-source software projects when given a high level description of the weakness. The system card said Anthropic’s unspecified safeguards whittle this success rate down to 1%.\n\nAnother test assessing Opus’ ability to develop exploits for the popular Firefox browser found that, again without guardrails, the model could identify a full working exploit 8.8% of the time and a partial working exploit 68.8% of the time.\n\nThe company also said that members of Project Glasswing – a consortium of public and private businesses given access to a preview version of Mythos – will be able to upgrade to the latest full model, Claude Mythos 5, to continue their work. Access to Mythos 5 will be expanded over time “through a more systematic trusted-access program” including federal agencies.", "url": "https://wpnews.pro/news/anthropics-new-model-is-mythos-on-a-leash", "canonical_source": "https://cyberscoop.com/anthropic-claude-fable-5-release-mythos-guardrails/", "published_at": "2026-06-09 17:00:00+00:00", "updated_at": "2026-06-11 18:09:46.327504+00:00", "lang": "en", "topics": ["artificial-intelligence", "ai-safety", "ai-policy", "ai-products", "large-language-models"], "entities": ["Anthropic", "Claude Mythos", "Claude Fable 5", "Claude Opus 4.8", "CyberScoop"], "alternates": {"html": "https://wpnews.pro/news/anthropics-new-model-is-mythos-on-a-leash", "markdown": "https://wpnews.pro/news/anthropics-new-model-is-mythos-on-a-leash.md", "text": "https://wpnews.pro/news/anthropics-new-model-is-mythos-on-a-leash.txt", "jsonld": "https://wpnews.pro/news/anthropics-new-model-is-mythos-on-a-leash.jsonld"}}