{"slug": "anthropic-rankles-users-with-safety-first-fable-release", "title": "Anthropic rankles users with safety-first Fable release", "summary": "Anthropic released its Fable 5 AI model on Tuesday with strict safety guardrails that blocked many harmless queries, including questions about cancer research and public figures, triggering widespread user backlash. The company reversed some of its most conservative decisions less than two days later, apologizing for failing to balance safety with user access. The incident highlights growing tensions between AI companies' unilateral safety measures and users' demand for transparent, helpful AI-generated information.", "body_md": "Anthropic’s latest AI model might be the company’s most powerful public release, but the system’s strict safety measures quickly triggered some of the strongest backlash the AI giant has faced.\n\nMany users, some of whom have marveled at Anthropic’s previous announcements, torched the company for debuting its Fable 5 model on Tuesday with what they say are overly stringent guardrails. In some cases, when the model classified a query as potentially sensitive, it would provide a lower-quality answer without informing the user of the downgrade.\n\nAfter the outcry, Anthropic backtracked and reversed some of its most conservative decisions less than two days after Fable 5’s release, highlighting growing concerns about AI companies’ ability to unilaterally limit users’ access to helpful AI-generated information.\n\n“You should have visibility into the safeguards we have in place, and why. We’re sorry for not getting the balance right,” Anthropic [wrote on X early Thursday](https://x.com/ClaudeDevs/status/2064949876463645026).\n\nNathan Lambert, a leading AI researcher who champions collaborative approaches to building AI systems, [wrote](https://natolambert.substack.com/p/anthropic-walks-back-silently-nerfing) that with the cautious debut, “Anthropic has made it pretty clear that they only trust themselves as the mediators of cutting-edge AI research.”\n\n[Anthropic’s Fable 5 system](https://www.nbcnews.com/tech/security/fable-5-anthropic-release-public-mythos-claude-model-rcna349104) is the first consumer-facing system from Anthropic’s Mythos family of models. An early, nonpublic version of Mythos spooked policymakers and corporate executives in April for its ability to find [more than 10,000 severe bugs and vulnerabilities](https://www.anthropic.com/research/glasswing-initial-update) in important software systems.\n\nAnthropic fears that powerful AI models like Mythos could allow bad actors to use AI systems to commit crimes, from launching [crippling cyberattacks](https://darioamodei.com/post/policy-on-the-ai-exponential) against critical infrastructure to [designing bioweapons](https://red.anthropic.com/2025/biorisk/) that could kill masses of people.\n\nAs a result, Anthropic released Fable 5 with a strict set of guardrails preventing the model from answering a range of questions about cybersecurity or biology. Acknowledging its decision to err on the side of caution, Anthropic said that Fable 5’s safety-first approach might incorrectly flag harmless requests as being suspicious for less than 5% of queries.\n\n“With more capable models arriving in the coming months, we’re working to improve our safeguards and reduce false positives as quickly as we can.,” Anthropic wrote.\n\nPreliminary tests of the model conducted by NBC News, along with many examples shared on social media, found Anthropic’s protections took a broad view of potential suspicious activity, rendering the system useless for many mundane queries.\n\nFor example, the model refused NBC News’ requests to offer opinions on Elon Musk and Anthropic CEO Dario Amodei, asserting that the questions might be dangerous. Fable 5 also declined to answer many innocent biology-related questions, from queries about [open issues in cancer research](https://x.com/DeryaTR_/status/2064404588476948774?s=20) to which medical exams might best identify pancreatic injuries.\n\nFor those Fable 5 queries flagged as dangerous, Anthropic instead routes questions to a less-powerful system called Claude Opus 4.8, which had been the top-of-the-line system until Fable’s release on Tuesday.\n\nBecause it has been deployed for months, Opus 4.8 has a better ability to handle and redirect questions that could be seen as harmful. Opus 4.8 offered basic but clear answers to NBC News’ questions that Fable 5 had refused.\n\nAnthropic is also worried that competitors could use Anthropic’s AI systems to turbocharge their own research — Anthropic uses its own AI systems to help create the next generation of its models.\n\nTo prevent other AI companies from using Fable 5 to improve their own AI products or research, Anthropic said on Tuesday that it would include safeguards to make Fable 5’s answers less intelligent or useful for user questions that might be related to AI development.\n\nHowever, Anthropic said that these specific guardrails — unlike the cybersecurity and biology guardrails — would be invisible, since making them clear to users could allow competitors to more easily circumvent the barriers.\n\nThe move triggered immediate uproar, with some charging that such invisible guardrails were unfair and unethical. “You don’t want to go down as the first company to enable and open the door for human-designed AI manipulation at scale,” wrote leading AI researcher [Clement Delangue](https://x.com/ClementDelangue/status/2064673792303955985?s=20) on X, highlighting that Anthropic’s decision to invisibly degrade answers related to AI development would be “the highest form of manipulation.”\n\nAnthropic reacted quickly, updating its rules early Thursday morning to make such safeguards visible. [Wired first reported](https://www.wired.com/story/anthropic-responds-to-backlash-on-claudes-secret-sabotage-on-ai-research/) Anthropic’s reversal.\n\nPeter Wallich, a senior research manager at a AI safety research center called the [Constellation Institute](https://constellation.org/), said that the rocky rollout was less than ideal but sensible given the powerful technology.\n\n“It’s clearly frustrating for security researchers and biologists to be kicked back to Opus for innocuous tasks, which is a real cost,” Wallich told NBC News, emphasizing that he spoke in a personal capacity. “But this still seems to me like a reasonable trade-off. Regular people get the model earlier than they would otherwise.”\n\nWallich said that a more lax approach — prioritizing earlier public access to Fable 5’s full power without adequate protections — would be a far worse scenario. “The opposite failure mode, shipping with safeguards that were too loose, would be irresponsible and could have led to irreversible harm,” he said.", "url": "https://wpnews.pro/news/anthropic-rankles-users-with-safety-first-fable-release", "canonical_source": "https://www.nbcnews.com/tech/tech-news/anthropic-rankles-users-safety-first-fable-release-rcna349702", "published_at": "2026-06-11 22:26:09+00:00", "updated_at": "2026-06-11 22:46:27.263041+00:00", "lang": "en", "topics": ["artificial-intelligence", "ai-safety", "ai-policy", "ai-products", "large-language-models"], "entities": ["Anthropic", "Fable 5", "Nathan Lambert"], "alternates": {"html": "https://wpnews.pro/news/anthropic-rankles-users-with-safety-first-fable-release", "markdown": "https://wpnews.pro/news/anthropic-rankles-users-with-safety-first-fable-release.md", "text": "https://wpnews.pro/news/anthropic-rankles-users-with-safety-first-fable-release.txt", "jsonld": "https://wpnews.pro/news/anthropic-rankles-users-with-safety-first-fable-release.jsonld"}}