[ARTICLE · art-27572] src=news.ycombinator.com ↗ pub= topic=ai-research verified=true sentiment=· neutral

Ask HN: Data source used for training Anthropic's Mythos?

An anonymous user on Hacker News asked how Anthropic obtained private security bug data for training its Mythos or Fable AI models, noting that Anthropic's documentation only mentions publicly accessible data. The user speculated that such data is typically sensitive and not publicly available, raising questions about the training sources.

1 min read · published Jun 15, 2026

2 points

This question came up from a tangential discussion. Someone was asking for suggestions about systems architecture, and I suggested they look into RCAs (root cause analyses) of some large-scale failures. The hard part is finding this kind of data publicly. That prompted the question about the data source for Mythos/Fable training. Security bug reports and their fixes are somewhat sensitive, private collections. I did check Anthropic's documentation on this; it only mentions the use of publicly accessible data. Any ideas how Anthropic would have got their hands on this private data, if they used it at all?
