{"slug": "ai-watchdog", "title": "AI Watchdog", "summary": "The Atlantic's investigation reveals that tech companies have used at least 15 million videos and millions of songs to train AI models, often without permission. The report highlights the industry's reliance on copyrighted content and the controversial stance that public internet content is fair game for AI training.", "body_md": "[The Millions of Songs Mashed Into AI-Generated Music](https://www.theatlantic.com/technology/2026/06/ai-music-generators-suno-google-udio/687485/)\n\nExplore the astonishing amount of music available to AI developers.\n\n*The Atlantic’*s ongoing investigation of the books, videos, and other media used by the world’s most powerful tech companies to train their AI models.\n\nExplore the astonishing amount of music available to AI developers.\n\nTech companies believe in intellectual property, but not yours.\n\nLarge language models don’t “learn”—they copy. And that could change everything for the tech industry.\n\nAt least 15 million videos have been snatched by tech companies.\n\nMeta pirated millions of books to train its AI. Search through them here.\n\n“You shouldn’t have put your content on the internet if you didn’t want it to be on the internet,” Common Crawl’s executive director says.\n\nInside the data sets training new video-creating tools\n\n*Atlantic *writers help you wrap your mind around artificial intelligence and a new machine age.\n\nExplore the astonishing amount of music available to AI developers.\n\nIt involves 4chan, of all places.\n\nTech companies believe in intellectual property, but not yours.\n\nLarge language models don’t “learn”—they copy. And that could change everything for the tech industry.\n\n“You shouldn’t have put your content on the internet if you didn’t want it to be on the internet,” Common Crawl’s executive director says.\n\nInside the data sets training new video-creating tools\n\nAt least 15 million videos have been snatched by tech companies.\n\nAn ongoing investigation by *The Atlantic *to reveal the inner workings of generative AI\n\nThe video platform is quietly using AI to “improve clarity” in uploaded content. Why?\n\nCan AI companies keep stealing books to train their models?", "url": "https://wpnews.pro/news/ai-watchdog", "canonical_source": "https://www.theatlantic.com/category/ai-watchdog/", "published_at": "2026-06-22 02:21:11+00:00", "updated_at": "2026-06-22 02:39:50.379477+00:00", "lang": "en", "topics": ["artificial-intelligence", "ai-ethics", "ai-policy", "ai-research", "ai-tools"], "entities": ["The Atlantic", "Common Crawl", "Meta", "Google", "Suno", "Udio"], "alternates": {"html": "https://wpnews.pro/news/ai-watchdog", "markdown": "https://wpnews.pro/news/ai-watchdog.md", "text": "https://wpnews.pro/news/ai-watchdog.txt", "jsonld": "https://wpnews.pro/news/ai-watchdog.jsonld"}}