AI Watchdog The Atlantic's investigation reveals that tech companies have used at least 15 million videos and millions of songs to train AI models, often without permission. The report highlights the industry's reliance on copyrighted content and the controversial stance that public internet content is fair game for AI training. The Millions of Songs Mashed Into AI-Generated Music https://www.theatlantic.com/technology/2026/06/ai-music-generators-suno-google-udio/687485/ Explore the astonishing amount of music available to AI developers. The Atlantic’ s ongoing investigation of the books, videos, and other media used by the world’s most powerful tech companies to train their AI models. Explore the astonishing amount of music available to AI developers. Tech companies believe in intellectual property, but not yours. Large language models don’t “learn”—they copy. And that could change everything for the tech industry. At least 15 million videos have been snatched by tech companies. Meta pirated millions of books to train its AI. Search through them here. “You shouldn’t have put your content on the internet if you didn’t want it to be on the internet,” Common Crawl’s executive director says. Inside the data sets training new video-creating tools Atlantic writers help you wrap your mind around artificial intelligence and a new machine age. Explore the astonishing amount of music available to AI developers. It involves 4chan, of all places. Tech companies believe in intellectual property, but not yours. Large language models don’t “learn”—they copy. And that could change everything for the tech industry. “You shouldn’t have put your content on the internet if you didn’t want it to be on the internet,” Common Crawl’s executive director says. Inside the data sets training new video-creating tools At least 15 million videos have been snatched by tech companies. An ongoing investigation by The Atlantic to reveal the inner workings of generative AI The video platform is quietly using AI to “improve clarity” in uploaded content. Why? Can AI companies keep stealing books to train their models?