# Common Crawl

> Entity coverage from Web Pulse
> Last updated: 2026-06-05T15:21:50.953528+00:00
> 1 article mentioning **Common Crawl**

- [Microsoft trained its MAI models on unlicensed web data despite promising "enterprise grade, clean and commercially licensed data"](https://wpnews.pro/news/microsoft-trained-its-mai-models-on-unlicensed-web-data-despite-promising-grade) — 2026-06-05
