cd /news/large-language-models/meta-was-running-on-googles-ai-until… · home topics large-language-models article
[ARTICLE · art-43850] src=gadgetreview.com ↗ pub= topic=large-language-models verified=true sentiment=↓ negative

Meta Was Running on Google’s AI – Until Google Said No

Meta relied on Google's Gemini models for customer service, ad tools, and content moderation until Google imposed capacity limits in March 2026, forcing Meta to ration AI tokens and shift workloads to its own Muse Spark model. The incident highlights the structural AI compute shortage as demand outpaces infrastructure growth.

read2 min views1 publishedJun 29, 2026
Meta Was Running on Google’s AI – Until Google Said No
Image: Gadgetreview (auto-discovered)

Behind Meta’s open-source AI announcements, a quieter story was unfolding. Customer service, advertiser chatbots, scam detection, content moderation, internal coding tools — all running, at least partly, on Google’s Gemini models. Then around March 2026, according to the Financial Times, Google told Meta it couldn’t supply the compute Meta wanted. The AI self-sufficiency narrative cracked like a phone screen on day one.

When Your Rival’s AI Keeps the Lights On #

Meta’s internal AI stack relied heavily on Google’s Gemini — and the dependency ran deeper than anyone outside the company knew.

The scope of Meta’s [ Gemini dependency](https://www.ft.com/content/c5d52f72-71ef-40bc-bad3-61afdba8b378?syn-25a6b1a6=1) is striking. Sources told the Financial Times that Gemini was chosen

[because it was](https://www.techspot.com/news/112930-meta-told-staff-watch-their-ai-tokens-after.html)“performing better than Meta’s own models” for specific internal tasks. Meta also ran

Anthropic’s Claudealongside Gemini and Llama — hedging bets across providers rather than committing to a single external source.

Here’s what the reporting reveals:

  • Gemini powered customer service, ad tools, coding assistance, and harmful content takedowns across Meta’s platforms
  • Google imposed capacity limits around March 2026 after Meta requested more compute than available
  • Meta’s demand was unusually high compared to other Google Cloud customers
  • Employees were told to conserve “AI tokens”— the units measuring how much model capacity each request consumes - Meta began shifting workloads to Muse Spark, its own internal model built for content moderation

The Token Wall Nobody Talked About #

Global AI compute demand is now growing faster than even the largest infrastructure providers can build to meet it.

This isn’t Google punishing Meta. It’s structural. Google Cloud‘s AI backlog now exceeds $460 billion, according to Alphabet’s recent results. Its models process over

16 billion tokens per minute— up 60% quarter-over-quarter. Even after enormous data center investment, demand is outrunning supply. It’s the GPU shortage during the crypto boom, except now the scarce resource is inference capacity for models that half the Fortune 500 wants running simultaneously.

Meta’s response has been swift — possibly forced. The company is investing up to $600 billion in US data center capacity through 2028, building custom MTIA chips with Broadcom. This mirrors the ambition behind the

Stargate Project, as the race to control AI infrastructure accelerates across the industry. Separately, per CNBC, Meta is replacing third-party human moderation vendors like Accenture and Concentrix with AI systems. Muse Spark is absorbing moderation duties that Gemini once handled.

The uncomfortable reality: “AI-native” is doing heavy lifting as a marketing term. Behind the keynotes, the biggest platforms are buying critical AI capacity from the same companies they’re racing against. Meta builds toward self-sufficiency, but Meta’s token-rationing memo is this era’s equivalent of discovering your favorite restaurant sources its signature bread from the shop next door. What happens next depends entirely on who controls the ovens — and right now, that list is very short.

── more in #large-language-models 4 stories · sorted by recency
── more on @meta 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/meta-was-running-on-…] indexed:0 read:2min 2026-06-29 ·