[ARTICLE · art-24306] src=simonwillison.net pub= topic=artificial-intelligence verified=true sentiment=neutral

Siri AI at WWDC 2026

Apple at WWDC 2026 announced a new generation of Siri AI features, including a custom Gemini-derived model running on its Private Cloud Compute infrastructure and vision LLMs to extract information from users' screens. The company also introduced a Core AI library with PyTorch extensions to let developers run their own models on Apple hardware. Apple confirmed that the Private Cloud Compute Gemini models are running in Google Cloud using NVIDIA GPUs, with all binaries published for public inspection.

2 min read · published Jun 8, 2026

Given how badly anyone who took Apple's WWDC 2024 Apple Intelligence announcements at face value got burned, I'm holding to a strict "I'll believe it when I see it" policy for everything they announced today.

The new Siri AI features do at least look feasible with today's technology, especially since Apple are licensing a custom Gemini-derived model that they can run on their own Private Cloud Compute.

It sounds like they'll be taking advantage of vision LLMs to extract information from the user's screen, which neatly sidesteps the need for every existing application to ship custom code in order to integrate with Apple Intelligence. Vision LLMs were a much less mature category in June 2024.

The new Core AI library looks like a good step in enabling developers to finally take full advantage of Apple's hardware for running their own models. It integrates with Meta's open source PyTorch ecosystem, using these Core AI PyTorch extensions:

Core AI PyTorch Extensions (coreai-torch) is a Python package that bridges PyTorch and Core AI. You can use it to bring up an existing PyTorch model, exported as a torch.export.ExportedProgram, into a Core AI AIProgram ready to run on Apple hardware, traversing the FX graph node-by-node and mapping ATen operators to Core AI operations.
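
To make the "traverse the graph node-by-node and map operators" idea concrete, here is a toy sketch in plain Python. It is not the coreai-torch API and it does not use PyTorch: the Node class is a simplified stand-in for an FX graph node, and the names on the right-hand side of OP_TABLE are made-up placeholders rather than real Core AI operations. Only the ATen-style names on the left mirror how operators typically appear in an exported PyTorch graph.

```python
from dataclasses import dataclass

# Hypothetical lookup table: ATen-style operator names -> made-up target names.
# The right-hand values are placeholders, not real Core AI operations.
OP_TABLE = {
    "aten.linear.default": "target.linear",
    "aten.relu.default": "target.relu",
    "aten.add.Tensor": "target.add",
}


@dataclass
class Node:
    """Toy stand-in for an FX graph node (an assumption for illustration)."""
    name: str
    op: str            # "placeholder", "call_function", or "output"
    target: str = ""   # operator name, used by call_function nodes
    args: tuple = ()   # names of the nodes this node consumes


def lower(nodes):
    """Walk the nodes in order and translate each operator via OP_TABLE."""
    program = []
    for node in nodes:
        if node.op == "placeholder":
            program.append(("input", node.name, ()))
        elif node.op == "call_function":
            if node.target not in OP_TABLE:
                # A real converter has to decide what to do with unmapped ops.
                raise NotImplementedError(f"no mapping for {node.target}")
            program.append((OP_TABLE[node.target], node.name, node.args))
        elif node.op == "output":
            program.append(("output", node.name, node.args))
        else:
            raise ValueError(f"unexpected node kind: {node.op}")
    return program


# A two-operator graph: relu(linear(x, w)).
graph = [
    Node("x", "placeholder"),
    Node("w", "placeholder"),
    Node("linear", "call_function", "aten.linear.default", ("x", "w")),
    Node("relu", "call_function", "aten.relu.default", ("linear",)),
    Node("output", "output", args=("relu",)),
]

lowered = lower(graph)
for step in lowered:
    print(step)
```

The point of the sketch is the shape of the work: a single ordered pass over the graph, a per-operator lookup, and an explicit failure when an operator has no mapping. How the real package handles unsupported operators is not stated in the quoted description.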

You can install an iOS 27 Developer Beta today, which supposedly has the new features - but you then have to make it through a waiting list for access to the new Siri AI. Aaron Perris from MacRumors reports having made it off the waitlist so we may start seeing credible reports on how well Siri AI works in the very near future.

Update: These Private Cloud Compute Gemini models are running in Google Cloud, on NVIDIA hardware. According to "Expanding Private Cloud Compute" on Apple's Security Research blog:

For the most demanding tasks, including agentic tool-use and complex reasoning, we worked with Google and NVIDIA to extend our PCC infrastructure to Google Cloud systems using NVIDIA GPUs, while maintaining Apple's powerful security and privacy protections. [...] PCC on Google Cloud leverages many of the same architectural security patterns as PCC on Apple silicon to implement these layered protections: initial network data parsing for each request happens in a dedicated process within its own namespace, shared inference software is recycled with a short time-to-live duration, and attested keys are held in a separate, dedicated confidential VM isolated from external inputs. [...]

As with PCC on Apple silicon, all binaries will be published for public inspection.

Tags: vision-llms, apple, generative-ai, ai, llms, gemini, nvidia, google
