Siri AI at WWDC 2026 Apple at WWDC 2026 announced a new generation of Siri AI features, including a custom Gemini-derived model running on its Private Cloud Compute infrastructure and vision LLMs to extract information from users' screens. The company also introduced a Core AI library with PyTorch extensions to let developers run their own models on Apple hardware. Apple confirmed that the Private Cloud Compute Gemini models are running in Google Cloud using NVIDIA GPUs, with all binaries published for public inspection. Given how badly burned anyone who took Apple's 2024 WWDC Apple Intelligence announcements https://simonwillison.net/2024/Jun/10/apple-intelligence/ at face value was, I'm holding to a strict "I'll believe it when I see it" policy for everything they announced today https://www.apple.com/newsroom/2026/06/apple-unveils-next-generation-of-apple-intelligence-siri-ai-and-more/ . The new Siri AI features do at least look feasible with today's technology, especially since Apple are licensing a custom Gemini-derived model that they can run on their own Private Cloud Compute https://simonwillison.net/2024/Jun/11/private-cloud-compute/ . It sounds like they'll be taking advantage of vision LLMs to extract information from the user's screen, which neatly sidesteps the need for every existing application to ship custom code in order to integrate with Apple Intelligence. Vision LLMs were a much less mature category in June 2024. The new Core AI library looks like a good step in enabling developers to finally take full advantage of Apple's hardware for running their own models. It integrates with Meta's open source PyTorch ecosystem, using these Core AI PyTorch extensions https://apple.github.io/coreai-torch/main/ : Core AI PyTorch Extensions coreai-torch is a Python package that bridges PyTorch and Core AI. You can use it to bring up an existing PyTorch model — exported as a torch.export.ExportedProgram — into a Core AI AIProgram ready to run on Apple hardware, traversing the FX graph node-by-node and mapping ATen operators to Core AI operations. You can install an iOS 27 Developer Beta today, which supposedly has the new features - but you then have to make it through a waiting list for access to the new Siri AI. Aaron Perris from MacRumors reports having made it off the waitlist https://twitter.com/aaronp613/status/2064078063814471977 so we may start seeing credible reports on how well Siri AI works in the very near future. Update : These Private Cloud Compute Gemini models are running in Google Cloud, and using NVIDIA hardware. According to Expanding Private Cloud Compute https://security.apple.com/blog/expanding-pcc/?linkId=100000425571569 on Apple's Security Research blog: For the most demanding tasks, including agentic tool-use and complex reasoning, we worked with Google and NVIDIA to extend our PCC infrastructure to Google Cloud systems using NVIDIA GPUs, while maintaining Apple's powerful security and privacy protections. ... PCC on Google Cloud leverages many of the same architectural security patterns as PCC on Apple silicon to implement these layered protections: initial network data parsing for each request happens in a dedicated process within its own namespace, shared inference software is recycled with a short time-to-live duration, and attested keys are held in a separate, dedicated confidential VM isolated from external inputs. ... As with PCC on Apple silicon, all binaries will be published for public inspection. Tags: vision-llms https://simonwillison.net/tags/vision-llms , apple https://simonwillison.net/tags/apple , generative-ai https://simonwillison.net/tags/generative-ai , ai https://simonwillison.net/tags/ai , llms https://simonwillison.net/tags/llms , gemini https://simonwillison.net/tags/gemini , nvidia https://simonwillison.net/tags/nvidia , google https://simonwillison.net/tags/google