cd /news/artificial-intelligence/openai-webrtc-audio-session-now-with… · home topics artificial-intelligence article
[ARTICLE · art-25759] src=simonwillison.net ↗ pub= topic=artificial-intelligence verified=true sentiment=↑ positive

OpenAI WebRTC Audio Session, now with document context

OpenAI's GPT-Realtime-2 model, promoted as having GPT-5-class reasoning, is now available in a WebRTC audio playground with document context support, enabling conversational audio interactions in the browser. The tool allows users to select the new model and paste document text for spoken discussions, though the model has not yet appeared in the ChatGPT iPhone app.

read1 min publishedJun 12, 2026

OpenAI WebRTC Audio Session, now with document context Last month OpenAI introduced a brand new model to that API called GPT‑Realtime‑2, which they promoted as "our first voice model with GPT‑5‑class reasoning" - with a Sep 30, 2024 knowledge cut-off.

I've been waiting for that model to show up in the ChatGPT iPhone app but it still hasn't, so I revisited my old playground.

You can now pick the better model, and you can also paste in a big chunk of document context so you can have as audio conversation in your browser about whatever information you think would be useful to explore in a conversational way.

Tags: audio, tools, ai, openai, generative-ai, llms, multi-modal-output, webrtc

── more in #artificial-intelligence 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/openai-webrtc-audio-…] indexed:0 read:1min 2026-06-12 ·