[ARTICLE · art-30292] src=testingcatalog.com ↗ pub= topic=large-language-models verified=true sentiment=↑ positive

OpenAI prepares major ChatGPT voice upgrade with GPT-Bidi-1

OpenAI is preparing a major upgrade to ChatGPT's voice mode with a new audio model tentatively named GPT-Bidi-1, which uses a bidirectional architecture for more natural, interruptible conversations. The upgrade aims to close the gap between ChatGPT's text and voice capabilities, with a rollout expected soon across web and mobile platforms.

Published Jun 16, 2026 · 1 min read

OpenAI looks set to give ChatGPT's voice mode its biggest upgrade in months, with preparations underway for a next-generation audio model tentatively tagged GPT-Bidi-1. The name points to the bidirectional, or "BiDi," architecture the company has been building since early this year, a model designed to listen and speak at once, absorb interruptions, and adjust mid-sentence rather than freezing the moment a user says "mm-hm." Signs of it now span web and mobile, suggesting a consumer rollout is near, though the name may shift before launch.

The wider point is less about voice quality than about a gap OpenAI has let widen. Its text models raced ahead to the GPT-5.5 generation while voice stayed on an older audio stack, leaving spoken conversations a step behind what the same assistant manages in writing. Closing that gap matters for a company betting that speech, not text, becomes the main way people reach AI, the wager behind its planned audio-first hardware and its voice-based support tools. GPT-Bidi-1 is built around that bet, promising smoother exchanges plus what is billed as a major jump in reasoning.

The feature's shape is coming into focus. ChatGPT users would likely keep today's setup, toggling between a new Bidi (Latest) mode and the current Advanced Voice Mode rather than being moved over wholesale. More telling is the choice of intelligence levels: High, Medium, and Instant, mirroring the tiers already offered on the text side and letting people trade speed for depth by task. A recent change that lets the voice bubble be dragged to the middle of the screen reads as an early piece of the same redesign.

Caution is warranted on timing. Whether the rollout starts this week or later is unclear, but the groundwork is plainly being laid.
