Realtime regression in non-English production voice agents

wpnews.pro

[ARTICLE · art-21302] src=community.openai.com ↗ pub=2026-06-04T08:03Z topic=ai-agents verified=true sentiment=↓ negative

A Romanian voice agent developer reports that OpenAI's scheduled shutdown of the `gpt-realtime-mini-2025-10-06` model threatens major enterprise contracts, as the replacement `gpt-realtime-mini` model exhibits worse Romanian language quality and increased hallucination of business data. The company, which projects $50,000 monthly API usage by Q3, has requested delayed deprecation or an alternative model with equivalent non-English faithfulness to avoid jeopardizing its production deployments.

Published Jun 4, 2026 · 2 min read

Thank you for the clarification.

The difficulty for us is that revisiting the Realtime prompting guide is unlikely to solve the core issue. We have already spent thousands of hours internally prototyping, testing, and comparing different Realtime/audio-capable model configurations for Romanian production voice-agent use cases.

The stack we currently run, based on `gpt-realtime-mini-2025-10-06`, is not something we selected casually. It is the result of substantial internal R&D, repeated production-like testing, and comparison against other available models. For our specific use case (Romanian AI voice agents that must remain faithful to database-fed business information), this snapshot has been by far the best option.

The issue we are seeing with the replacement model cannot be corrected with minor prompt changes. The newer `gpt-realtime-mini` shows worse Romanian/non-English quality and, more importantly, worse faithfulness to supplied business data. In our tests, it has hallucinated nonexistent departments, services, and operational details that were not present in the database/context, while the older snapshot behaved much more reliably.
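One way to make this kind of faithfulness failure measurable is to check each department or service name the agent utters against the set that was actually supplied in the context. The following is a minimal sketch, not the testing method used by the author: it assumes entity mentions have already been extracted from a transcript (by an annotator or a separate extraction step), and the function names and sample data are hypothetical.

```python
import unicodedata


def normalize(name: str) -> str:
    """Lowercase, strip diacritics, and collapse whitespace before comparing."""
    decomposed = unicodedata.normalize("NFKD", name)
    without_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(without_marks.lower().split())


def unsupported_mentions(mentioned: list[str], allowed: set[str]) -> list[str]:
    """Return mentions that match no entity supplied in the context."""
    allowed_norm = {normalize(a) for a in allowed}
    return [m for m in mentioned if normalize(m) not in allowed_norm]


# Hypothetical data: departments present in the database-fed context.
allowed_departments = {"Vânzări", "Facturare", "Suport tehnic"}
# Hypothetical entity mentions extracted from one agent transcript.
mentions = ["vanzari", "Facturare", "Departamentul juridic"]

flagged = unsupported_mentions(mentions, allowed_departments)
print(flagged)
```

Normalizing diacritics matters for Romanian because speech transcripts often drop them; exact-match comparison is a deliberately strict choice here and would need loosening for paraphrased names.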

That reliability is precisely what allowed us to enter the market quickly and attract a large number of voice and call-center clients, both small and large. Although we have not been in the market for long, this OpenAI Realtime-based stack enabled us to grow rapidly, and it is now central to several major enterprise deployments.

This is why the scheduled shutdown is such a serious concern for us. We are currently operating enterprise deployments where the model’s behavior is not a minor implementation detail but the foundation of the product’s trustworthiness. A forced migration to a replacement model with materially worse Romanian/non-English performance could jeopardise major contracts we are currently involved in, along with future ones we are negotiating.

It also affects OpenAI commercially. Based on our current pipeline, we project our OpenAI API usage could reach the ~$50,000/month threshold by Q3 as these deployments scale. Our preference is to continue building and scaling on OpenAI’s Realtime infrastructure, but we need a reliable migration path before moving production traffic away from the validated snapshot.

That is why we were hoping to speak with someone from OpenAI who could look at this from a production/API customer perspective, not only as a general prompting issue. Ideally, we would like to understand whether OpenAI can consider one of the following:

delaying the shutdown of `gpt-realtime-mini-2025-10-06` for production users affected by language-specific regressions;

providing temporary extended access while regressions are investigated;

recommending or releasing an alternative Realtime/audio-capable model with equivalent Romanian/non-English faithfulness;

or routing this as a production-impacting model quality regression for Realtime API users.

We can provide side-by-side transcripts and recorded audio comparing `gpt-realtime-mini-2025-10-06` and `gpt-realtime-mini` under comparable flows and configuration.
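Side-by-side evidence of this kind could be summarized as per-model failure rates over the same set of flows. The sketch below assumes each flow has already been reviewed and labeled with a count of unsupported claims for each model; the `FlowResult` type, the `summarize` function, and all numbers are hypothetical placeholders, not measured results.

```python
from dataclasses import dataclass


@dataclass
class FlowResult:
    flow_id: str
    old_unsupported: int  # unsupported claims under gpt-realtime-mini-2025-10-06
    new_unsupported: int  # unsupported claims under gpt-realtime-mini


def summarize(results: list[FlowResult]) -> dict[str, float]:
    """Share of flows with any unsupported claim per model, plus share that got worse."""
    total = len(results)
    if total == 0:
        raise ValueError("no flows to compare")
    old_bad = sum(1 for r in results if r.old_unsupported > 0)
    new_bad = sum(1 for r in results if r.new_unsupported > 0)
    regressed = sum(1 for r in results if r.new_unsupported > r.old_unsupported)
    return {
        "old_failure_rate": old_bad / total,
        "new_failure_rate": new_bad / total,
        "regressed_share": regressed / total,
    }


# Placeholder labels for illustration only; not real measurements.
results = [
    FlowResult("booking", 0, 1),
    FlowResult("opening-hours", 0, 0),
    FlowResult("department-routing", 1, 2),
    FlowResult("billing", 0, 0),
]

summary = summarize(results)
print(summary)
```

Pairing results by flow, rather than comparing two unrelated samples, keeps prompts and configuration constant so that differences can be attributed to the model.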

source & further reading

community.openai.com — original article

Read original on community.openai.com → community.openai.com/t/realtime-regression-in-no…
