{"slug": "realtime-regression-in-non-english-production-voice-agents", "title": "Realtime regression in non-English production voice agents", "summary": "A Romanian voice agent developer reports that OpenAI's scheduled shutdown of the `gpt-realtime-mini-2025-10-06` model threatens major enterprise contracts, as the replacement `gpt-realtime-mini` model exhibits worse Romanian language quality and increased hallucination of business data. The company, which projects $50,000 monthly API usage by Q3, has requested delayed deprecation or an alternative model with equivalent non-English faithfulness to avoid jeopardizing its production deployments.", "body_md": "Thank you for the clarification.\n\nThe difficulty for us is that revisiting the Realtime prompting guide is unlikely to solve the core issue. We have already spent thousands of hours internally prototyping, testing, and comparing different Realtime/audio-capable model configurations for Romanian production voice-agent use cases.\n\nThe stack we currently run, based on `gpt-realtime-mini-2025-10-06`, is not something we selected casually. It is the result of substantial internal R&D, repeated production-like testing, and comparison against other available models. For our specific use case (Romanian AI voice agents that must remain faithful to database-fed business information), this snapshot has been by far the best option.\n\nThe issue we are seeing with the replacement model cannot be corrected with minor prompt changes alone. The newer `gpt-realtime-mini` shows worse Romanian/non-English quality and, more importantly, worse faithfulness to supplied business data. 
In our tests, it hallucinated nonexistent departments, services, or operational details that were not present in the database/context, while the older snapshot behaved much more reliably.\n\nThat reliability is precisely what allowed us to enter the market quickly and attract a large number of voice and call-center clients, both small and large. Although we have not been in the market for long, this OpenAI Realtime-based stack enabled us to grow rapidly, and it is now central to several major enterprise deployments.\n\nThis is why the scheduled shutdown is such a serious concern for us. We currently run enterprise deployments in which model behavior is not a minor implementation detail but the foundation of the product’s trustworthiness. A forced migration to a replacement model with materially worse Romanian/non-English performance could jeopardise major contracts we are currently involved in, along with future ones we’re negotiating.\n\nIt also affects OpenAI commercially. Based on our current pipeline, we project our OpenAI API usage could reach the ~$50,000/month threshold by Q3 as these deployments scale. Our preference is to continue building and scaling on OpenAI’s Realtime infrastructure, but we need a reliable migration path before moving production traffic away from the validated snapshot.\n\nThat is why we were hoping to speak with someone from OpenAI who could look at this from a production/API customer perspective, not only as a general prompting issue. 
Ideally, we would like to understand whether OpenAI can consider one of the following:\n\n- delaying the shutdown of `gpt-realtime-mini-2025-10-06` for production users affected by language-specific regressions;\n- providing temporary extended access while regressions are investigated;\n- recommending or releasing an alternative Realtime/audio-capable model with equivalent Romanian/non-English faithfulness;\n- or routing this as a production-impacting model quality regression for Realtime API users.\n\nWe can provide side-by-side transcripts and audio recordings comparing `gpt-realtime-mini-2025-10-06` and `gpt-realtime-mini` under comparable flows/configuration.", "url": "https://wpnews.pro/news/realtime-regression-in-non-english-production-voice-agents", "canonical_source": "https://community.openai.com/t/realtime-regression-in-non-english-production-voice-agents-gpt-realtime-mini-vs-gpt-realtime-mini-2025-10-06/1380643", "published_at": "2026-06-04 08:03:10+00:00", "updated_at": "2026-06-04 08:48:22.208487+00:00", "lang": "en", "topics": ["ai-agents", "natural-language-processing", "large-language-models", "ai-products", "ai-tools"], "entities": ["OpenAI", "gpt-realtime-mini-2025-10-06", "gpt-realtime-mini", "Romanian"], "alternates": {"html": "https://wpnews.pro/news/realtime-regression-in-non-english-production-voice-agents", "markdown": "https://wpnews.pro/news/realtime-regression-in-non-english-production-voice-agents.md", "text": "https://wpnews.pro/news/realtime-regression-in-non-english-production-voice-agents.txt", "jsonld": "https://wpnews.pro/news/realtime-regression-in-non-english-production-voice-agents.jsonld"}}