Realtime voice, speech, and transcription now supported on AI Gateway


Source: vercel.com · Published: 2026-06-29 · Topic: ai-products

Vercel's AI Gateway now supports realtime voice, speech, and transcription models in beta, enabling developers to build voice agents with low-latency audio-in/audio-out capabilities. The platform offers observability, spend controls, and bring-your-own-key support without markup or platform fees, with integration via AI SDK 7.

AI Gateway now supports voice and audio models. You can build realtime voice agents, generate speech from text, and transcribe audio to text. This provides the same observability, spend controls, and bring-your-own-key support as text, image, and video models in AI Gateway, with no markup or platform fees. These capabilities are in beta and available via AI SDK 7.

With realtime support, a single model takes audio in and audio out, so a user can talk and hear a reply back in near real time instead of waiting on a chain of separate models.

Capability	What it does
Realtime	The model listens to the user, works out a response, and speaks it back in a live, low-latency conversation. It can call your tools mid-conversation to look something up or take an action.
Speech	Generate spoken audio from text, with a selectable voice and output format such as MP3. Use it for voiceovers, audio versions of written content, and spoken responses.
Transcription	Transcribe recordings into text, from a file buffer, base64 string, or URL. Use it for voice notes or other transcriptions.

Two ways to get started:

Follow the realtime example below or the realtime quickstart to add a voice agent to your app.

Use the playground. Talk to a realtime model in the browser, no code required, in the AI Gateway Playground.

A voice agent has two pieces: a server route that mints a short-lived token, so your API key never reaches the client, and a browser component that connects with it.

Add the token route:
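The pattern can be sketched without any SDK calls. The following is a minimal illustration under stated assumptions, not the AI SDK's or AI Gateway's actual API: `createTokenRoute`, `MintToken`, `ClientToken`, `RouteResult`, and the 60-second lifetime are hypothetical names and values chosen here, and `fakeMint` stands in for whatever call really exchanges the server-side key for a client token.

```typescript
// Hypothetical shapes for illustration; not the AI SDK's or AI Gateway's real API.
interface ClientToken {
  token: string; // short-lived credential handed to the browser
  expiresAt: number; // Unix epoch milliseconds
}

// Stand-in for the call that exchanges the server-side API key for a client token.
type MintToken = (apiKey: string, ttlMs: number, now: number) => ClientToken;

interface RouteResult {
  status: number;
  body: string; // JSON text sent to the browser
}

// Build a POST handler that keeps the API key on the server.
function createTokenRoute(apiKey: string, mint: MintToken, ttlMs = 60_000) {
  return function handlePost(now: number = Date.now()): RouteResult {
    if (!apiKey) {
      return {
        status: 500,
        body: JSON.stringify({ error: "API key is not configured" }),
      };
    }
    const { token, expiresAt } = mint(apiKey, ttlMs, now);
    // Only the token and its expiry leave the server; the key never does.
    return { status: 200, body: JSON.stringify({ token, expiresAt }) };
  };
}

// Fake minter for demonstration only.
const fakeMint: MintToken = (_key, ttlMs, now) => ({
  token: `tok_${now.toString(36)}`,
  expiresAt: now + ttlMs,
});

const handler = createTokenRoute("server-side-secret", fakeMint);
const result = handler(1_000);
console.log(result.status, result.body);
```

Passing the minter in as a parameter keeps the sketch independent of any particular SDK and makes it easy to check that the response body never contains the key.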

Then connect from the browser. The useRealtime hook fetches that route and manages the WebSocket connection, microphone capture, and audio playback:
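To make those responsibilities concrete, the connection lifecycle can be modeled as a small state machine. This is a conceptual sketch only: it does not use or reproduce the hook's real API, and the state names, event names, and `nextState` function are all assumptions introduced here.

```typescript
// Hypothetical lifecycle model; the real hook's internals are not shown in the source.
type VoiceState = "idle" | "fetching-token" | "connecting" | "live" | "closed";

type VoiceEvent =
  | { type: "start" } // user asks to begin a conversation
  | { type: "token-received" } // the token route responded
  | { type: "socket-open" } // the WebSocket connection is established
  | { type: "stop" } // user ends the conversation
  | { type: "error" }; // any failure along the way

function nextState(state: VoiceState, event: VoiceEvent): VoiceState {
  // Stopping or failing ends the session from any state.
  if (event.type === "stop" || event.type === "error") return "closed";
  switch (state) {
    case "idle":
      return event.type === "start" ? "fetching-token" : state;
    case "fetching-token":
      return event.type === "token-received" ? "connecting" : state;
    case "connecting":
      // Microphone capture and audio playback would begin once the state is "live".
      return event.type === "socket-open" ? "live" : state;
    default:
      return state; // "live" and "closed" ignore the remaining events
  }
}

const events: VoiceEvent[] = [
  { type: "start" },
  { type: "token-received" },
  { type: "socket-open" },
];
const finalState = events.reduce<VoiceState>(nextState, "idle");
console.log(finalState); // prints "live"
```

Modeling the steps as a pure function keeps the ordering explicit: the token is fetched before the socket opens, and audio only flows once the connection is live.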

You can also try audio models without writing any code. Open the models page, click into a model, and interact with it right in the browser:

Talk to a realtime model to hold a voice conversation

Send text and have a speech model read it back

Speak to a transcription model and have it transcribe your words

For more information on realtime voice, speech, and transcription models on AI Gateway, see the documentation and the full list of supported models.
