Building Real-Time Voice Agents from Scratch

Nemorize has published a learning roadmap titled "Building Real-Time Voice Agents from Scratch." It covers the full pipeline, from audio fundamentals and speech detection through ASR, LLM streaming, and TTS. The roadmap has five parts: foundations, pipeline construction, hard problems such as barge-in and latency, engineering best practices, and capstone extensions.

1 min read · Published May 29, 2026

Learning Topics

This roadmap covers the following topics:

**Part I: Foundations**
- [Shape of a Voice Agent](/roadmaps/building-real-time-voice-agents-from-scratch/lessons/019e6873-1262-7db5-9311-c80162b6688e)
  - mic → ASR → LLM → TTS Loop
  - Trade Matrix
- Audio Fundamentals
  - SR_IN vs SR_OUT
  - float32 ↔ int16 Conversions
- [VAD: Detecting Speech](/roadmaps/building-real-time-voice-agents-from-scratch/lessons/019e6873-1262-7413-838c-53bc3384556b)
  - Threshold Tuning
  - Pre-roll Buffer
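
To make the "float32 ↔ int16 Conversions" topic concrete, here is a minimal sketch of one common way to move between floating-point samples and 16-bit PCM. It is not taken from the roadmap itself: the function names and the scaling convention (multiply by 32767 on the way out, divide by 32768 on the way back) are assumptions chosen for illustration, and it uses only the standard library rather than whatever array library the lessons use.

```python
from array import array

INT16_MAX = 32767  # largest value a signed 16-bit sample can hold


def float_to_int16(samples):
    """Convert float samples in [-1.0, 1.0] to signed 16-bit PCM."""
    out = array("h")
    for s in samples:
        # Clip first: out-of-range input would otherwise overflow int16.
        s = max(-1.0, min(1.0, s))
        out.append(int(round(s * INT16_MAX)))
    return out


def int16_to_float(pcm):
    """Convert signed 16-bit PCM back to 32-bit floats in [-1.0, 1.0)."""
    # Dividing by 32768 keeps -32768 inside the valid float range.
    return array("f", (s / 32768.0 for s in pcm))
```

Clipping before scaling is the important design choice here: a sample slightly above 1.0 would otherwise fall outside the int16 range instead of saturating at full volume.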

**Part II: The Pipeline**
- ASR with faster-whisper
- Model Size Trade-offs
- ASR as a Blocking Call
- LLM Streaming & State
- Speakable System Prompt
- The Commit Pattern
- TTS & Latency Trick
- pop_sentences Deep Dive
- Kokoro vs Piper Backends
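
The outline names a `pop_sentences` helper but this page does not show its code, so the following is only a guess at the general idea: hand complete sentences to TTS as soon as they appear in the streamed LLM text, while keeping any unfinished tail buffered. The signature, the return shape, and the punctuation rule are all assumptions made for illustration.

```python
import re

# Assumed rule: a sentence ends at '.', '!' or '?' followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def pop_sentences(buffer):
    """Split off complete sentences; return (sentences, remainder)."""
    parts = _SENTENCE_END.split(buffer)
    # The last piece may still be growing, so it stays in the buffer.
    remainder = parts.pop()
    sentences = [p.strip() for p in parts if p.strip()]
    return sentences, remainder
```

Under this rule a sentence is released only once the text after it has started to arrive, so the caller would need to flush the remainder itself when the LLM stream ends.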

**Part III: The Hard Parts**
- Barge-in: Interruption
- Yield-Point Latency
- Cancel Wire Protocol
- The Feedback Loop
- Browser AEC
- Playback State Machine
- Three Distinct Moments

**Part IV: Engineering It Well**
- Frontend Audio Scheduling
- AudioWorklet for Mic Capture
- Gapless playHead Scheduling
- Concurrency & Orchestration
- run_in_executor Pattern
- asyncio vs Threads — Same Shape
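
As a rough illustration of the "run_in_executor Pattern" topic, the sketch below moves a blocking call off the asyncio event loop so other tasks keep running while it works. `transcribe_blocking` is a hypothetical stand-in that just sleeps; it is not the roadmap's code and does not call faster-whisper. Only `loop.run_in_executor` and the other asyncio calls are real library APIs.

```python
import asyncio
import time


def transcribe_blocking(audio: bytes) -> str:
    """Hypothetical stand-in for a blocking ASR call (sleeps, runs no model)."""
    time.sleep(0.05)
    return f"<{len(audio)} bytes transcribed>"


async def transcribe(audio: bytes) -> str:
    loop = asyncio.get_running_loop()
    # None selects the loop's default ThreadPoolExecutor.
    return await loop.run_in_executor(None, transcribe_blocking, audio)


async def main() -> tuple[str, int]:
    ticks = 0

    async def heartbeat() -> None:
        # Counts up while the loop is free; stalls if the loop is blocked.
        nonlocal ticks
        while True:
            await asyncio.sleep(0.01)
            ticks += 1

    hb = asyncio.create_task(heartbeat())
    text = await transcribe(b"\x00" * 320)
    hb.cancel()
    return text, ticks
```

Running `asyncio.run(main())` should return the transcription string along with a non-zero tick count, which shows the heartbeat task kept running during the blocking call. Calling `transcribe_blocking` directly inside the coroutine would instead freeze the loop for the duration of the call.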

**Part V: Make It Yours**
- Capstone Extensions
- Measurable Latency Fork
- Extension Projects
- The Production Bridge
- Trade-offs You Now Own
- Why Hosted APIs Choose as They Do
