# Building a Voice AI Platform with 28 Modules in Python

> Source: <https://dev.to/ryanwinston_134/building-a-voice-ai-platform-with-28-modules-in-python-4hbm>
> Published: 2026-06-20 02:21:05+00:00

##
What I Built

Omni-VRAM is an open-source voice AI platform with 28 modules.

GitHub: [https://github.com/Liangchenxu/Omni-VRAM](https://github.com/Liangchenxu/Omni-VRAM)

##
Features

-
**Speech Recognition**: Whisper with 5 backends (faster-whisper, whisper.cpp, ONNX, TensorRT, OpenAI API)
-
**Real-time Streaming**: <200ms latency
-
**Speaker Diarization**: Who spoke when
-
**Emotion Recognition**: 6 emotions
-
**TTS Synthesis**: Edge-TTS + pyttsx3
-
**Chinese Processing**: Punctuation, tokenization, dialects
-
**Meeting Assistant**: Auto summarization with LLM
-
**APIs**: REST, WebSocket, gRPC
-
**Docker**: GPU and CPU support

##
Tech Stack

Python, PyTorch, CUDA, FastAPI, Whisper

##
Installation
