Dots.tts: 2B-parameter continuous, end-to-end autoregressive TTS system

wpnews.pro

cd /news/artificial-intelligence/dots-tts-2b-parameter-continuous-end… · home › topics › artificial-intelligence › article

[ARTICLE · art-23156] src=rednote-hilab.github.io ↗ pub=2026-06-06T04:53Z topic=artificial-intelligence verified=true sentiment=↑ positive

Dots.tts: 2B-parameter continuous, end-to-end autoregressive TTS system

A 2-billion-parameter fully continuous, end-to-end autoregressive text-to-speech system called dots.tts has achieved state-of-the-art performance across multiple benchmarks, including the best average results on Seed-TTS-Eval with word error rates of 0.94% and 1.30% on Chinese and English test sets. The system, which pairs a semantic encoder, LLM, and autoregressive flow-matching acoustic head over a 48 kHz AudioVAE without discrete tokens, also attained the highest average speaker similarity of 83.9 on the 24-language MiniMax multilingual benchmark. This marks a significant advancement in open-source TTS technology, demonstrating strong generation stability, voice cloning ability, and emotional expressiveness.

read1 min views14 publishedJun 6, 2026

A 2B-parameter fully continuous, end-to-end autoregressive text-to-speech system. Abstract dots.tts is a 2B-parameter fully continuous, end-to-end autoregressive (AR) text-to-speech system. The backbone pairs a semantic encoder, an LLM, and an autoregressive flow-matching acoustic head over a 48 kHz AudioVAE, with no discrete tokens anywhere in the pipeline.

dots.tts achieves the best average performance on Seed-TTS-Eval, with WERs of 0.94% / 1.30% / 6.60% and SIM scores of 81.0 / 77.1 / 79.5 on the zh / en / zh-hard test sets, respectively. It further attains the highest average speaker similarity (83.9) on the 24-language MiniMax multilingual benchmark. Across other benchmarks, dots.tts also consistently demonstrates open-source state-of-the-art performance, exhibiting strong generation stability, voice cloning ability, and emotional expressiveness.

Contents

source & further reading

rednote-hilab.github.io — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/dots-tts-2b-parameter-co…

Read original on rednote-hilab.github.io → rednote-hilab.github.io/dots.tts-demo/

mentioned entities

dots.tts

Seed-TTS-Eval

MiniMax

metadata

slugdots-tts-2b-parameter-continuous-end-to-end-autoregressive-tts-system

topic#artificial-intelligence

secondary4 topics

sentimentpositive

canonicalrednote-hilab.github.io

navigation

← prevAI vs Human: An Honest Scorecard

next →Show HN: Nanocode-CLI – A lightw…

── more in #artificial-intelligence 4 stories · sorted by recency

machinebrief.com · 22 Jul · #artificial-intelligence

DAIS: Dependency-Aware Intermediate QA Supervision for Complex Reasoning

machinebrief.com · 22 Jul · #artificial-intelligence

Reasoning Before Translation: Enhancing Legal Machine Translation with Structured Reasoning

machinebrief.com · 22 Jul · #artificial-intelligence

AutoJourn: Multi-Perspective Summarisation, Bias Detection and Bias Neutralisation for LLM-Generated News in Automated Journalism

filmworld-ai.github.io · 22 Jul · #artificial-intelligence

FilmWorld: Agentic Novel-to-Film Generation

── more on @dots.tts 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 26 May · #ai-agents

Think, Durable Objects, and the Real Shape of AI Applications

wpnews · 8 Jul · #ai-tools

What's the Future of Clay?

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required