Inworld TTS Paralinguistic Tags Don't Work — Here's What Does Inworld's TTS-1.5 Max model ignores common paralinguistic tags like `[laugh]` and `[sigh]`, rendering them as silence or literal text instead of expressive audio. Developers at HoneyChat, a Telegram-native AI companion, tested multiple tag variants across 26 archetypes and 15 languages before discovering that SSML `` tags, ellipsis-based pauses, and spelled-out onomatopoeia like "ha-ha" reliably produce the intended vocal effects. If you've worked with expressive TTS in the last year you've probably seen the pattern: She paused. sigh "Fine, you can come in." Inline paralinguistic tags. Half the model demos use them. So when we wired up Inworld TTS-1.5 Max for HoneyChat https://honeychat.bot/ — Telegram-native AI companion where voice messages are a first-class output — we sprinkled laugh , sigh , breathe through the prompts and shipped. The audio sounded fine. Just… exactly the same as before. No laugh. No sigh. The tags were getting read out as silence at best, and as the literal text "sigh" at worst, depending on the voice. We tested all the variants we could find. None of them moved the needle. HoneyChat voice stack at a glance: en, ru, ja, zh, ko, es, fr, de, it, pt, pl, hi, ar, he, nl . voiceId strings in config/archetype voice ids.json . Generated via the Voice Design API and managed with core/voice design.py . core/voice clone manager.py — persistent voiceId minted from a WAV/MP3 sample. core/voice cache.py . VOICE GENDER MALE / VOICE GENDER FEMALE , not "male" / "female" strings. Passing the strings 400s silently.Tried on the same sentence, same voice, side-by-side audio comparison: | Pattern | What it did | |---|---| laugh sigh | Silence in output | laughs sighs | Sometimes read literally | laughs sighs | Silence asterisks get stripped |