{"slug": "duogesture-neuro-inspired-and-biomechanically-informed-dual-stream-co-speech", "title": "DuoGesture: Neuro-Inspired and Biomechanically Informed Dual-Stream Co-Speech Gesture Generation", "summary": "Researchers have developed DuoGesture, a neuro-inspired dual-stream system that separately generates semantic and beat gestures for co-speech animation. The model uses a stochastic gate to coordinate when semantic gestures override rhythmic motion, while a biomechanical arm module reduces jitter and improves kinematic smoothness. In evaluations, DuoGesture outperformed existing holistic gesture generation methods by improving semantic grounding, speech-motion alignment, and rhythmic consistency.", "body_md": "arXiv:2605.26236v1 Announce Type: new\nAbstract: Co-speech gesture generation requires both semantic expressivity and biomechanically plausible rhythmic motion. Existing holistic gesture models mix lexically grounded semantic gestures with frequent prosody-aligned beat gestures. This limits semantic grounding, speech-motion alignment, and kinematic smoothness. We propose *DuoGesture*, a neuro-inspired and biomechanically informed dual-stream approach that decomposes co-speech gesture synthesis into coupled semantic and beat streams. The two streams are coordinated by a *Semantic Variational Information Bottleneck*, a stochastic frame-level gate that learns when semantic gestures should override rhythmic beat motion. The semantic stream is controlled by *Motion-Grounded Semantic Conditioning*, which replaces purely linguistic word embeddings with motion-language representations to provide motion-aligned semantic priors for long-tailed lexical triggers of gestures. The beat stream is further regularised by an *Inertial Beat Prior*, an anthropometry-weighted arm-chain module that reduces jitter and improves rhythmic consistency without constraining semantic frames. 
Objective evaluations and subjective experiments show that DuoGesture outperforms strong holistic baselines, while component ablations confirm the complementary roles of semantic grounding, stochastic stream selection, and biomechanical regularisation.", "url": "https://wpnews.pro/news/duogesture-neuro-inspired-and-biomechanically-informed-dual-stream-co-speech", "canonical_source": "https://arxiv.org/abs/2605.26236", "published_at": "2026-05-27 04:00:00+00:00", "updated_at": "2026-05-27 04:26:36.479586+00:00", "lang": "en", "topics": ["artificial-intelligence", "machine-learning", "neural-networks", "generative-ai", "natural-language-processing"], "entities": ["DuoGesture", "Semantic Variational Information Bottleneck", "Motion-Grounded Semantic Conditioning", "Inertial Beat Prior"], "alternates": {"html": "https://wpnews.pro/news/duogesture-neuro-inspired-and-biomechanically-informed-dual-stream-co-speech", "markdown": "https://wpnews.pro/news/duogesture-neuro-inspired-and-biomechanically-informed-dual-stream-co-speech.md", "text": "https://wpnews.pro/news/duogesture-neuro-inspired-and-biomechanically-informed-dual-stream-co-speech.txt", "jsonld": "https://wpnews.pro/news/duogesture-neuro-inspired-and-biomechanically-informed-dual-stream-co-speech.jsonld"}}