cd /news/artificial-intelligence/end-to-end-model-that-listens-sees-t… · home topics artificial-intelligence article
[ARTICLE · art-41723] src=twitter.com ↗ pub= topic=artificial-intelligence verified=true sentiment=· neutral

End-to-end model that listens, sees, thinks and responds on video in real time

Alibaba unveiled Wan Streamer, an AI agent capable of real-time video interaction that can see, hear, and respond to users, marking a significant advancement beyond voice-only AI systems.

read1 min views1 publishedJun 27, 2026
End-to-end model that listens, sees, thinks and responds on video in real time
Image: source

Min Choi @minchoi We are cooked.

China's Alibaba just revealed Wan Streamer.

AI agents can now see you, hear you, and talk back on video in real time.

This is not voice mode anymore 🤯 00:00 3:25 AM · Jun 26, 2026 371K Views

── more in #artificial-intelligence 4 stories · sorted by recency
── more on @alibaba 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/end-to-end-model-tha…] indexed:0 read:1min 2026-06-27 ·