{"slug": "steady-forcing-balancing-spatial-persistence-and-motion-continuity-in-long-video", "title": "Steady-Forcing: Balancing Spatial Persistence and Motion Continuity in Long-Horizon Nature Video Diffusion", "summary": "Researchers propose Steady-Forcing, a memory and training framework for autoregressive video diffusion models that balances spatial persistence and motion continuity in long-horizon nature video generation. The method improves background consistency and fluid dynamics over multi-minute rollouts, outperforming seven baselines in stability and motion continuity. The work highlights shortcomings in generic video quality benchmarks for fixed-camera nature scenes.", "body_md": "arXiv:2606.14732v1 Announce Type: new\nAbstract: Autoregressive video diffusion models enable streaming generation but often degrade over long rollouts: static scene layouts drift, while mechanisms that improve spatial stability tend to suppress motion, causing natural flows such as water, fire, or smoke to stagnate. We study this stability-motion trade-off in fixed-camera long-horizon nature video generation, where the two failure modes can be more clearly separated than in moving-camera settings. We propose Steady-Forcing, a memory and training framework combining a persistent visual anchor (V-Sink), an exponential moving-average motion memory (EMA-Sink), block-relative temporal encoding, periodic cache purification, and distillation from a Wan2.1-14B teacher with motion-rewarded priors under task-focused configurations. Together, these components are designed to preserve background identity while sustaining visually plausible fluid dynamics over multi-minute autoregressive rollouts. Evaluations across seven baselines show that Steady-Forcing improves long horizon background consistency and imaging quality, while a blind user study indicates stronger perceived stability and motion continuity. The benchmark evaluation further suggest that generic VBench aggregate scores under-penalize fixed-camera artifacts as well as rewarding drift-induced optical flow as Dynamic Degree while not directly penalizing texture hardening or flow stagnation - motivating future task-specific benchmarks for static-camera nature-flow evaluation. Project page: https://minar09.github.io/steadyforcing/", "url": "https://wpnews.pro/news/steady-forcing-balancing-spatial-persistence-and-motion-continuity-in-long-video", "canonical_source": "https://arxiv.org/abs/2606.14732", "published_at": "2026-06-16 04:00:00+00:00", "updated_at": "2026-06-16 04:18:56.446302+00:00", "lang": "en", "topics": ["computer-vision", "machine-learning", "generative-ai", "ai-research"], "entities": ["Steady-Forcing", "Wan2.1-14B", "V-Sink", "EMA-Sink", "VBench"], "alternates": {"html": "https://wpnews.pro/news/steady-forcing-balancing-spatial-persistence-and-motion-continuity-in-long-video", "markdown": "https://wpnews.pro/news/steady-forcing-balancing-spatial-persistence-and-motion-continuity-in-long-video.md", "text": "https://wpnews.pro/news/steady-forcing-balancing-spatial-persistence-and-motion-continuity-in-long-video.txt", "jsonld": "https://wpnews.pro/news/steady-forcing-balancing-spatial-persistence-and-motion-continuity-in-long-video.jsonld"}}