Chorus II: Cross-Request Sparsity Reuse for Efficient Image-to-Video Generation

wpnews.pro

cd /news/machine-learning/chorus-ii-cross-request-sparsity-reu… · home › topics › machine-learning › article

[ARTICLE · art-38785] src=arxiv.org ↗ pub=2026-06-25T04:00Z topic=machine-learning verified=true sentiment=↑ positive

Chorus II: Cross-Request Sparsity Reuse for Efficient Image-to-Video Generation

Researchers propose Chorus II, a cross-request sparsity reuse framework for efficient image-to-video generation, achieving a 2.16× speedup by reusing sparse attention masks from similar historical requests. The method preserves generation quality while reducing online mask-prediction overhead, addressing computational challenges in large-scale deployment of diffusion models.

read1 min views1 publishedJun 25, 2026

arXiv:2606.25040v1 Announce Type: new Abstract: Serving diffusion models for image-to-video generation is computationally expensive, posing significant challenges for large-scale deployment. Real I2V workloads often contain similar requests, such as repeated effect templates, related subjects, and recurring shot layouts. Existing cross-request acceleration methods mainly exploit this redundancy through feature reuse. We observe that similar I2V requests also share highly consistent sparse attention patterns, enabling historical sparse masks to serve as request-conditioned priors with almost no online mask-prediction overhead. We propose a cross-request reuse framework centered on \textbf{sparsity reuse}, with \textbf{feature reuse} as an optional extension safeguarded by a lightweight \textbf{guidance enhancement}. Our sparsity reuse is implemented as shared sparse mask reuse, which reuses high-quality sparse masks from similar historical requests to avoid per-request online mask prediction. Optional feature reuse applies downsampled computation to highly redundant spatiotemporal regions, mitigating boundary artifacts while preserving efficiency gains. Guidance enhancement reinforces image/text conditioning after reuse, mitigating semantic drift and condition-adherence issues. Experiments show that default sparsity reuse configuration preserves generation quality with a \textbf{2.16$\times$} speedup.

source & further reading

arxiv.org — original article

── more in #machine-learning 4 stories · sorted by recency

dev.to · 25 Jun · #machine-learning

Why stop gaming saved my tokens: Building my own local AI Lab

wan-streamer.com · 25 Jun · #machine-learning

Wan Streamer v0.1: End-to-End Real-Time Interactive Foundation Models

arxiv.org · 25 Jun · #machine-learning

SEMIR: Topology-Preserving Graph Minors for Thin-Structure Segmentation

arxiv.org · 25 Jun · #machine-learning

Noise-Aware Boundary-Enhanced Generative Learning for Ultrasound Speckle Reduction

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required