cd /news/machine-learning/chorus-ii-cross-request-sparsity-reu… · home topics machine-learning article
[ARTICLE · art-38785] src=arxiv.org ↗ pub= topic=machine-learning verified=true sentiment=↑ positive

Chorus II: Cross-Request Sparsity Reuse for Efficient Image-to-Video Generation

Researchers propose Chorus II, a cross-request sparsity reuse framework for efficient image-to-video generation, achieving a 2.16× speedup by reusing sparse attention masks from similar historical requests. The method preserves generation quality while reducing online mask-prediction overhead, addressing computational challenges in large-scale deployment of diffusion models.

read1 min views1 publishedJun 25, 2026

arXiv:2606.25040v1 Announce Type: new Abstract: Serving diffusion models for image-to-video generation is computationally expensive, posing significant challenges for large-scale deployment. Real I2V workloads often contain similar requests, such as repeated effect templates, related subjects, and recurring shot layouts. Existing cross-request acceleration methods mainly exploit this redundancy through feature reuse. We observe that similar I2V requests also share highly consistent sparse attention patterns, enabling historical sparse masks to serve as request-conditioned priors with almost no online mask-prediction overhead. We propose a cross-request reuse framework centered on \textbf{sparsity reuse}, with \textbf{feature reuse} as an optional extension safeguarded by a lightweight \textbf{guidance enhancement}. Our sparsity reuse is implemented as shared sparse mask reuse, which reuses high-quality sparse masks from similar historical requests to avoid per-request online mask prediction. Optional feature reuse applies downsampled computation to highly redundant spatiotemporal regions, mitigating boundary artifacts while preserving efficiency gains. Guidance enhancement reinforces image/text conditioning after reuse, mitigating semantic drift and condition-adherence issues. Experiments show that default sparsity reuse configuration preserves generation quality with a \textbf{2.16$\times$} speedup.

── more in #machine-learning 4 stories · sorted by recency
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/chorus-ii-cross-requ…] indexed:0 read:1min 2026-06-25 ·