HELMET

mentions: 1 | type: Organization | feed: RSS

// recent coverage 1 mentions

22:29

2026-06-14

arxiv.org

large-language-models

Still: Amortized KV Cache Compaction in a Single Forward Pass

Researchers introduced Still, a per-layer Perceiver model that compacts the KV cache in a single forward pass, enabling efficient deployment of long-context language models. On Qwen and Gemma models, Still ou…

// co-occurs with top 7 entities

Qwen (1), Gemma (1), RULER (1), LongBench (1), KV-Distill (1), Perceiver (1), Still (1)

// topics top 5 topics

large language models (1), machine learning (1), ai infrastructure (1), ai research (1), ai tools (1)