04:00
2026-05-25
arxiv.org
machine-learning
Tensor Cache: Eviction-conditioned Associative Memory for Transformers
Researchers introduced Tensor Cache, a two-level memory system for Transformer models that stores evicted key-value pairs from a sliding-window cache into a compressed outer-product fast-weight memory…
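
The two-level idea in the summary can be illustrated with a minimal sketch. It assumes a plain additive outer-product write (memory += value ⊗ key) and a linear read (memory · query); the summary does not specify the paper's actual update rule, compression scheme, or read path, so those are not modeled here. The class name `TwoLevelMemory` and its methods are hypothetical and chosen only for this illustration.

```python
from collections import deque


class TwoLevelMemory:
    """Hypothetical sketch: sliding-window cache backed by an outer-product memory."""

    def __init__(self, dim, window):
        self.window = window
        self.recent = deque()  # level 1: exact (key, value) pairs inside the window
        # level 2: dim x dim fast-weight matrix; fast[i][j] accumulates value[i] * key[j]
        self.fast = [[0.0] * dim for _ in range(dim)]

    def write(self, key, value):
        """Append a pair; if the window overflows, fold the oldest pair into level 2."""
        self.recent.append((key, value))
        if len(self.recent) > self.window:
            old_key, old_value = self.recent.popleft()
            # Assumed update rule: add the outer product of the evicted value and key.
            for i, v_i in enumerate(old_value):
                row = self.fast[i]
                for j, k_j in enumerate(old_key):
                    row[j] += v_i * k_j

    def read_fast(self, query):
        """Linear associative read: matrix-vector product of the memory with the query."""
        return [sum(m * q for m, q in zip(row, query)) for row in self.fast]


mem = TwoLevelMemory(dim=2, window=1)
mem.write([1.0, 0.0], [3.0, 4.0])
mem.write([0.0, 1.0], [5.0, 6.0])    # window is full, so the first pair is evicted
recalled = mem.read_fast([1.0, 0.0])  # query with the evicted key
```

Because the two keys in this toy example are orthogonal unit vectors, querying with the evicted key returns its value exactly. With non-orthogonal keys the read would mix in contributions from other stored pairs, which is the usual trade-off of a fixed-size associative memory.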