{"type": "article", "title": "Speculative KV coding: losslessly compressing KV cache by up to ~4× using a predictor model", "publisher": "Web Pulse", "url": "https://wpnews.pro/news/speculative-kv-coding-losslessly-compressing-kv-cache-by-up-to-4x-using-a-model", "original_source": "https://fergusfinn.com/blog/kv-entropy-coder/", "published": "2026-05-08T00:00:00+00:00", "accessed": "2026-06-03", "id": "speculative-kv-coding-losslessly-compressing-kv-cache-by-up-to-4x-using-a-model"}