04:00
2026-05-25
arxiv.org
large-language-models
Parallel Context Compaction for Long-Horizon LLM Agent Serving
Researchers introduced parallel context compaction for long-horizon LLM agent serving, addressing the problem of growing conversation histories exceeding context windows. The method provides fine-graiβ¦