Sors

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

17:15

2026-06-16

github.com

large-language-models

Sors: a Rust proxy that reorders prompts to maximize vLLM prefix cache hits

A new Rust-based reverse proxy called Sors reorders prompt content to maximize prefix cache hits in LLM inference engines like vLLM and SGLang, improving latency by placing static content before dynam…

// co-occurs with top 5 entities

vLLM 1 SGLang 1 Rust 1 OpenAI 1 Qwen 1

// topics top 3 topics

large language models 1 ai infrastructure 1 developer tools 1