Gemma-2-2B-Instruct

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

04:00

2026-06-19

arxiv.org

large-language-models

Closing the Social-Semantic Gap: SPSD for Edge-Based Prompt Compression in Cloud LLM Inference

Researchers propose SPSD, an edge-based pipeline that compresses user prompts using a small language model before sending them to a cloud LLM, reducing input tokens by an average of 99.9 per call whil…

// co-occurs with top 3 entities

SPSD 1 Llama-3.1-8B-Instruct 1 arXiv 1