grep -l @cpu /news/*.json | wc -l → 1

@CPU

mentions: 1 · type: Organization · feed: RSS
18:56
2026-04-30
pytorch.org
large-language-models

SMG: The Case for Disaggregating CPU from GPU in LLM Serving

Shepherd Model Gateway (SMG) has disaggregated all CPU-bound workloads from GPU inference in large language model serving, moving tokenization, detokenization, and parsing into a dedicated Rust gatewa…

// co-occurs with top 7 entities