grep -l @gateway api inference extension /news/*.json | wc -l → 1

Gateway API Inference Extension

mentions: 1 · type: Person

// recent coverage 1 mentions

15:27
2026-06-27
cefboud.com
large-language-models

Distributed LLM Inference with LLM-d

A new open-source tool called llm-d acts as an LLM-aware load balancer for distributed inference, intelligently routing requests across vLLM instances based on KV cache locality and GPU utilization. B…

// co-occurs with top 6 entities