cd/entity/OpenAiIngressยท homeโ€บ entitiesโ€บ OpenAiIngress
grep -l @openaiingress /news/*.json | wc -l โ†’ 1

OpenAiIngress

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

09:00
2026-06-18
anyscale.com
large-language-models

High Performance Distributed Inference with Ray Serve LLM

Ray Serve LLM, in partnership with Google Kubernetes Engine, announced major performance improvements achieving up to 4.4x higher throughput on prefill-heavy workloads and 24x higher on decode-heavy wโ€ฆ

// co-occurs with top 7 entities