Aayush Deshpande

Mentions: 2 · Type: Person

// recent coverage 2 mentions

2026-05-21

modular.com

large-language-models

Modular: Why LLM Inference Needs a New Kind of Router - Part 2

Modular has built a new data layer for LLM inference routing that solves the problem of querying cached blocks across hundreds of pods in microseconds. The company's architecture uses a specialized da…

2026-05-08

modular.com

ai-infrastructure

Modular: Why LLM Inference Needs a New Kind of Router - Part 1

Modular announced that traditional HTTP-era load balancing algorithms like round-robin, consistent hashing, and least-connections are inadequate for large language model inference because GPU pods are…

// co-occurs with top 7 entities

Modular (2), Hippocratic AI (2), DeepSeek (2), FLUX (2), Kimi (2), MiniMax (2), Wan (2)