grep -l @aayush deshpande /news/*.json | wc -l → 2

@Aayush Deshpande

mentions 2 type Person
00:00
2026-05-21
modular.com
large-language-models

Modular: Why LLM Inference Needs a New Kind of Router - Part 2

Modular has built a new data layer for LLM inference routing that solves the problem of querying cached blocks across hundreds of pods in microseconds. The company's architecture uses a specialized da…

00:00
2026-05-08
modular.com
ai-infrastructure

Modular: Why LLM Inference Needs a New Kind of Router - Part 1

Modular announced that traditional HTTP-era load balancing algorithms like round-robin, consistent hashing, and least-connections are inadequate for large language model inference because GPU pods are…

// co-occurs with top 7 entities