cd/entity/FP-16Β· homeβ€Ί entitiesβ€Ί FP-16
grep -l @fp-16 /news/*.json | wc -l β†’ 1

FP-16

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

18:57
2026-06-16
injuly.in
large-language-models

Inference cost at scale with napkin math

A technical analysis calculates the dollar cost per user for serving large language models at scale using napkin math, breaking down GPU resources, matrix multiplication costs, and attention mechanism…

// co-occurs with top 7 entities