07:21
2026-06-15
corti.com
large-language-models
Measuring LLM Inference: A Practical Look at token-sec-calc I published on GitHub.
A developer published token-sec-calc, an open-source Python CLI tool that benchmarks LLM inference throughput, latency, time-to-first-token, and queue wait against any OpenAI-compatible endpoint. The โฆ