05:06
2026-06-16
dev.to
large-language-models
I measure how fast 42 LLMs actually answer. Here's the honest method.
An independent speed tracker, ollamatps.com, benchmarks 42 Ollama Cloud models by measuring time to first token (TTFT) and tokens per second (TPS). The tracker reveals that smaller models can be faste…
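
For readers unfamiliar with the two metrics named in the summary, here is a minimal sketch of how TTFT and TPS could be computed from a streamed reply. It is not the tracker's actual method, which the summary does not describe. The function name `measure_stream` and its `clock` parameter are hypothetical, and the sketch assumes each streamed chunk counts as one token, which real APIs do not guarantee.

```python
import time
from typing import Callable, Iterable, Optional, Tuple


def measure_stream(
    chunks: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], Optional[float], int]:
    """Return (ttft_seconds, tokens_per_second, n_chunks) for one streamed reply.

    Assumption: one chunk == one token. Swap in a real tokenizer count
    if the stream batches several tokens per chunk.
    """
    start = clock()          # taken just before the first chunk is requested
    first = None             # arrival time of the first chunk
    last = start             # arrival time of the most recent chunk
    count = 0
    for _ in chunks:
        now = clock()
        if first is None:
            first = now
        last = now
        count += 1
    if first is None:
        return None, None, 0  # the stream produced nothing

    ttft = first - start
    # Decode rate excludes the wait for the first token, so a slow start
    # does not drag down the steady-state TPS figure.
    decode_time = last - first
    tps = (count - 1) / decode_time if count > 1 and decode_time > 0 else None
    return ttft, tps, count
```

Passing a lazy generator that sends the request on its first `next()` keeps the request latency inside the TTFT window. Reporting TTFT separately from the decode rate is one design choice among several; dividing all tokens by total wall time is also common and gives a lower number for short replies.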