07:44
2026-06-13
dev.to
large-language-models
I expected the cheaper model to be cheaper. It cost 8.6x more.
An engineer discovered that Gemini 2.5 Flash, despite its lower per-token price, cost 8.6 times more per request than Claude Haiku because its reasoning tokens are billed as output tokens. The engineer built …