08:37
2026-06-26
coinerella.com
artificial-intelligence
Paying for LLM inference by the kilowatt-hour instead of per token
NeuralWatt, a US-based AI inference provider, introduced energy-based metering for LLM inference, charging by kilowatt-hour instead of per token. A user reported an average 82.9% cost reduction comparβ¦