20:17
2026-05-30
tomshardware.com
large-language-models
768GB Intel Optane DIMMs to run 1T-parameter LLM with single GPU at 4tps
A Redditor running a workstation with 768GB of used Intel Optane Persistent Memory DIMMs achieved roughly 4 tokens per second on a 1-trillion-parameter LLM (Kimi K2.5) using a single GPU. The six seco…