grep -l @turboquant /news/*.json | wc -l → 1

Turboquant

mentions 1 · type Organization

// recent coverage 1 mention

2026-06-23 22:01 · pub.towardsai.net · large-language-models

A GPU-Poor’s Guide to Local LLM Inference in 2026

A 35-billion-parameter Mixture-of-Experts model runs at 28 tokens per second with full 128K context on a 2019 gaming laptop with a GTX 1660 Ti and 6 GB VRAM using llama.cpp's --n-cpu-moe flag and Turb…

// co-occurs with top 7 entities