16:37
2026-06-30
loomcycle.dev
large-language-models
Local LLMs on a Ryzen 8700G iGPU: 13-15 tok/s on gemma4, 9-12 on qwen3.6
A developer rebuilt a home server using an AMD Ryzen 7 8700G APU with 96 GB of DDR5 to run local LLM inference alongside NAS and VM workloads. The Radeon 780M iGPU achieves 13-15 tok/s on gemma4 and 9…