2026-05-10
jola.dev
large-language-models
Running local models on an M4 with 24GB memory
The article describes the author's successful setup for running local AI models on an M4 Mac with 24GB of memory, specifically highlighting Qwen 3.5-9B (Q4 quantized) as the best performing model at ~…