Quoting Georgi Gerganov Georgi Gerganov, creator of llama.cpp, reported that the Qwen3.6-27B model is highly capable for local coding tasks, using it daily on his M2 Ultra and RTX 5090 systems. He noted the model's utility for mundane maintenance tasks but limited use due to PR review commitments. I can 100% attest to the fact that Qwen3.6-27B is a very capable local model for coding tasks. Over the last month and a half I've been using it almost daily, either on my M2 Ultra or on my RTX 5090 box. I use it for small mundane tasks at ggml-org - nothing really impressive, but definitely a helpful tool for a maintainer. I think I would be using it much more, if I didn't have to spend a lot of my time on reviewing PRs. Currently, I have a very lightweight harness - the pi agent with everything stripped pi -nc --offline and a short system prompt to align it a bit with my style. — Georgi Gerganov https://news.ycombinator.com/item?id=48555993 48557304 , Hacker News comment on Running local models is good now https://vickiboykis.com/2026/06/15/running-local-models-is-good-now/ by Boykis Tags: georgi-gerganov https://simonwillison.net/tags/georgi-gerganov , llms https://simonwillison.net/tags/llms , ai https://simonwillison.net/tags/ai , generative-ai https://simonwillison.net/tags/generative-ai , pi https://simonwillison.net/tags/pi , ai-assisted-programming https://simonwillison.net/tags/ai-assisted-programming , local-llms https://simonwillison.net/tags/local-llms , qwen https://simonwillison.net/tags/qwen , coding-agents https://simonwillison.net/tags/coding-agents