llama.cpp's Creator Says 90% of AI Agents Will Ditch the Cloud: I Ran a 70B Model in One Command

Georgi Gerganov, creator of llama.cpp, predicts that 90% of AI agents will move from cloud to local inference. The author reports running a 70B-parameter model locally with a single command, which illustrates the trend toward decentralized AI.

When llama.cpp crossed 100,000 GitHub stars, Gerganov posted a half-joke that I haven’t stopped thinking about: “now…

Published on Towards AI.