{"slug": "llama-cpp-now-has-an-official-website-llama-app", "title": "Llama.cpp now has an official website: llama.app", "summary": "The open-source AI inference engine llama.cpp has launched an official website at llama.app, providing users with a streamlined installation process via a single curl command. The platform enables local AI model execution without API keys, telemetry, or usage limits, and supports optimized performance across hardware from laptops to clusters.", "body_md": "[llama.app](./)\n\n[GitHub 112.2K](https://github.com/ggml-org/llama.cpp)\n\n`curl -LsSf https://llama.app/install.sh | sh`\n\nPrefer Brew or Winget?\n\n[Package managers](https://github.com/ggml-org/llama.cpp/blob/master/docs/install.md)Rather build from source?[Follow instructions](https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md)## AI that lives on your computer.\n\nOpen-source, private, always local.\n\nRun frontier AI entirely on your machine. No API keys, no telemetry, no limits. Take AI back.\n\n```\n# 1. Serve a model\nllama serve\n\n# 2. Install the pi-llama plugin\npi install git:github.com/huggingface/pi-llama\n\n# 3. Run Pi, everything is set\npi\n```\n\n## Pair it with a local coding agent.\n\nRun `llama serve`\n\n, then launch [Pi](https://github.com/badlogic/pi-mono). It auto-discovers your local model. No config, no API keys. Files stay on your machine,\nrequests never leave it.\n\n## Optimized for any hardware.\n\nFrom your laptop to a cluster, llama.cpp runs on whatever you have. Same binary, same models, same hand-tuned kernels for every GPU and CPU.\n\nApple Silicon M Ultra RTX 5090\n\nH100 MI300 RTX 4090\n\nM Max A100 DGX Spark T4\n\nJetson B200 Intel Arc\n\nCPU Radeon RX M Pro RTX 3090", "url": "https://wpnews.pro/news/llama-cpp-now-has-an-official-website-llama-app", "canonical_source": "https://llama.app/", "published_at": "2026-05-29 16:58:26+00:00", "updated_at": "2026-05-29 17:18:07.876181+00:00", "lang": "en", "topics": ["large-language-models", "ai-tools", "ai-infrastructure", "ai-products", "generative-ai"], "entities": ["llama.cpp", "llama.app", "GitHub", "Pi", "Hugging Face", "Apple", "NVIDIA", "AMD"], "alternates": {"html": "https://wpnews.pro/news/llama-cpp-now-has-an-official-website-llama-app", "markdown": "https://wpnews.pro/news/llama-cpp-now-has-an-official-website-llama-app.md", "text": "https://wpnews.pro/news/llama-cpp-now-has-an-official-website-llama-app.txt", "jsonld": "https://wpnews.pro/news/llama-cpp-now-has-an-official-website-llama-app.jsonld"}}