Run Claude Code locally for free: mlx-serve on Apple Silicon

wpnews.pro

cd /news/developer-tools/run-claude-code-locally-for-free-mlx… · home › topics › developer-tools › article

[ARTICLE · art-47467] src=dev.to ↗ pub=2026-07-03T23:26Z topic=developer-tools verified=true sentiment=↑ positive

Run Claude Code locally for free: mlx-serve on Apple Silicon

A developer released mlx-serve, a native Zig server for MLX-format language models on Apple Silicon, enabling local, free, and private use of AI coding assistants like Claude Code. The server exposes OpenAI, Anthropic, and Ollama-compatible APIs from a single binary, achieving 35% faster decode than LM Studio on Gemma 4 E4B 4-bit. It requires no Python, conda, or Docker, and can be installed via Homebrew.

read1 min views1 publishedJul 3, 2026

Claude Code is the best AI coding assistant available right now. But it calls the Anthropic API by default, which adds up fast on long sessions.

What if you could run it entirely locally - free, private, and on hardware you already own?

mlx-serve makes this possible on any Apple Silicon Mac.

mlx-serve is a native Zig server for MLX-format language models on Apple Silicon. It exposes OpenAI-compatible, Anthropic-compatible, and Ollama-compatible HTTP APIs - all on a single port, from a single binary.

brew install mlx-serve

That's it. No Python. No conda. No Docker.

Claude Code looks for ANTHROPIC_BASE_URL

and ANTHROPIC_API_KEY

in your environment. mlx-serve implements the full Anthropic Messages API, so you just point Claude Code at it:

export ANTHROPIC_BASE_URL=http://localhost:8080
export ANTHROPIC_API_KEY=local
export ANTHROPIC_DEFAULT_MODEL=mlx-serve
mlx-serve --model ~/.mlx-serve/models/mlx-community/gemma-4-e4b-it-4bit --serve

Then launch Claude Code as normal. Streaming, tool calls, thinking blocks - all work.

Full setup guide: https://mlxserve.com/claude-code-local/

On Apple Silicon, mlx-serve achieves 35%+ faster decode than LM Studio on Gemma 4 E4B 4-bit. The server is written in Zig with no Python runtime overhead.

/api/chat

, /api/generate

, /api/embed

endpoints - works with Raycast, Open WebUI, Obsidian

source & further reading

dev.to — original article AI-Assisted AuthZ Review: Reading Permission Boundaries in Ory Kratos Mastering Local Deployment of SOTA LLMs: Jamesob’s Guide to Overcoming Resource Constraints Why 'Just Be Careful Next Time' Never Reaches an AI

~/api · this article 200

$curl api.wpnews.pro/v1/news/run-claude-code-locally-…

Read original on dev.to → dev.to/ddalcu/run-claude-code-locally-for-free-m…

mentioned entities

mlx-serve

Claude Code

Anthropic

Apple Silicon

Zig

MLX

Gemma 4

LM Studio

metadata

slugrun-claude-code-locally-for-free-mlx-serve-on-apple-silicon

topic#developer-tools

secondary3 topics

sentimentpositive

canonicaldev.to

navigation

← prevEvidence saturation k*: retrieva…

next →Letters: With GOP in control, ta…

── more in #developer-tools 4 stories · sorted by recency

github.com · 3 Jul · #developer-tools

Save Claude Code Tokens with Smart Routing

dev.to · 4 Jul · #developer-tools

Mastering Local Deployment of SOTA LLMs: Jamesob’s Guide to Overcoming Resource Constraints

startupfortune.com · 3 Jul · #developer-tools

HCLTech beats Infosys to land a $1.14 billion AI deal with Mercedes-Benz

letsdatascience.com · 3 Jul · #developer-tools

Claude Mac App Uses Electron, Developer Implicated

── more on @mlx-serve 3 stories trending now

wpnews · 27 May · #artificial-intelligence

How I Run Two Claude Accounts as One

wpnews · 28 May · #ai-startups

The Niche SaaS Opportunity Map 2026: Highly Demanded Subscribed Categories Beyond Mainstream

wpnews · 1 Jul · #ai-agents

Build agentic full-stack apps with Genkit

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required