# BaseRT, A fast inference runtime for local AI on Apple Silicon

> Source: <https://www.basecompute.co/getbasert>
> Published: 2026-07-01 12:30:44+00:00

`$ curl -LsSf https://basecompute.co/install.sh | sh`

Up to 35% on Decode, up to 78% on Prefill.

Tokens / sec · Apple M4 Pro · 4-bit

Serve a model with BaseRT, point your agent at it, and keep everything on your machine. No API keys, no data leaving your device.
