$ curl -LsSf https://basecompute.co/install.sh | sh
Up to 35% on Decode, up to 78% on Prefill.
Tokens / sec · Apple M4 Pro · 4-bit
Serve a model with BaseRT, point your agent at it, and keep everything on your machine. No API keys, no data leaving your device.
source & further reading
basecompute.co — original article