BaseRT, A fast inference runtime for local AI on Apple Silicon

wpnews.pro

cd /news/ai-infrastructure/basert-a-fast-inference-runtime-for-… · home › topics › ai-infrastructure › article

[ARTICLE · art-46624] src=basecompute.co ↗ pub=2026-07-01T12:30Z topic=ai-infrastructure verified=true sentiment=↑ positive

BaseRT, A fast inference runtime for local AI on Apple Silicon

BaseCompute released BaseRT, a fast inference runtime for local AI on Apple Silicon, claiming up to 35% faster decode and 78% faster prefill on an Apple M4 Pro with 4-bit quantization. The runtime allows users to serve models locally without API keys or data leaving their device.

read1 min views1 publishedJul 1, 2026

$ curl -LsSf https://basecompute.co/install.sh | sh

Up to 35% on Decode, up to 78% on Prefill.

Tokens / sec · Apple M4 Pro · 4-bit

Serve a model with BaseRT, point your agent at it, and keep everything on your machine. No API keys, no data leaving your device.

source & further reading

basecompute.co — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/basert-a-fast-inference-…

Read original on basecompute.co → www.basecompute.co/getbasert

mentioned entities

BaseCompute

BaseRT

Apple Silicon

Apple M4 Pro

metadata

slugbasert-a-fast-inference-runtime-for-local-ai-on-apple-silicon

topic#ai-infrastructure

secondary2 topics

sentimentpositive

canonicalbasecompute.co

navigation

← prevIn the AI era, US-China competit…

next →Stale RAG vs. expensive RAG: how…

── more in #ai-infrastructure 4 stories · sorted by recency

dev.to · 1 Jul · #ai-infrastructure

Stratagems #4: P Walked Into an AI Monitoring POC. P Didn't Run a Single Test.

news.ycombinator.com · 1 Jul · #ai-infrastructure

Ask HN: Move to Private Models?

wagtail.org · 1 Jul · #ai-infrastructure

Comparing open weight AI models and providers

fortune.com · 1 Jul · #ai-infrastructure

Cisco is rolling out AI agents to every single one of its 90,000 employees

── more on @basecompute 3 stories trending now

wpnews · 30 May · #ai-tools

I was wasting 10 minutes every Claude session. So I built a fix.

wpnews · 27 May · #machine-learning

hunting for headroom on modded-nanoGPT (WR #82)

wpnews · 2 Jun · #ai-products

Microsoft launches Discovery platform for scientific R&D with Ginkgo Bioworks partnership

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required