Zeta2.1: 3x Fewer Tokens, 50ms Faster



Zed Industries released Zeta2.1, an updated edit prediction model that emits 3x fewer output tokens than its predecessor, reducing response times by up to 50 milliseconds and requiring 30% fewer servers to handle the same traffic. The efficiency gains come from a new "Multi-Region" prompt format that outputs only the code region the model intends to change, rather than a large area around the cursor. Zeta2.1 is open-weight, available on Hugging Face, and is now the default edit prediction model in the Zed editor.

Published May 8, 2026 · 2 min read

We launched Zeta2, Zed's edit prediction model, in March, and promised more improvements were on the way. Here they are.

Zeta2.1 emits 3x fewer output tokens than Zeta2, bringing predictions up to 50ms faster and requiring 30% fewer servers to serve the same traffic:

| Metric | Zeta2 | Zeta2.1 |
|---|---|---|
| Output tokens (avg) | ~270 | ~90 (−67%) |
| Response Time (p50) | 189ms | 136ms (−28%) |
| Response Time (p90) | 401ms | 350ms (−13%) |
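To make the arithmetic behind the table explicit, here is a minimal sketch that recomputes the deltas and percentages from the published figures. The token counts are approximate averages in the source, so the results are approximate too; the function and variable names are ours, chosen for illustration.

```python
def percent_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent (negative = reduction)."""
    return (after - before) / before * 100.0


# Figures copied from the table above (token counts are approximate averages).
metrics = {
    "Output tokens (avg)": (270, 90),
    "Response time p50 (ms)": (189, 136),
    "Response time p90 (ms)": (401, 350),
}


def summarize(rows):
    """Map each metric to (absolute delta, percent change rounded to a whole number)."""
    return {
        name: (after - before, round(percent_change(before, after)))
        for name, (before, after) in rows.items()
    }


summary = summarize(metrics)
token_ratio = metrics["Output tokens (avg)"][0] / metrics["Output tokens (avg)"][1]

for name, (delta, pct) in summary.items():
    print(f"{name}: {delta:+} ({pct:+}%)")
print(f"Output token ratio: {token_ratio:.1f}x")
```

The "3x fewer tokens" headline corresponds to the 270/90 ratio, and the "up to 50ms" figure corresponds to the roughly 50ms drops at both p50 and p90.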

These efficiency gains come from a new prompt format we've dubbed "Multi-Region". Zeta2 output a large region around your cursor with its edits applied; with the Multi-Region format, Zeta2.1 outputs only the region around the code it wants to change. This took several iterations to get right, but the result is even faster predictions on every keystroke.
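The general idea can be illustrated with a small sketch. This is not Zeta2.1's actual prompt format (the real examples are on Hugging Face); it only shows, under our own assumptions, why emitting just the changed span plus a little context is smaller than re-emitting the whole window around the cursor. The helper names `changed_region` and `apply_region` are hypothetical, and lines stand in for tokens.

```python
import difflib
from typing import List, Optional, Tuple


def changed_region(
    before: List[str], after: List[str], context: int = 1
) -> Optional[Tuple[int, int, List[str]]]:
    """Return (start, end, replacement): the smallest line span of `before`
    that covers every change, padded by `context` lines, and its new content."""
    matcher = difflib.SequenceMatcher(a=before, b=after, autojunk=False)
    ops = [op for op in matcher.get_opcodes() if op[0] != "equal"]
    if not ops:
        return None  # nothing to predict
    a_start = max(ops[0][1] - context, 0)
    a_end = min(ops[-1][2] + context, len(before))
    b_start = max(ops[0][3] - context, 0)
    b_end = min(ops[-1][4] + context, len(after))
    return a_start, a_end, after[b_start:b_end]


def apply_region(before: List[str], start: int, end: int, replacement: List[str]) -> List[str]:
    """Splice the replacement lines back into the original window."""
    return before[:start] + replacement + before[end:]


# Hypothetical editor window before and after a predicted edit.
window_before = [
    "fn total(items: &[Item]) -> u32 {",
    "    let mut sum = 0;",
    "    for item in items {",
    "        sum += item.price;",
    "    }",
    "    sum",
    "}",
]
window_after = list(window_before)
window_after[3] = "        sum += item.price * item.quantity;"

region = changed_region(window_before, window_after)
start, end, replacement = region
rebuilt = apply_region(window_before, start, end, replacement)

print(f"full-window output: {len(window_after)} lines")
print(f"changed-region output: {len(replacement)} lines")
```

In this toy case the full-window approach would emit all seven lines, while the region-only approach emits three, and splicing the region back in reproduces the same final text.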

Zeta2.1 is open-weight, just like Zeta1 and Zeta2.

You can see examples of the new prompt format and download the model on Hugging Face. As with Zeta2, Zeta2.1 was trained entirely on opt-in data in open-source repositories.

If you'd like to help contribute to future improvements, you can opt in by [toggling the data collection setting](zed://settings/edit_predictions.allow_data_collection).

Try It

Zeta2.1 is even better for running locally, and works [out of the box](https://zed.dev/docs/ai/edit-prediction#local-and-self-hosted-models).

Additionally, with this release we've begun publishing to PyPI bindings for the Rust code we use in production to format prompts, making it even easier to self-host.

Zeta2.1 is the default edit prediction model in Zed today. You can try it out for free, or check out Zed Pro or Zed Business for unlimited edit predictions.



Read original on zed.dev → zed.dev/blog/zeta2-1
