Source: vercel.com

GLM 5.2 Fast via Wafer now available on AI Gateway

Vercel's AI Gateway now offers GLM 5.2 Fast via Wafer, delivering 2x higher throughput than other providers in benchmarking tests. The model achieves over 170 tok/s for small contexts and over 200 tok/s for large contexts, with no markup or platform fees.

Published Jun 24, 2026 · 1 min read

GLM 5.2 Fast via Wafer is now available on AI Gateway. In our own benchmarks across small-context, large-context, and tool-call scenarios, Wafer delivers 2x higher throughput than other providers serving GLM-5.2 on serverless. It leads on decode speed and end-to-end speed for sustained generation in both the small-context and large-context cases.

In our testing, GLM 5.2 Fast on Wafer measured:

- Small context: 170+ tok/s
- Large context: 200+ tok/s
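
To put these figures in perspective, here is a rough back-of-the-envelope sketch that converts decode throughput into generation time. The 1,000-token response size is an arbitrary illustration and `decodeSeconds` is a hypothetical helper, not part of any SDK. It assumes a steady decode rate and ignores time to first token, queueing, and network latency.

```typescript
// Estimated seconds to decode `outputTokens` at a steady `tokensPerSecond`.
// Assumption: constant decode rate; time to first token is not included.
function decodeSeconds(outputTokens: number, tokensPerSecond: number): number {
  if (tokensPerSecond <= 0) {
    throw new RangeError("tokensPerSecond must be positive");
  }
  return outputTokens / tokensPerSecond;
}

// Illustrative 1,000-token response at the measured lower bounds.
const smallContextSeconds = decodeSeconds(1000, 170); // roughly 5.9 s
const largeContextSeconds = decodeSeconds(1000, 200); // 5 s

console.log(smallContextSeconds.toFixed(1), largeContextSeconds.toFixed(1));
```

Because the reported numbers are lower bounds ("170+" and "200+"), these times are upper bounds under the same assumptions.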

To use GLM 5.2 Fast, set `model` to `zai/glm-5.2-fast` in the [AI SDK](https://ai-sdk.dev/).
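
A minimal sketch of that call follows. It assumes the AI SDK's `generateText` function, which accepts a plain model string and routes it through AI Gateway. To keep the sketch runnable without installing the `ai` package, `generateText` is passed in as a parameter. The `GenerateTextLike` type and the `askGlmFast` helper are hypothetical names introduced for illustration.

```typescript
// Model ID taken from the announcement.
const MODEL_ID = "zai/glm-5.2-fast";

// Minimal shape of the AI SDK's `generateText` that this sketch relies on.
// Declared locally (an assumption for illustration) so the file runs
// without the `ai` package installed.
type GenerateTextLike = (options: {
  model: string;
  prompt: string;
}) => Promise<{ text: string }>;

// Hypothetical helper: sends one prompt to GLM 5.2 Fast and returns the text.
async function askGlmFast(
  generateText: GenerateTextLike,
  prompt: string,
): Promise<string> {
  const { text } = await generateText({ model: MODEL_ID, prompt });
  return text;
}
```

In a real project this would be wired up with `import { generateText } from "ai"` and a call such as `askGlmFast((options) => generateText(options), "Summarize this changelog entry.")`, with AI Gateway credentials configured (for example through the `AI_GATEWAY_API_KEY` environment variable).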

AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations for higher-than-provider uptime. It includes built-in custom reporting, Zero Data Retention support, budgets for API keys, and more.

AI Gateway reflects provider pricing with no markup and does not charge a platform fee on inference, including on Bring Your Own Key (BYOK) requests.

Try GLM 5.2 Fast in the model playground.
