GLM 5.2 Fast via Wafer now available on AI Gateway


Source: vercel.com · Published: 2026-06-24 · Topic: large-language-models


Vercel's AI Gateway now offers GLM 5.2 Fast via Wafer, which delivered 2x higher throughput than other serverless providers of the model in Vercel's own benchmarks. The model measured over 170 tok/s for small contexts and over 200 tok/s for large contexts, and AI Gateway adds no markup or platform fee.

1 min read · Published Jun 24, 2026

GLM 5.2 Fast via Wafer is now available on AI Gateway. Based on our own benchmarking across small-context, large-context, and tool-call scenarios, Wafer delivers 2x higher throughput than other providers serving GLM-5.2 on serverless, and it leads on decode and end-to-end speed for sustained generation in the small- and large-context cases.

In our testing, GLM 5.2 Fast on Wafer measured:

Small context: 170+ tok/s

Large context: 200+ tok/s
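As a rough illustration of what these figures mean in practice, the sketch below converts the measured throughput floors into an upper bound on decode time for a given output length. It assumes the benchmark numbers carry over to your workload and ignores time to first token and network latency; `maxDecodeSeconds` and `THROUGHPUT_TOK_PER_S` are hypothetical names introduced here.

```typescript
// Throughput floors reported in the article, in tokens per second.
const THROUGHPUT_TOK_PER_S = { small: 170, large: 200 } as const;

type ContextSize = keyof typeof THROUGHPUT_TOK_PER_S;

// Upper bound on decode time in seconds: output tokens divided by the throughput floor.
// Assumption: the benchmarked rate holds for the whole generation.
function maxDecodeSeconds(outputTokens: number, context: ContextSize): number {
  if (outputTokens < 0) {
    throw new RangeError('outputTokens must be non-negative');
  }
  return outputTokens / THROUGHPUT_TOK_PER_S[context];
}

console.log(maxDecodeSeconds(1000, 'large'));            // 5
console.log(maxDecodeSeconds(1000, 'small').toFixed(2)); // "5.88"
```

Because the article reports "170+" and "200+", actual decode times at those rates would be at or below these bounds.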

To use GLM 5.2 Fast, set `model` to `zai/glm-5.2-fast` in the [AI SDK](https://ai-sdk.dev/).
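A minimal sketch of that setting, assuming a recent AI SDK version in which a plain string model ID is routed through AI Gateway and credentials are read from the environment (for example `AI_GATEWAY_API_KEY`). The helper `askGlmFast`, the `GenerateFn` type, and the `echoGenerate` stand-in are hypothetical names for illustration; the generation call is passed in so the sketch runs without network access or an API key.

```typescript
// Model ID from the article; everything else here is illustrative.
const MODEL_ID = 'zai/glm-5.2-fast';

// Hypothetical minimal shape of a text-generation call. The AI SDK's
// `generateText` is assumed to accept these options and resolve to an
// object with a `text` field.
type GenerateFn = (options: { model: string; prompt: string }) => Promise<{ text: string }>;

// Sends one prompt to GLM 5.2 Fast through whichever generate function is supplied.
async function askGlmFast(generate: GenerateFn, prompt: string): Promise<string> {
  const { text } = await generate({ model: MODEL_ID, prompt });
  return text;
}

// Offline stand-in that echoes the options it receives, for trying the helper locally.
const echoGenerate: GenerateFn = async ({ model, prompt }) => ({
  text: `[${model}] ${prompt}`,
});

askGlmFast(echoGenerate, 'Hello').then(console.log); // prints "[zai/glm-5.2-fast] Hello"
```

In an application, the stand-in would be replaced by the SDK call, for example `askGlmFast((options) => generateText(options), prompt)` with `generateText` imported from the `ai` package; check the AI SDK documentation for the exact option types in your version.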

AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations for higher-than-provider uptime. It includes built-in custom reporting, Zero Data Retention support, budgets for API keys, and more.

AI Gateway reflects provider pricing with no markup and does not charge a platform fee on inference, including on Bring Your Own Key (BYOK) requests.

Try GLM 5.2 Fast in the model playground.


Read original on vercel.com → vercel.com/changelog/glm-5-2-fast-via-wafer-now-…

