GLM-5.2 is the leading open weights model on the Intelligence Index v4.1

wpnews.pro

cd /news/large-language-models/glm-5-2-is-the-leading-open-weights-… · home › topics › large-language-models › article

[ARTICLE · art-30704] src=xcancel.com ↗ pub=2026-06-17T08:12Z topic=large-language-models verified=true sentiment=↑ positive

GLM-5.2 is the leading open weights model on the Intelligence Index v4.1

Z ai's GLM-5.2, a 744B-parameter open weights model with 40B active parameters, achieved a score of 51 on the Artificial Analysis Intelligence Index v4.1, surpassing competitors MiniMax-M3 and DeepSeek V4 Pro. It also leads on the GDPval-AA v2 agentic benchmark with a score of 1524, matching proprietary models like GPT-5.5, and is available under an MIT license with a 1M-token context window.

read3 min views31 publishedJun 17, 2026

Z ai’s GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index scoring 51 and it sits on the Pareto frontier of Intelligence vs Cost per Task

@Zai_org’s GLM-5.2 is the same size as GLM-5.1 (744B total / 40B active parameters) but scores 11 points higher on the Intelligence Index v4.1, placing ahead of MiniMax-M3 (44) and DeepSeek V4 Pro (max, 44). On the first-party API it is priced in line with GLM-5.1 at $1.4/$4.4/$0.26 per 1M input/output/cache hit tokens Key results: ➤ GLM-5.2 is the leading open weights model on the Intelligence Index v4.1. At 51, it leads MiniMax-M3 (44), DeepSeek V4 Pro (max, 44) and Kimi K2.6 (43) ➤ Improvements across most evaluations, particularly scientific reasoning: GLM-5.2 gains over GLM-5.1 on most evaluations, led by scientific reasoning on CritPt (+16 points to 21%) and HLE (+12 points to 40%), alongside AA-LCR (+9 points to 71%), tau3 banking (+15 points to 27%) and SciCode (+7 points to 50%). TerminalBench v2.1 also improves (+16 points to 78%) and GPQA Diamond gains 3 points to 89% ➤ Leading open weights model on GDPval-AA v2 and competitive with proprietary models: GLM-5.2 scores 1524 on GDPval-AA v2, ahead of MiniMax-M3 (1418) and DeepSeek V4 Pro (max, 1328). This impressive result places GLM-5.2 in-line with proprietary models including GPT-5.5 (xhigh reasoning). GDPval-AA v2 builds on the original GDPval-AA by baselining Elo to human performance at 1000, introducing a rotating panel of frontier-model judges, and raising the turn limit from 100 to 250 for longer-horizon agent trajectories ➤ GLM-5.2 uses more output tokens per task than other leading open weights models: the model uses 43k output tokens per Intelligence Index task, up from GLM-5.1 (26k) and above MiniMax-M3 (24k), Kimi K2.6 (35k) and DeepSeek V4 Pro (max, 37k) ➤ On the Intelligence vs. Cost per Task Pareto Frontier: GLM-5.2 is on the Pareto frontier of the Intelligence vs Cost per Task chart, with the lowest cost per task among models at its intelligence level. GLM-5.2 costs ~$0.46 per task, compared to GLM-5.1 ($0.25), Kimi K2.6 ($0.31), MiniMax-M3 ($0.18) and DeepSeek V4 Pro (max, $0.05) Additional Model Details: ➤ License: MIT ➤ Size: 744B total parameters, 40B active parameters, equivalent to GLM-5.1 ➤ Context window: 1M tokens, up from 200K on GLM-5.1 ➤ Pricing: $1.4/$0.26/$4.4 per 1M input/cache hit/output tokens ➤ Availability: Alongside Z ai's first-party API, GLM-5.2 is available across third-party providers including@DeepInfra,@novita_labs,@nebiusai,@parasailnetwork,@SiliconFlowAI,@gmi_cloud,@Basetenand@FireworksAI_HQJun 17, 2026 · 6:41 AM UTC

489

45,739

GLM-5.2 leads all open weights models on GDPval-AA v2, our primary metric for real-world agentic performance. At 1524 it places ahead of MiniMax-M3 (1418) and DeepSeek V4 Pro (max, 1328), and is effectively level with GPT-5.5 (xhigh, 1514). We visually inspected GLM-5.2's outputs across a range of GDPval-AA tasks. We have attached a selection below

2,551

GLM-5.2 scores 4 on the AA-Omniscience Index, up from GLM-5.1 (2). The gain comes from both higher accuracy (25.1% vs 24.2%) and a lower hallucination rate (28.1% vs 29.4%), with attempt rate flat at 47%

2,020

GLM-5.2 uses 43k output tokens per Intelligence Index task, of which 37k is reasoning. This is up from GLM-5.1 (26k) and higher than open weights peers MiniMax-M3 (24k) and Kimi K2.6 (35k), placing it among the less token-efficient open weights models at its intelligence level. GLM-5.2 sits off the most attractive quadrant on the Intelligence vs Output Tokens chart

1,532

Breakdown of the individual evaluations in the Artificial Analysis Intelligence Index v4.1

2,624

Compare GLM-5.2 with other leading models at:

artificialanalysis.ai/models… 18 2,259

source & further reading

xcancel.com — original article AI companies are shredding rare books Now that Huang believes in open source, looking forward to CUDA open sourcing Jensen Huang on why open models matter

~/api · this article 200

$curl api.wpnews.pro/v1/news/glm-5-2-is-the-leading-o…

Read original on xcancel.com → xcancel.com/ArtificialAnlys/status/2067135640249…

mentioned entities

Z ai

GLM-5.2

MiniMax-M3

DeepSeek V4 Pro

Kimi K2.6

GPT-5.5

Artificial Analysis

DeepInfra

metadata

slugglm-5-2-is-the-leading-open-weights-model-on-the-intelligence-index-v4-1

topic#large-language-models

secondary4 topics

sentimentpositive

canonicalxcancel.com

navigation

← prevClaude Code Skills Turn It into …

next →Anthropic Paused Its Agent SDK B…

── more in #large-language-models 4 stories · sorted by recency

promptcube3.com · 2 Aug · #large-language-models

DoorDash + Chinese AI: Why the House Probe Misses the Point

sourcefeed.dev · 21 Jul · #large-language-models

LM Studio's Bionic Agent Is Also Its Cloud Pivot

dev.to · 2 Aug · #large-language-models

Aeglix Mind

unusualwhales.com · 2 Aug · #large-language-models

DeepMind Disbands AlphaFold Team, Pivots to Gemini

── more on @z ai 3 stories trending now

wpnews · 1 Aug · #ai-products

OpenAI Atlas Shuts Down August 9: Migration Guide

wpnews · 1 Aug · #ai-agents

Quality Isn't Accidental — Maker/Checker Separation and Automated Validation

wpnews · 1 Aug · #developer-tools

I Built a Portable AI Skill That Safely Upgrades .NET Applications

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required