Virtuals integrates Leyten’s distributed GPU inference engine to run GLM-5.2 across its AI agent network

Source: cryptobriefing.com · Published: 2026-06-20T12:44Z · Topic: artificial-intelligence

Virtuals Protocol has integrated Leyten's distributed GPU inference engine to run GLM-5.2, an open-weight AI model with roughly 744 billion parameters, across its decentralized AI agent network. The partnership enables frontier-scale AI inference without centralized cloud providers and gives autonomous onchain agents a model with a 1 million token context window.

The integration lets Virtuals split a 744 billion parameter model across multiple GPUs, a key step toward running frontier-scale AI in decentralized environments

Running a model with roughly 744 billion parameters is not something you do on a single graphics card. Virtuals Protocol just partnered with Leyten to make sure it doesn’t have to.

The AI agent platform has integrated Leyten’s shard engine, a system designed to distribute large-model inference across multiple GPUs over a network. The immediate target: running GLM-5.2, the open-weight model from Z.ai that dropped publicly under an MIT license on June 16, 2026. The combination gives Virtuals a path to frontier-scale AI inference without relying on centralized cloud providers or single massive GPU clusters.

What GLM-5.2 actually is, and why it matters here

GLM-5.2 is a big model. We’re talking approximately 744 billion total parameters, though only around 39 to 40 billion are active per token. In English: the model uses a mixture-of-experts architecture that keeps most of its knowledge stored but only fires up a fraction of it for any given task, keeping compute costs manageable despite the enormous overall size.
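To make the "only a fraction fires" idea concrete, here is a minimal toy sketch of top-k expert routing, the general mechanism behind mixture-of-experts layers. The parameter counts come from the figures above (39.5 billion is simply the midpoint of the quoted range). Everything else is an assumption for illustration: the four toy experts, the router scores, `k = 2`, and the function names are hypothetical and are not GLM-5.2's actual configuration, which the article does not describe.

```python
import math

TOTAL_PARAMS = 744e9    # approximate total parameters, from the article
ACTIVE_PARAMS = 39.5e9  # assumed midpoint of the article's 39-40 billion range


def active_fraction(total, active):
    """Share of the model's parameters used for a single token."""
    return active / total


def top_k_route(router_scores, k):
    """Pick the k highest-scoring experts and softmax-normalize their scores."""
    chosen = sorted(range(len(router_scores)),
                    key=lambda i: router_scores[i], reverse=True)[:k]
    peak = max(router_scores[i] for i in chosen)
    exps = [math.exp(router_scores[i] - peak) for i in chosen]
    total = sum(exps)
    return chosen, [e / total for e in exps]


def moe_layer(x, experts, router_scores, k):
    """Run only the chosen experts and blend their outputs by router weight."""
    chosen, weights = top_k_route(router_scores, k)
    return sum(w * experts[i](x) for i, w in zip(chosen, weights))


if __name__ == "__main__":
    # Four toy "experts" that just scale their input (hypothetical).
    experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
    scores = [0.1, 2.0, -1.0, 1.5]  # made-up router scores for one token
    print(top_k_route(scores, k=2)[0])             # indices of the experts that run
    print(moe_layer(1.0, experts, scores, k=2))    # blended output of those two
    print(f"{active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS):.1%}")
```

With the article's numbers, the active share works out to a little over 5 percent of the total, which is why per-token compute stays far below what the headline parameter count suggests.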

The model also ships with a context window of 1 million tokens. That’s five times larger than its predecessor, GLM-5.1.

Z.ai released GLM-5.2 to subscribers on June 13, 2026, before making it publicly available three days later. The MIT license means anyone can use, modify, and deploy it commercially.

How Leyten’s shard engine solves the hardware problem

Leyten's answer is to avoid loading the whole model on any one machine. Its shard engine uses pipeline-parallel inference, which slices a large model into consecutive pieces and distributes those pieces across separate GPUs connected over a network. Each GPU runs its piece and passes the intermediate result to the next, so no single node needs to hold the entire model in memory.
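The idea can be sketched in a few lines. This is a simplified illustration of pipeline parallelism in general, not Leyten's shard engine, whose internals the article does not describe. The function names, the choice of 16 stages, and the assumption of 16-bit (2-byte) weights are all hypothetical. The memory estimate covers weights only and ignores activations and the cache needed for long contexts.

```python
def partition_layers(num_layers, num_stages):
    """Split layer indices into contiguous, near-equal stages (one per GPU)."""
    base, extra = divmod(num_layers, num_stages)
    stages, start = [], 0
    for s in range(num_stages):
        size = base + (1 if s < extra else 0)
        stages.append(range(start, start + size))
        start += size
    return stages


def weights_gb_per_stage(total_params, bytes_per_param, num_stages):
    """Rough weight memory per stage in GB, assuming an even split."""
    return total_params * bytes_per_param / num_stages / 1e9


def run_pipeline(x, layers, stages):
    """Run stages in order; each hand-off stands in for a network transfer."""
    for stage in stages:
        for i in stage:
            x = layers[i](x)
        # In a real deployment the activation `x` would now be sent
        # over the network to the GPU that holds the next stage.
    return x


if __name__ == "__main__":
    # Ten toy "layers" that each add a constant (hypothetical stand-ins).
    layers = [lambda v, k=k: v + k for k in range(10)]
    stages = partition_layers(len(layers), num_stages=3)
    print([list(s) for s in stages])
    print(run_pipeline(0, layers, stages))
    # 744 billion parameters at an assumed 2 bytes each, over 16 assumed stages.
    print(weights_gb_per_stage(744e9, 2, 16), "GB of weights per stage")
```

Under those assumptions the full set of weights comes to roughly 1,488 GB, far beyond a single card, while a 16-way split leaves about 93 GB per stage. The trade-off is that every token must cross the network once per stage boundary, so link latency matters as much as GPU speed.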

Where Virtuals Protocol fits in the AI agent landscape

Virtuals Protocol operates in the AI agent vertical of crypto, specifically focused on the creation and monetization of onchain AI agents. These are autonomous digital entities that can transact, execute tasks, and interact with blockchain protocols. The ecosystem runs on its native token, VIRTUAL.

GLM-5.2 has been positioned as competitive with proprietary frontier models but at significantly lower operational costs. A model with 1 million tokens of context and strong agentic coding capabilities is precisely the kind of foundation that makes autonomous agent behavior plausible rather than aspirational.

Disclosure: This article was edited by Editorial Team. For more information on how we create and review content, see our Editorial Policy.

source & further reading

cryptobriefing.com: original article

Read original on cryptobriefing.com → cryptobriefing.com/virtuals-leyten-distributed-g…
