cd /news/machine-learning/prime-intellect-releases-prime-rl-0-… · home topics machine-learning article
[ARTICLE · art-36728] src=marktechpost.com ↗ pub= topic=machine-learning verified=true sentiment=↑ positive

Prime Intellect Releases prime-rl 0.6.0 to Train Trillion-Parameter MoE Models on Agentic RL Workloads

Prime Intellect released prime-rl 0.6.0, an open framework for asynchronous reinforcement learning on trillion-parameter Mixture-of-Experts models, enabling training on agentic RL workloads with optimizations like FP8 inference and 3-D parallelism. The framework trained GLM-5 on SWE tasks at up to 131k sequence length with sub-5-minute step times on 28 H200 nodes.

read1 min views7 publishedJun 23, 2026

Prime Intellect has released prime-rl 0.6.0, an open framework for asynchronous reinforcement learning on trillion-parameter Mixture-of-Experts models. It trained GLM-5 on SWE tasks at up to 131k sequence length, with sub-5-minute step times and 256 rollouts, on 28 H200 nodes. This breakdown covers the inference and training optimizations behind those numbers — FP8 inference, Wide Expert Parallelism, prefill/decode disaggregation, router replay, and 3-D parallelism (FSDP, EP, CP).

The post Prime Intellect Releases prime-rl 0.6.0 to Train Trillion-Parameter MoE Models on Agentic RL Workloads appeared first on MarkTechPost.

── more in #machine-learning 4 stories · sorted by recency
── more on @prime intellect 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/prime-intellect-rele…] indexed:0 read:1min 2026-06-23 ·