Sina Weibo’s VibeThinker-3B matches flagship AI models with just 3 billion parameters

wpnews.pro

cd /news/large-language-models/sina-weibos-vibethinker-3b-matches-f… · home › topics › large-language-models › article

[ARTICLE · art-30348] src=cryptobriefing.com ↗ pub=2026-06-17T00:37Z topic=large-language-models verified=true sentiment=↑ positive

Sina Weibo’s VibeThinker-3B matches flagship AI models with just 3 billion parameters

A nine-person team at Chinese social media company Sina Weibo developed VibeThinker-3B, a 3-billion-parameter language model that matches the reasoning performance of flagship AI models hundreds of times larger, scoring 94.3 on the AIME 2026 math competition. The model is open-sourced under MIT license, challenging the economics of AI development by achieving frontier-level results at a fraction of the cost.

read3 min views31 publishedJun 17, 2026

A nine-person team at a Chinese social media company just embarrassed the biggest names in AI with a model that's hundreds of times smaller than the competition

A language model with 3 billion parameters just matched the reasoning performance of systems that are 200 times its size. The team behind it doesn’t work at OpenAI, Google DeepMind, or Anthropic. They work at a microblogging company.

Sina Weibo, the Chinese social media platform most people associate with viral posts rather than frontier AI research, published a 14-page technical report on arXiv detailing VibeThinker-3B. The model scored 94.3 on AIME 2026, one of the most demanding standardized math competitions in the world, placing it alongside DeepSeek V3.2 and its 671 billion parameters.

Small model, big numbers #

The benchmark results tell the story. On AIME 2026, VibeThinker-3B hit 94.3, a score that climbs to 97.1 when using claim-level test-time scaling. On LiveCodeBench v6, a coding benchmark, it posted a Pass@1 score of 80.2. The model also demonstrated superior out-of-distribution performance on recent LeetCode contests, often matching or beating those much larger systems.

The model is built on top of Qwen2.5-Coder-3B as its base architecture. The Sina Weibo team, comprising nine researchers including Sen Xu, Shixi Liu, and Wei Wang, enhanced performance through a combination of curriculum-based supervised fine-tuning, multi-domain reinforcement learning, and offline self-distillation techniques. The paper also introduces the Parametric Compression-Coverage Hypothesis, which offers a theoretical framework for why smaller models can punch above their weight in structured reasoning tasks.

The efficiency arms race #

VibeThinker-3B didn’t come out of nowhere. Its predecessor, VibeThinker-1.5B, launched in November 2025 and used the Spectrum-to-Signal Principle to achieve impressive results at a training cost of roughly $7,800. For context, training frontier models at companies like OpenAI and Google typically costs tens or hundreds of millions of dollars.

The 3B version extends the approach from the earlier model with more sophisticated training techniques, including curriculum-based supervised fine-tuning, multi-domain reinforcement learning, and offline self-distillation.

Both the model weights and codebase are fully open. Weights are available on Hugging Face and code is hosted on GitHub, both under the MIT license.

What this means for the AI investment landscape #

VibeThinker-3B has no direct connection to crypto, blockchain, or tokenization. The researchers didn’t mention anything remotely related to Web3.

For the decentralized AI projects in the crypto space that are working on inference, model hosting, and compute marketplaces, the efficiency trend is constructive. Smaller, high-performance models are easier to run on distributed networks, require less specialized hardware, and are more practical for edge deployment. A 3-billion-parameter model that performs like a 671-billion-parameter model is exactly the kind of development that makes decentralized inference economically viable rather than a theoretical exercise. When models this capable are released freely under an MIT license, the barrier to building competitive AI applications drops dramatically. That is potentially challenging for any company, crypto or otherwise, that is trying to monetize model access as a core revenue stream.

Disclosure: This article was edited by Editorial Team. For more information on how we create and review content, see our

Editorial Policy.

source & further reading

cryptobriefing.com — original article Meta’s new paper exposes why reinforcement learning struggles with code optimization, and how to fix it Mexico emerges as key player in US AI infrastructure boom Wall Street gains as Amazon eases AI concerns

~/api · this article 200

$curl api.wpnews.pro/v1/news/sina-weibos-vibethinker-…

Read original on cryptobriefing.com → cryptobriefing.com/sina-weibo-vibethinker-3b-mat…

mentioned entities

Sina Weibo

VibeThinker-3B

DeepSeek V3.2

Qwen2.5-Coder-3B

OpenAI

Google DeepMind

Anthropic

Hugging Face

metadata

slugsina-weibos-vibethinker-3b-matches-flagship-ai-models-with-just-3-billion

topic#large-language-models

secondary4 topics

sentimentpositive

canonicalcryptobriefing.com

navigation

← prevUS government holds off on black…

next →Humans Still Beat AI in the Long…

── more in #large-language-models 4 stories · sorted by recency

marktechpost.com · 19 Jun · #large-language-models

VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline

the-decoder.com · 28 Jun · #large-language-models

Sina's open model VibeThinker-3B aims to show reasoning compresses well but factual knowledge doesn't

officechai.com · 1 Aug · #large-language-models

OpenAI Says It Has Solved 10 Open Math Problems Using Astra, Its New Model

twitter.com · 1 Aug · #large-language-models

GPT has proved nonsofic groups are exist

── more on @sina weibo 3 stories trending now

wpnews · 30 Jul · #artificial-intelligence

Microsoft and Meta Earnings Show Different AI Spending Pressures

wpnews · 1 Aug · #ai-agents

Quality Isn't Accidental — Maker/Checker Separation and Automated Validation

wpnews · 1 Aug · #developer-tools

I Built a Portable AI Skill That Safely Upgrades .NET Applications

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required