MGPO

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

06:04

2026-06-23

devclubhouse.com

large-language-models

When a 3B Model Out-Reasons Opus 4.5, Read the Fine Print

Weibo's AI group released VibeThinker-3B, a 3-billion-parameter model that achieves 94.3 on AIME and outperforms Claude Opus 4.5 on competition reasoning, but collapses on general knowledge tasks. The…

// co-occurs with top 7 entities

Weibo 1 VibeThinker-3B 1 Claude Opus 4.5 1 Qwen2.5-Coder-3B 1 DeepSeek 1 GRPO 1 Rachel Goldstein 1