A 3-billion-parameter model just scored 94.3 on AIME 2026. Gemini 3 Pro scored 91.7. The 3B model is from Weibo, it is MIT-licensed, and…
source & further reading
pub.towardsai.net — original article
I Gave Five AI Coding Agents a Way to Fact-Check the Docs They Were Handed. They Refused to Use It.
I Tested the Viral “Caveman” AI Trick. Here’s What It Actually Saves (And What It Doesn’t)
You Can’t Monitor an AI Agent Like a Web Service. Here’s What I Track Instead.