🤖 AI Agents Weekly: Microsoft's Seven MAI Models, Gemma 4 12B, NVIDIA Nemotron 3 Ultra, Agents' Last Exam, Devin Desktop, and More

wpnews.pro

cd /news/large-language-models/ai-agents-weekly-microsoft-s-seven-m… · home › topics › large-language-models › article

[ARTICLE · art-28652] src=nlp.elvissaravia.com ↗ pub=2026-06-06T15:01Z topic=large-language-models verified=true sentiment=· neutral

🤖 AI Agents Weekly: Microsoft's Seven MAI Models, Gemma 4 12B, NVIDIA Nemotron 3 Ultra, Agents' Last Exam, Devin Desktop, and More

Microsoft launched seven in-house MAI models, including its first reasoning model MAI-Thinking-1, aiming for self-sufficiency from OpenAI. Google released Gemma 4 12B, an open multimodal model for agentic reasoning on consumer hardware. NVIDIA opened Nemotron 3 Ultra, a 550B parameter model, while Anthropic warned about recursive self-improvement risks.

read2 min views1 publishedJun 6, 2026

Microsoft's Seven MAI Models, Gemma 4 12B, NVIDIA Nemotron 3 Ultra, Agents' Last Exam, Devin Desktop, and More

In today’s issue:

Microsoft ships seven new MAI models

MAI-Thinking-1 takes on Claude Sonnet Gemma 4 12B runs agents on a laptop

NVIDIA opens 550B Nemotron 3 Ultra

Anthropic warns of recursive self-improvement

Agents’ Last Exam stumps frontier agents

Claude Platform gets an ant CLI

Cognition launches Devin Desktop

Nous ships Hermes Desktop

Codex builds iOS apps end-to-end ChatGPT memory learns to dream

Multi-agent computer use beats solo CUAs

Economy of Minds prices agent actions

LEAP solves all 12 Putnam problems

A harness rewrites itself for +19 SWE points

And all the top AI dev news, papers, and tools.

Top Stories

Microsoft Launches Seven In-House MAI Models

Microsoft AI unveiled a family of seven models trained from scratch, led by MAI-Thinking-1, its first reasoning model, in a bid for long-term self-sufficiency from OpenAI.

MAI-Thinking-1: A 35B reasoning model that scores 97% on AIME and 53% on SWE-Bench Pro, with early testers preferring it side-by-side over Claude Sonnet 4.6 on overall quality.A full stack: The launch also ships MAI-Image-2.5 and Flash, MAI-Transcribe-1.5, MAI-Voice-2 and Flash, and MAI-Code-1-Flash for code generation.Clean training: Every model was trained on commercially licensed data with no distillation from third-party labs, which Microsoft frames as a hedge against legal risk for enterprise customers.Why it matters: Suleyman positions the release as a “hill-climbing machine,” a shared training infrastructure meant to keep Microsoft on the frontier as compute scales, and a direct shot at its biggest enterprise rival.

MAI-Thinking-1 ships with a detailed 109-page technical report.

Gemma 4 12B Brings Agentic Reasoning to Your Laptop

Google released Gemma 4 12B, a unified, encoder-free multimodal open model that brings agentic reasoning, vision, and native audio to consumer hardware under an Apache 2.0 license.

Encoder-free design: Vision inputs pass through a single lightweight matrix multiplication and audio is projected directly into the same space as text tokens, dropping separate modality encoders.Runs locally: Fits in 16GB of VRAM or unified memory, small enough for a laptop, with support across LM Studio, Ollama, and Google AI Edge Gallery.Punches up: Reaches performance nearing Google’s larger 26B MoE model at less than half the memory footprint, and is the first mid-sized Gemma with native audio input.Community traction: The release topped Hacker News, with builders showing it running on a 10-year-old Xeon CPU.

source & further reading

nlp.elvissaravia.com — original article Autonomous Long-Running Coding Agents 🥇Top AI Papers of the Week 🤖 AI Agents Weekly: Claude Fable 5, Kimi K2.7-Code, NotebookLM Goes Agentic, DiffusionGemma, MiMo Code, and More

~/api · this article 200

$curl api.wpnews.pro/v1/news/ai-agents-weekly-microso…

Read original on nlp.elvissaravia.com → nlp.elvissaravia.com/p/ai-agents-weekly-microsof…

mentioned entities

Microsoft

Google

NVIDIA

Anthropic

Cognition

Nous

Codex

OpenAI

metadata

slugai-agents-weekly-microsoft-s-seven-mai-models-gemma-4-12b-nvidia-nemotron-3-last

topic#large-language-models

secondary4 topics

sentimentneutral

canonicalnlp.elvissaravia.com

navigation

← prev“Whoever builds the most joyous …

next →Blackbaud Touts AI Agents, Payme…

── more in #large-language-models 4 stories · sorted by recency

marktechpost.com · 15 Jun · #large-language-models

Sakana AI Commercializes AB-MCTS in Sakana Marlin, an Enterprise Agent Generating Up to 100-Page Research Reports With Slides

code.visualstudio.com · 17 Jun · #large-language-models

Visual Studio Code 1.125

cryptobriefing.com · 15 Jun · #large-language-models

Amazon invests $20B to build new data center campus in Maryland

pub.towardsai.net · 15 Jun · #large-language-models

Agentic Inference Deployment: From Prose Skills to Deployed Endpoints

── more on @microsoft 3 stories trending now

wpnews · 15 Jun · #artificial-intelligence

Facebook now has an AI search engine that pulls answers from your Group posts and Reels

wpnews · 15 Jun · #large-language-models

The Grain of Thought

wpnews · 15 Jun · #developer-tools

How I Built a Zero-Dependency Token Compressor for AI Coding Agents (During My High School Exams)

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required