# 🤖 AI Agents Weekly: Gemini 3.5 Flash, Antigravity 2.0, Codex Thursday, Cohere Command A+, Qwen3.7-Max, and More

> Source: <https://nlp.elvissaravia.com/p/ai-agents-weekly-gemini-35-flash>
> Published: 2026-05-23 15:02:42+00:00

# 🤖 AI Agents Weekly: Gemini 3.5 Flash, Antigravity 2.0, Codex Thursday, Cohere Command A+, Qwen3.7-Max, and More

### Gemini 3.5 Flash, Antigravity 2.0, Codex Thursday, Cohere Command A+, Qwen3.7-Max, and More

In today’s issue:

Google ships Gemini 3.5 Flash for agents

Antigravity 2.0 becomes a full agent platform

OpenAI ships Appshots and /goal in Codex

Cohere open-sources Command A+ on Apache 2.0

Qwen3.7-Max runs agents for 35 hours straight

NVIDIA verifies agent skills

Cursor Composer 2.5 sharpens coding agents

Anthropic acquires Stainless for SDK tooling

Browserbase opens Browse.sh skills catalog

Gemini Omni unifies create-anything model

OpenAI cracks an 80-year Erdős problem

Compiling agent workflows into model weights

PEEK orientation cache for long-context agents

SaaS-Bench exposes computer-use agent ceiling

And all the top AI dev news, papers, and tools.

**Top Stories**

**Gemini 3.5 Flash and Managed Agents Land**

Google opened I/O 2026 with Gemini 3.5 Flash, a frontier model tuned explicitly for agents and coding, alongside Managed Agents in the Gemini API that ship an isolated execution environment with every request.

**Agentic benchmarks:** Gemini 3.5 Flash posts 76.2% on Terminal-Bench 2.1, 83.6% on MCP Atlas, and 1656 Elo on GDPval-AA, outperforming Gemini 3.1 Pro on long-horizon coding and tool-use tasks at 4x faster output.**Managed Agents preview:** A single Gemini API call spins up an agent that reasons, uses tools, and executes code in an ephemeral Linux sandbox managed by Google, with AGENTS.md and SKILL.md as versionable config.**Where it ships:** Available in Google AI Studio, Android Studio, Antigravity, Gemini Enterprise Agent Platform, the Gemini app, and AI Mode in Search, with 3.5 Pro slated for next month.**Why it matters:** Flash is now the cost-optimized agent default at Google scale, and Managed Agents removes the build-your-own-sandbox tax that has kept many teams on third-party runtimes.
