Source: nlp.elvissaravia.com · Topic: large-language-models

🤖 AI Agents Weekly: Gemini 3.5 Flash, Antigravity 2.0, Codex Thursday, Cohere Command A+, Qwen3.7-Max, and More

Google launched Gemini 3.5 Flash, a frontier model optimized for agents and coding, alongside Managed Agents in the Gemini API that provide an isolated execution environment. The model achieves state-of-the-art results on agentic benchmarks and is available across multiple Google platforms, positioning Flash as the cost-optimized default for agent workloads.

Published May 23, 2026 · 1 min read

In today’s issue:

Google ships Gemini 3.5 Flash for agents

Antigravity 2.0 becomes a full agent platform

OpenAI ships Appshots and /goal in Codex

Cohere open-sources Command A+ on Apache 2.0

Qwen3.7-Max runs agents for 35 hours straight

NVIDIA verifies agent skills

Cursor Composer 2.5 sharpens coding agents

Anthropic acquires Stainless for SDK tooling

Browserbase opens Browse.sh skills catalog

Gemini Omni unifies create-anything model

OpenAI cracks an 80-year Erdős problem

Compiling agent workflows into model weights

PEEK orientation cache for long-context agents

SaaS-Bench exposes computer-use agent ceiling

And all the top AI dev news, papers, and tools.

Top Stories

Gemini 3.5 Flash and Managed Agents Land

Google opened I/O 2026 with Gemini 3.5 Flash, a frontier model tuned explicitly for agents and coding, alongside Managed Agents in the Gemini API that ship an isolated execution environment with every request.

Agentic benchmarks: Gemini 3.5 Flash posts 76.2% on Terminal-Bench 2.1, 83.6% on MCP Atlas, and 1656 Elo on GDPval-AA, outperforming Gemini 3.1 Pro on long-horizon coding and tool-use tasks at 4x faster output.

Managed Agents preview: A single Gemini API call spins up an agent that reasons, uses tools, and executes code in an ephemeral Linux sandbox managed by Google, with AGENTS.md and SKILL.md as versionable config.

Where it ships: Available in Google AI Studio, Android Studio, Antigravity, Gemini Enterprise Agent Platform, the Gemini app, and AI Mode in Search, with 3.5 Pro slated for next month.

Why it matters: Flash is now the cost-optimized agent default at Google scale, and Managed Agents removes the build-your-own-sandbox tax that has kept many teams on third-party runtimes.
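
To make the "versionable config" idea behind Managed Agents concrete, here is a minimal local sketch in Python. It does not call the Gemini API, and the request format for Managed Agents is not described in this issue. Only the file names AGENTS.md and SKILL.md come from the story; the function name `load_agent_config`, the returned dictionary shape, and the 12-character fingerprint are assumptions made purely for illustration.

```python
import hashlib
import json
import tempfile
from pathlib import Path

# File names taken from the article; everything else here is hypothetical.
CONFIG_FILES = ("AGENTS.md", "SKILL.md")


def load_agent_config(repo_dir):
    """Read the config files that exist and return their text plus a fingerprint."""
    root = Path(repo_dir)
    files = {}
    for name in CONFIG_FILES:
        path = root / name
        if path.is_file():
            files[name] = path.read_text(encoding="utf-8")
    # Hash a canonical JSON dump so identical contents always yield the same ID.
    canonical = json.dumps(files, sort_keys=True).encode("utf-8")
    version = hashlib.sha256(canonical).hexdigest()[:12]
    return {"files": files, "version": version}


def demo():
    """Show that editing a config file changes the fingerprint."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "AGENTS.md").write_text("Run tests before committing.\n", encoding="utf-8")
        (root / "SKILL.md").write_text("Skill: summarize a changelog.\n", encoding="utf-8")
        before = load_agent_config(root)
        (root / "SKILL.md").write_text("Skill: summarize release notes.\n", encoding="utf-8")
        after = load_agent_config(root)
    return before, after


if __name__ == "__main__":
    before, after = demo()
    print(before["version"], "->", after["version"])
```

Treating the two Markdown files as the unit of change means an agent's behavior can be reviewed, diffed, and rolled back like any other file in a repository, which is presumably the appeal of keeping them as plain text.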
