cd/entity/tau-bench· home entities tau-bench
grep -l @tau-bench /news/*.json | wc -l → 1

tau-bench

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

00:00
2026-06-29
okaneland.com
ai-agents

AI agents finish a third of the job, and the math says why

The best AI agents, including Gemini 2.5 Pro, complete only about 30% of realistic multi-step office tasks autonomously, according to Carnegie Mellon's TheAgentCompany benchmark. Salesforce's own CRMA…

// co-occurs with top 6 entities