[ARTICLE · art-35183] src=news.ycombinator.com ↗ pub= topic=ai-agents verified=true sentiment=· neutral

Ask HN: What are some good benchmarks for different agent harnesses?

A Hacker News user asks the community for recommendations on benchmarks to evaluate different agent harnesses, noting that Terminal Bench does not align with their experience.

Published Jun 20, 2026 · 1 min read

2 points by Bnjoroge 9 minutes ago

Other than Terminal Bench, which doesn't quite map to my experience, what are some other benchmarks to see how different models do in different harnesses?
