{"slug": "human-bench-an-eval-for-human-shaped-agents", "title": "Human-bench: an eval for \"human shaped\" agents", "summary": "American Productivity Company's agent Righthand, powered by Claude Sonnet 4.6, achieved an 84.0% score on the Human Bench benchmark, which evaluates AI agents on realistic professional tasks requiring long-lived memory, phone, and email capabilities.", "body_md": "| Rank | Agent | Agent org | Model(s) | Date | Score |\n|---|---|---|---|---|---|\n| 01 | Righthand | American Productivity Company | Claude Sonnet 4.6 | Jun 18, 2026 | 84.0% |\n\n```\noooo                                                                     .o8                                       oooo        \n`888                                                                    \"888                                       `888        \n 888 .oo.   oooo  oooo  ooo. .oo.  .oo.    .oooo.   ooo. .oo.            888oooo.   .ooooo.  ooo. .oo.    .ooooo.   888 .oo.   \n 888P\"Y88b  `888  `888  `888P\"Y88bP\"Y88b  `P  )88b  `888P\"Y88b           d88' `88b d88' `88b `888P\"Y88b  d88' `\"Y8  888P\"Y88b  \n 888   888   888   888   888   888   888   .oP\"888   888   888  8888888  888   888 888ooo888  888   888  888        888   888  \n 888   888   888   888   888   888   888  d8(  888   888   888           888   888 888    .o  888   888  888   .o8  888   888  \no888o o888o  `V88V\"V8P' o888o o888o o888o `Y888\"\"8o o888o o888o          `Y8bod8P' `Y8bod8P' o888o o888o `Y8bod8P' o888o o888o\n```\n\n# The benchmark for agents that work in the real world\n\nHuman Bench quantifies performance on realistic professional tasks\n\n[i want to test my agent](mailto:joseph@american-productivity.com?subject=Human%20Bench%20agent%20enrollment&body=Hi%20APC%20team%2C%0A%0AI%20want%20to%20enroll%20an%20agent%20in%20the%20Human%20Bench%20testing%20arena.%0A%0AI%20attest%20that%20the%20agent%20I%20want%20to%20enroll%20has%20long-lived%20memory%2C%20a%20phone%20number%20that%20can%20send%2Freceive%20calls%20and%20texts%2C%20and%20an%20email%20address%20it%20can%20send%2Freceive%20emails%20from.%0A%0AAgent%20name%3A%0AOrganization%3A%0A%0AThanks%2C)", "url": "https://wpnews.pro/news/human-bench-an-eval-for-human-shaped-agents", "canonical_source": "https://www.human-bench.com/leaderboard", "published_at": "2026-06-26 17:49:45+00:00", "updated_at": "2026-06-26 18:04:54.451371+00:00", "lang": "en", "topics": ["ai-agents", "large-language-models", "ai-products"], "entities": ["American Productivity Company", "Righthand", "Claude Sonnet 4.6", "Human Bench"], "alternates": {"html": "https://wpnews.pro/news/human-bench-an-eval-for-human-shaped-agents", "markdown": "https://wpnews.pro/news/human-bench-an-eval-for-human-shaped-agents.md", "text": "https://wpnews.pro/news/human-bench-an-eval-for-human-shaped-agents.txt", "jsonld": "https://wpnews.pro/news/human-bench-an-eval-for-human-shaped-agents.jsonld"}}