04:09
2026-06-05
github.com
artificial-intelligence
BrowseComp-Plus: A More Fair and Transparent Benchmark of Deep-Research Agent
Researchers at Tevatron released BrowseComp-Plus, a new benchmark designed to evaluate deep-research AI agents by isolating the effects of retrievers and large language models for fair and reproduciblβ¦