Tevatron

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

04:09

2026-06-05

github.com

artificial-intelligence

BrowseComp-Plus: A More Fair and Transparent Benchmark of Deep-Research Agent

Researchers at Tevatron released BrowseComp-Plus, a new benchmark designed to evaluate deep-research AI agents by isolating the effects of retrievers and large language models for fair and reproducibl…

// co-occurs with top 3 entities

OpenAI 1 BrowseComp 1 Hugging Face 1

// topics top 5 topics

artificial intelligence 1 large language models 1 ai agents 1 ai research 1 natural language processing 1