Nerds.xyz

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

21:34

2026-06-20

science.slashdot.org

artificial-intelligence

OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model Failed 63.9% of the Test

OpenAI released LifeSciBench, a 750-task benchmark to evaluate AI systems on realistic life science research tasks. Its top-performing GPT-Rosalind model achieved only a 36.1% pass rate, failing nearl…

// co-occurs with top 4 entities

OpenAI 1 GPT-Rosalind 1 LifeSciBench 1 Slashdot 1