21:34
2026-06-20
science.slashdot.org
artificial-intelligence
OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model Failed 63.9% of the Test
OpenAI released LifeSciBench, a 750-task benchmark to evaluate AI systems on realistic life science research tasks. Its top-performing GPT-Rosalind model achieved only a 36.1% pass rate, failing nearl…