UK Biobank

mentions 1 type Person feed RSS

// recent coverage 1 mentions

00:00

2026-05-16

sparsethought.com

large-language-models

curation all the way down: on clinical AI benchmarks

The medARC group released Medmarks v1.0, the largest fully open medical LLM evaluation suite, featuring 30 benchmarks across verifiable and open-ended subsets, covering 61 models on 71 configurations.…

// co-occurs with top 7 entities

medARC 1 Medmarks 1 PrimeIntellect-ai 1 OpenAI 1 All of Us 1 Human Phenotype Project 1 NEJM 1