cd/entity/UpToDateยท homeโ€บ entitiesโ€บ UpToDate
grep -l @uptodate /news/*.json | wc -l โ†’ 1

UpToDate

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

21:30
2026-06-14
sparsethought.com
large-language-models

A bitter lesson for medicine, or a benchmark problem?

A Nature Medicine paper claiming general-purpose LLMs outperform specialized clinical tools on medical benchmarks is criticized for flawed methodology. The benchmark, Real Clinical Queries, evaluated โ€ฆ

// co-occurs with top 3 entities