cd/entity/CLExEvalΒ· homeβ€Ί entitiesβ€Ί CLExEval
grep -l @clexeval /news/*.json | wc -l β†’ 1

CLExEval

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

08:25
2026-07-01
machinebrief.com
large-language-models

The Mirage of AI in Clinical Diagnostics: A Sobering Reality Check

A new research framework called CLExEval reveals that large language models like GPT-4o-mini and HuatuoGPT-o1 exhibit significant flaws in clinical reasoning, including verbosity bias, hidden knowledg…

// co-occurs with top 2 entities