22:52
2026-06-18
massgeneralbrigham.org
large-language-models
New benchmark evaluates AI for everyday patient care
Mass General Brigham researchers developed BRIDGE, a multilingual benchmark that evaluates large language models on real-world clinical tasks, revealing significant gaps between AI performance on mediβ¦