04:00
2026-06-30
arxiv.org
large-language-models
IMCBench: A benchmark for multimodal LLMs in Image-grounded Medical Conversations
Researchers introduced IMCBench, a benchmark for multimodal LLMs in image-grounded medical conversations, evaluating eight models across four families. Claude Opus 4.6 achieved the highest overall scoโฆ