IMCBench

mentions: 1 · type: Organization

// recent coverage: 1 mention

2026-06-30 04:00 · arxiv.org · large-language-models

IMCBench: A benchmark for multimodal LLMs in Image-grounded Medical Conversations

Researchers introduced IMCBench, a benchmark for multimodal LLMs in image-grounded medical conversations, evaluating eight models across four families. Claude Opus 4.6 achieved the highest overall sco…

// co-occurs with top 7 entities

Claude Opus 4.6 (1), Claude Sonnet 4.6 (1), GPT-5.2 (1), Claude (1), GPT (1), Nova (1), Llama (1)