cd/entity/MTM-Bench· home entities MTM-Bench
grep -l @mtm-bench /news/*.json | wc -l → 1

@MTM-Bench

mentions 1 type Organization feed RSS
04:00
2026-05-28
arxiv.org
large-language-models

Disentangling Language Roles in Multilingual LLM Task Execution

Researchers introduced MTM-Bench, a controlled benchmark that isolates three distinct language roles—instruction, content, and response—across English, Spanish, and Chinese to evaluate multilingual LL…

// co-occurs with top 1 entities