MTM-Bench

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

04:00

2026-05-28

arxiv.org

large-language-models

Disentangling Language Roles in Multilingual LLM Task Execution

Researchers introduced MTM-Bench, a controlled benchmark that isolates three distinct language roles—instruction, content, and response—across English, Spanish, and Chinese to evaluate multilingual LL…

// co-occurs with top 1 entities

arXiv 1