04:00
2026-05-28
arxiv.org
large-language-models
Disentangling Language Roles in Multilingual LLM Task Execution
Researchers introduced MTM-Bench, a controlled benchmark that isolates three distinct language roles—instruction, content, and response—across English, Spanish, and Chinese to evaluate multilingual LL…