04:00
2026-05-26
arxiv.org
large-language-models
Direct Preference Optimization for English-Mandarin Code-Switching Speech Recognition in Audio LLMs
Researchers have identified three systematic failure modes in Audio LLMs when transcribing English-Mandarin code-switching speech: language omission, translation-instead-of-transcription, and hallucinβ¦