04:00
2026-05-25
arxiv.org
large-language-models
Multilingual Steering by Design: Multilingual Sparse Autoencoders and Principled Layer Selection
Researchers have developed a principled method for multilingual language steering in large language models using sparse autoencoders (SAEs), addressing the unreliability of existing English-only SAE aโฆ