Multilingual Steering by Design: Multilingual Sparse Autoencoders and Principled Layer Selection


Source: arxiv.org · Published: 2026-05-25T04:00Z · Topic: large-language-models


Researchers have developed a principled method for multilingual language steering in large language models using sparse autoencoders (SAEs), addressing the unreliability of existing English-only SAE approaches. By training SAEs on multilingual data and introducing a layer-selection rule based on the intersection of multilingual alignment and language separability, the team achieved more reliable language control across models like LLaMA-3.1-8B and Gemma-2-9B. The approach stabilizes the trade-off between language identification accuracy and generation quality, offering a predictive framework for multilingual SAE steering without exhaustive layer searches.

1 min read · published May 25, 2026

arXiv:2605.23036v1. Abstract: Sparse autoencoders (SAEs) enable feature-level mechanistic interpretability and activation steering in large language models (LLMs), but SAE-based language control remains unreliable in multilingual settings: most SAEs are trained on English-only data, and steering layers are chosen heuristically. We address these limitations by advancing a principled, mechanistic account of multilingual language steering with SAEs. First, we show that training SAEs on multilingual data consistently strengthens cross-lingual representations and yields more reliable, quality-preserving language control across layers and model families. Second, we introduce an a priori steering layer-selection rule based on the intersection of multilingual alignment and language separability, which predicts effective intervention depths without exhaustive layerwise search. We evaluate our approach on LLaMA-3.1-8B and Gemma-2-9B across machine translation and cross-lingual summarization (CrossSumm), using SpBLEU, ROUGE-L, COMET, and LaSE. Our results show that multilingual SAEs combined with intersection-selected layers stabilize the trade-off between language identification accuracy and generation quality, providing a principled, predictive, representation-level account of multilingual SAE steering.
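The abstract does not spell out how the intersection rule is computed, so the following is only an illustrative sketch of one way such a rule could look, not the authors' method. It assumes each layer already has two precomputed scores, one for multilingual alignment and one for language separability, and keeps the layers that rank in the top k on both. The function names, the top-k criterion, the tie-breaking by the weaker score, and the toy numbers are all hypothetical.

```python
from typing import Dict, List, Set


def top_k_layers(scores: Dict[int, float], k: int) -> Set[int]:
    """Return the indices of the k highest-scoring layers."""
    return set(sorted(scores, key=scores.get, reverse=True)[:k])


def select_steering_layers(
    alignment: Dict[int, float],
    separability: Dict[int, float],
    k: int = 3,
) -> List[int]:
    """Pick layers that rank in the top k on BOTH metrics.

    Assumption: both dicts map layer index -> score, higher is better.
    How the paper actually measures either quantity is not stated in
    the abstract; these inputs are placeholders.
    """
    if alignment.keys() != separability.keys():
        raise ValueError("both metrics must cover the same layers")
    candidates = top_k_layers(alignment, k) & top_k_layers(separability, k)
    # Order candidates by their weaker score, so a layer that is strong
    # on both metrics comes before one that is lopsided.
    return sorted(
        candidates,
        key=lambda layer: min(alignment[layer], separability[layer]),
        reverse=True,
    )


# Made-up toy scores for a 6-layer model (NOT results from the paper).
toy_alignment = {0: 0.2, 1: 0.5, 2: 0.8, 3: 0.9, 4: 0.7, 5: 0.3}
toy_separability = {0: 0.1, 1: 0.3, 2: 0.6, 3: 0.75, 4: 0.9, 5: 0.95}

chosen = select_steering_layers(toy_alignment, toy_separability, k=3)
print(chosen)
```

With these toy scores, alignment favors the middle layers and separability favors the later ones, so only the layers where the two top-k sets overlap are returned. An empty result would mean no layer is strong on both metrics at the chosen k.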

source & further reading


Read original on arxiv.org → arxiv.org/abs/2605.23036

