Soft Token Alignment for Cross-Lingual Reasoning

Researchers propose SOLAR, an auxiliary objective for supervised fine-tuning that aligns soft-token representations across languages, using English as a pivot, to improve cross-lingual reasoning in multilingual large language models. Across four multilingual reasoning benchmarks, SOLAR improves accuracy by up to +17.7 points over the base model and +3.8 points over standard supervised fine-tuning, with the largest gains on low-resource languages.

Published Jun 26, 2026

arXiv:2606.26466v1

Abstract: Multilingual large language models often produce inconsistent reasoning and answers for semantically equivalent prompts in different languages. Prior work suggests that intermediate representations can be relatively language-agnostic, but generation becomes increasingly language-specific as models commit to discrete output tokens. This is problematic because language-specific lexical choices can cause semantically equivalent reasoning paths to diverge across languages. These divergences motivate searching for a cross-lingual alignment signal that is less tied to any single vocabulary item or script. We propose SOLAR, an auxiliary objective for supervised fine-tuning that aligns soft-token representations across languages, using English as a pivot. Soft tokens are probability-weighted mixtures over the vocabulary embeddings, yielding continuous representations that can aggregate information from semantically related tokens across languages. We then align each non-English soft-token summary to its English counterpart in the shared embedding space. Across four multilingual reasoning benchmarks, SOLAR improves accuracy by up to +17.7 points over the base model and +3.8 over standard supervised fine-tuning, with the largest gains on low-resource languages. SOLAR also strengthens final-layer cross-lingual similarity and substantially reduces language-cluster separability, suggesting that aligning soft-token representations helps preserve shared semantic structure during multilingual reasoning.
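
The abstract describes soft tokens as probability-weighted mixtures over the vocabulary embeddings, with each non-English soft-token summary aligned to its English counterpart. A minimal NumPy sketch of that idea is given here for illustration; it is not the authors' implementation. The function names, the mean pooling used to form a per-sequence summary, and the cosine-distance loss are all assumptions, since the abstract does not specify how the summary or the alignment term is computed.

```python
import numpy as np


def softmax(logits, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def soft_tokens(logits, embedding_matrix):
    """Map (seq_len, vocab) logits to (seq_len, dim) soft tokens.

    Each soft token is the probability-weighted mixture of all vocabulary
    embeddings, as described in the abstract.
    """
    probs = softmax(logits, axis=-1)
    return probs @ embedding_matrix


def soft_token_summary(logits, embedding_matrix):
    """ASSUMPTION: summarize a sequence by mean-pooling its soft tokens."""
    return soft_tokens(logits, embedding_matrix).mean(axis=0)


def alignment_loss(non_english_logits, english_logits, embedding_matrix):
    """ASSUMPTION: cosine distance between the non-English summary and
    the English (pivot) summary in the shared embedding space."""
    a = soft_token_summary(non_english_logits, embedding_matrix)
    b = soft_token_summary(english_logits, embedding_matrix)
    cos = a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8)
    return 1.0 - cos
```

In a fine-tuning setup, a term like this would presumably be added to the standard supervised loss with some weighting coefficient; that weighting is also an assumption here, not a detail taken from the abstract.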
