Experience with dissimilar language ablation?

wpnews.pro

cd /news/large-language-models/experience-with-dissimilar-language-… · home › topics › large-language-models › article

[ARTICLE · art-41762] src=discuss.huggingface.co ↗ pub=2026-06-27T10:40Z topic=large-language-models verified=true sentiment=· neutral

Experience with dissimilar language ablation?

A Hugging Face user is exploring dissimilar language ablation, aiming to remove Mandarin, Russian, and Arabic from a primarily Latin-based language model to free space for further training or pruning. They are creating a Swadesh-like list of noun-verb pairs across the four languages for this purpose.

read1 min views1 publishedJun 27, 2026

a13ph 1 Anyone here have experience with dissimilar language ablation? I’m thinking of ablating Mandarin/Russian/Arabic to leave a primarily Latin-base language model with the hope of making some space for further training and/or safely pruning where English demonstrably has no activation.

I’m presently creating a Swadesh-esque noun/verb (“thing”,“does”) pair list across the four languages where each pair either token-matches every other pair or gets padded to match, if necessary.

source & further reading

discuss.huggingface.co — original article Rakarrack-0.6.1 port making progress! ( AI assisted ) Cloud Storage Poll Welcome to Haiku basic(Haiku Docs, Haiku slide and Haiku sheets)

~/api · this article 200

$curl api.wpnews.pro/v1/news/experience-with-dissimil…