{"slug": "experience-with-dissimilar-language-ablation", "title": "Experience with dissimilar language ablation?", "summary": "A Hugging Face user is exploring dissimilar language ablation, aiming to remove Mandarin, Russian, and Arabic from a primarily Latin-based language model to free space for further training or pruning. They are creating a Swadesh-like list of noun-verb pairs across the four languages for this purpose.", "body_md": "[a13ph](https://discuss.huggingface.co/u/a13ph)\n\nAnyone here have experience with dissimilar language ablation? I’m thinking of ablating Mandarin/Russian/Arabic to leave a primarily Latin-based language model, in the hope of making some space for further training and/or safely pruning where English demonstrably has no activation.\n\nI’m presently creating a Swadesh-**esque** noun/verb (“thing”, “does”) pair list across the four languages, where each pair either has the same token count as every other pair or is padded to match.", "url": "https://wpnews.pro/news/experience-with-dissimilar-language-ablation", "canonical_source": "https://discuss.huggingface.co/t/experience-with-dissimilar-language-ablation/177196#post_1", "published_at": "2026-06-27 10:40:13+00:00", "updated_at": "2026-06-27 11:10:25.442870+00:00", "lang": "en", "topics": ["large-language-models", "natural-language-processing", "ai-research"], "entities": ["Hugging Face"], "alternates": {"html": "https://wpnews.pro/news/experience-with-dissimilar-language-ablation", "markdown": "https://wpnews.pro/news/experience-with-dissimilar-language-ablation.md", "text": "https://wpnews.pro/news/experience-with-dissimilar-language-ablation.txt", "jsonld": "https://wpnews.pro/news/experience-with-dissimilar-language-ablation.jsonld"}}