04:00
2026-05-28
arxiv.org
large-language-models
Soro: A Lightweight Foundation Model and Chatbot for Tajik
Researchers have developed Soro, a family of Tajik-specialized conversational AI models built from Gemma 3 checkpoints and trained on a 1.9-billion-token Tajik corpus. The models outperform same-size โฆ