RightNow-Arabic-0.5B-Turbo: An Open Sub-1B Arabic Language Model via Vocabulary Injection and Edge-First Deployment

wpnews.pro


[ARTICLE · art-17169] src=arxiv.org ↗ pub=2026-05-29T04:00Z topic=large-language-models verified=true sentiment=↑ positive


RightNowAI released RightNow-Arabic-0.5B-Turbo, a 518M-parameter Arabic-specialized language model that beats every open model in its size class on three Arabic benchmarks (COPA-ar, Arabic HellaSwag, ArabicMMLU) and is small enough for edge deployment. The model is built by adding 27,032 Arabic tokens to the Qwen2.5-0.5B vocabulary and continuing pretraining on 504M Arabic tokens. It reaches 35.9% mean accuracy across the three benchmarks and ties Falcon-H1-1.5B on COPA-ar (58.4%) at one-third the size. The q4_k_m quantized build is 398 MB and runs at 635 tokens per second at batch size 1 on a single H100 via llama.cpp. All code, weights, and benchmark scripts are released openly.
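The paper's abstract names the technique used to add the new tokens as mean-subtoken initialization. A minimal sketch of that idea follows, assuming the usual reading: each new token's embedding row starts as the average of the embeddings of the pieces the original tokenizer would have split it into. The function name, the toy character-level tokenizer, and the two-dimensional embeddings are hypothetical illustrations, not the released pipeline.

```python
from typing import Callable, Dict, List

Vector = List[float]


def mean_subtoken_init(
    new_tokens: List[str],
    old_tokenize: Callable[[str], List[int]],
    old_embeddings: List[Vector],
) -> Dict[str, Vector]:
    """Return an initial embedding for each new token.

    Each embedding is the element-wise mean of the embeddings of the
    subtokens that the ORIGINAL tokenizer produces for that token.
    """
    dim = len(old_embeddings[0])
    init: Dict[str, Vector] = {}
    for token in new_tokens:
        ids = old_tokenize(token)
        if not ids:
            raise ValueError(f"original tokenizer produced no subtokens for {token!r}")
        init[token] = [
            sum(old_embeddings[i][d] for i in ids) / len(ids) for d in range(dim)
        ]
    return init


# Toy setup (hypothetical): a character-level "original" tokenizer
# with four Arabic letters and 2-dimensional embeddings.
toy_vocab = {"ك": 0, "ت": 1, "ا": 2, "ب": 3}
toy_embeddings = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [1.0, 3.0]]


def toy_tokenize(text: str) -> List[int]:
    return [toy_vocab[ch] for ch in text]


# The new whole-word token "كتاب" starts at the mean of its four letters.
new_rows = mean_subtoken_init(["كتاب"], toy_tokenize, toy_embeddings)
print(new_rows["كتاب"])  # [1.0, 1.5]
```

In a real model the same averaging would typically be applied to the rows appended to the input embedding matrix (and to the output head if it is untied), so that the new tokens begin near the representation the model already assigns to their pieces rather than at random values.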

1 min read · 8 views · published May 29, 2026

arXiv:2605.28827v1

Abstract: Open Arabic large language models split into two classes: sub-1B multilingual models that treat Arabic as an afterthought (Qwen2.5-0.5B, Falcon-H1-0.5B), and 7B-70B Arabic-specialized models that require a server to run (Jais, AceGPT, ALLaM, SILMA). The one published attempt at a sub-2B Arabic-specialized model, Kuwain-1.5B, never released its weights. We present RightNow-Arabic-0.5B-Turbo, a 518M-parameter Arabic-specialized decoder LLM built on Qwen2.5-0.5B. The pipeline adds 27,032 Arabic tokens via mean-subtoken initialization, continues pretraining on 504M Arabic tokens on 8xH100 with FSDP, FlashAttention varlen packing, and Liger fused kernels, then applies supervised fine-tuning on 129,116 Arabic instruction pairs with response-only loss masking, direct preference optimization on 6,750 Arabic preference pairs, and weight soup merging across three checkpoints. On three lm-evaluation-harness Arabic benchmarks (COPA-ar, Arabic HellaSwag, ArabicMMLU) the merged model reaches 35.9% mean accuracy, beats every same-class open model, ties Falcon-H1-1.5B on COPA-ar (58.4%) at one-third the size, and recovers 67% of SILMA-9B's mean at 1/18 the parameters. The edge build quantizes to 398 MB (q4_k_m) and delivers 635 tokens/s at batch size 1 on a single H100 via llama.cpp. All code (5,555 lines across 25 scripts), weights (bf16, int8, and four GGUF quantizations), and benchmark scripts are released at https://huggingface.co/RightNowAI/RightNow-Arabic-0.5B-Turbo.
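The abstract's last training step, weight soup merging across three checkpoints, can be illustrated with a small sketch. The abstract does not say whether the merge is uniform or weighted, so the code below assumes a uniform average of matching parameters; the `uniform_soup` helper, the parameter names, and the toy values are hypothetical.

```python
from typing import Dict, List

StateDict = Dict[str, List[float]]


def uniform_soup(checkpoints: List[StateDict]) -> StateDict:
    """Average matching parameters across checkpoints (uniform weights assumed)."""
    if not checkpoints:
        raise ValueError("need at least one checkpoint")
    keys = checkpoints[0].keys()
    for ckpt in checkpoints[1:]:
        if ckpt.keys() != keys:
            raise ValueError("checkpoints must share the same parameter names")
    n = len(checkpoints)
    merged: StateDict = {}
    for name in keys:
        tensors = [ckpt[name] for ckpt in checkpoints]
        if len({len(t) for t in tensors}) != 1:
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Element-wise mean over the n checkpoints.
        merged[name] = [sum(vals) / n for vals in zip(*tensors)]
    return merged


# Three toy checkpoints with flattened parameters (hypothetical values).
ckpt_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
ckpt_b = {"layer.weight": [2.0, 4.0], "layer.bias": [3.0]}
ckpt_c = {"layer.weight": [3.0, 6.0], "layer.bias": [6.0]}

soup = uniform_soup([ckpt_a, ckpt_b, ckpt_c])
print(soup)  # {'layer.weight': [2.0, 4.0], 'layer.bias': [3.0]}
```

Averaging in weight space only makes sense when the checkpoints share an architecture and a common starting point, which is why the sketch rejects mismatched parameter names or shapes instead of silently truncating.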

source & further reading

Read original on arxiv.org → arxiv.org/abs/2605.28827


