OpenBMB Runs Local Agents with MiniCPM5-1B

OpenBMB released MiniCPM5-1B, a 1.08-billion-parameter Transformer model designed for on-device deployment. It supports context lengths of up to 131,072 tokens and ships with a built-in thinking chat template. The model can run local agents on phones and shows strengths in agentic tool use and code generation, though it struggles with logic traps. The release lowers the barrier to prototyping private, offline assistants without cloud dependencies, but its reliability limits in complex reasoning mean outputs should be treated as opportunistic rather than authoritative.

OpenBMB released MiniCPM5-1B, a dense 1.08-billion-parameter Transformer designed for on-device deployment, according to the model card on Hugging Face. The model supports long contexts of up to 131,072 tokens and includes a built-in "