04:00
2026-05-29
arxiv.org
large-language-models
Aryabhata 2: Scaling Reinforcement Learning for Advanced STEM Reasoning
Researchers have developed Aryabhata 2, a reasoning-focused language model for competitive STEM examinations, trained via reinforcement learning on PhysicsWallah's internal question banks. The model oโฆ