AIBullisharXiv – CS AI · 14h ago6/10
🧠
Aryabhata 2: Scaling Reinforcement Learning for Advanced STEM Reasoning
Aryabhata 2 is a specialized language model designed for competitive STEM examinations that uses reinforcement learning to improve reasoning capabilities while reducing computational output by up to 64%. Trained on PhysicsWallah's question banks, it outperforms its base model on JEE and NEET exams, addressing the practical challenge of deploying AI at scale for educational applications.