
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

arXiv – CS AI | DeepReinforce Team, Xiaoya Li, Xiaofei Sun, Guoyin Wang, Songqiao Su, Chris Shum, Jiwei Li
🤖 AI Summary

GrandCode, a new multi-agent reinforcement learning system, has become the first AI to consistently defeat all human competitors in live competitive programming contests, placing first in three consecutive Codeforces competitions in March 2026. The result is presented as evidence that AI has now surpassed even the strongest human programmers on the most challenging coding tasks.

Key Takeaways
  • GrandCode is the first AI system to consistently beat all human participants in live competitive programming contests.
  • The system placed first in three consecutive Codeforces competitions in March 2026, defeating legendary grandmasters.
  • GrandCode uses a multi-agent RL approach with specialized modules for hypothesis proposal, solving, test generation, and summarization.
  • The breakthrough introduces Agentic GRPO, a new technique designed for multi-stage agent rollouts with delayed rewards.
  • This achievement marks AI surpassing human capability in one of the last remaining strongholds of human coding superiority.
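
The summary does not spell out how Agentic GRPO assigns credit, so the following is only a minimal sketch of the general idea, not the authors' method. It uses the standard GRPO (Group Relative Policy Optimization) step of normalizing rewards within a group of rollouts, then copies each rollout's delayed end-of-episode advantage to every stage. The stage names come from the takeaways above; `Rollout`, `group_relative_advantages`, `per_stage_advantages`, and the pass/fail reward are hypothetical names and assumptions introduced for illustration.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import Dict, List

# Stage names follow the takeaways above; everything else here is an assumption.
STAGES = ["hypothesis", "solve", "test_generation", "summarize"]


@dataclass
class Rollout:
    """One multi-stage attempt at a problem; the reward arrives only at the end."""
    stage_outputs: List[str]  # one output per stage, in STAGES order
    final_reward: float       # assumed: 1.0 if the submission passed, else 0.0


def group_relative_advantages(rewards: List[float], eps: float = 1e-8) -> List[float]:
    """Standard GRPO-style normalization: (reward - group mean) / group std."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def per_stage_advantages(group: List[Rollout]) -> List[Dict[str, float]]:
    """Broadcast each rollout's delayed advantage to all of its stages."""
    advantages = group_relative_advantages([r.final_reward for r in group])
    return [{stage: a for stage in STAGES} for a in advantages]


if __name__ == "__main__":
    # Four hypothetical rollouts on the same problem: two pass, two fail.
    group = [Rollout(["h", "s", "t", "m"], reward) for reward in (1.0, 0.0, 0.0, 1.0)]
    for rollout, stage_adv in zip(group, per_stage_advantages(group)):
        print(rollout.final_reward, stage_adv)
```

Broadcasting one advantage to every stage is the simplest way to handle a reward that only arrives after the last stage. A real system could weight stages differently, and the summary does not say which choice GrandCode makes.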