AIBullisharXiv โ CS AI ยท 5h ago
๐ง
Perfect score on IPhO 2025 theory by Gemini agent
Google's Gemini 3.1 Pro Preview achieved a perfect score on IPhO 2025 theory problems across five runs, surpassing previous AI performance that fell behind top human contestants. However, the researchers acknowledge potential data contamination since the model was released after the competition.
๐ง Gemini