AIBullisharXiv – CS AI · 6h ago6/10
🧠
Verifier-Backed Hard Problem Generation for Mathematical Reasoning
Researchers introduce VHG, a verifier-enhanced framework that improves how large language models generate valid and challenging mathematical problems through three-party self-play involving a setter, solver, and independent verifier. The approach addresses critical limitations in existing problem generation methods by constraining reward signals to ensure both problem validity and difficulty, demonstrating substantial improvements over baseline approaches.