AI · Bullish · arXiv – CS AI · 6h ago
Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models
Researchers introduce Mix-GRM, a framework for Generative Reward Models that improves the evaluation of AI outputs by combining breadth and depth reasoning mechanisms. The system reportedly outperforms leading open-source models by 8.2%, using structured Chain-of-Thought reasoning tailored to specific task types.