AIBullisharXiv – CS AI · 18h ago7/10
🧠
Reliable to Expressive: A Curriculum for Rubric-Following Safety Judges
Researchers developed a curriculum-based training method for safety judges that dramatically improves their consistency across different evaluation rubrics. The approach combines dynamic rubric generation with a staged learning process, achieving 94.12-94.88% accuracy with minimal variance across three different rubric styles, outperforming larger general-purpose and specialized LLMs.