AIBullisharXiv – CS AI · 18h ago7/10
🧠
Internalizing Geometric Law: Learning from Solver Residuals for Precision-Critical Generation
Researchers introduce PyGeoX, a geometric constraint solver and benchmark that addresses hallucination problems in large language models for precision-critical tasks like technical design. They identify a failure mode called Outlier Gradient Masking in standard reward schemes and propose Saturating Additive Rewards (SAR) to improve constraint satisfaction, achieving 2.3x performance gains on hard problems.