AIBullisharXiv – CS AI · 8h ago7/10
🧠
Invariant Gradient Alignment for Robust Reasoning Distillation
Researchers introduce Invariant Gradient Alignment (IGA), a training framework that improves how large language models generalize to out-of-distribution inputs by aligning gradient updates across semantically diverse but logically equivalent problems. The method achieves up to 14.3 percentage point accuracy improvements over standard approaches and demonstrates a fourfold improvement in logical consistency, addressing a fundamental limitation in knowledge distillation pipelines.