AINeutralarXiv – CS AI · 7h ago6/10
🧠
Stability Analysis of Sharpness-Aware Minimization
Researchers reveal that Sharpness-Aware Minimization (SAM), a popular deep learning training method, has convergence instability near saddle points and may actually escape saddle points more poorly than standard gradient descent. The study demonstrates that momentum and batch-size adjustments are critical for mitigating these instabilities and achieving strong generalization performance.