AINeutralarXiv – CS AI · 14h ago7/10
🧠
The Hamilton-Jacobi Theory of Deep Learning
Researchers establish a mathematical framework connecting neural network training to Hamilton-Jacobi partial differential equations, showing that gradient descent searches through solutions to viscous PDEs. This theoretical unification applies across major architectures including residual networks and transformers, with implications for understanding generalization, adversarial robustness, and interpretability.