AINeutralarXiv – CS AI · May 297/10
🧠Researchers establish a mathematical framework connecting neural network training to Hamilton-Jacobi partial differential equations, showing that gradient descent searches through solutions to viscous PDEs. This theoretical unification applies across major architectures including residual networks and transformers, with implications for understanding generalization, adversarial robustness, and interpretability.
AIBullisharXiv – CS AI · May 17/10
🧠Researchers introduce Flow Map Reward Guidance (FMRG), a novel training-free method for guiding generative models toward user-specified objectives using optimal control theory. The approach achieves comparable or superior results to existing baselines while requiring only 3 neural function evaluations, representing a 10x+ speedup over prior methods.
AIBullisharXiv – CS AI · Mar 56/10
🧠Researchers have developed a new training-free framework for reward-guided image editing using diffusion models. The approach treats image editing as a trajectory optimal control problem, allowing for better preservation of source image content while enhancing target rewards compared to existing methods.
AINeutralarXiv – CS AI · Mar 37/104
🧠Researchers developed a novel algorithm using topological derivatives to automatically determine where and how to add new layers to neural networks during training. The approach uses mathematical principles from optimal control theory and topology optimization to adaptively grow network architecture, showing superior performance compared to baseline networks and other adaptation strategies.
AINeutralarXiv – CS AI · 1d ago6/10
🧠Researchers propose LA-LQR, an optimal control framework that uses activation steering to safely guide text-to-video model outputs toward desired behaviors while minimizing visual quality loss. By projecting high-dimensional video activations onto low-dimensional task-relevant subspaces and applying closed-loop feedback interventions, the method achieves better safety outcomes than existing steering approaches without heavy-handed oversteering.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers demonstrate that optimal control in Markov decision processes with catastrophic failure states naturally produces prospect-theory-like behaviors—including S-shaped value functions and loss aversion—without requiring utility curvature or probability weighting. The mechanism emerges purely from the mathematical structure of Bellman optimality when agents face absorbing failure states, with results validated across 495 configurations and multiple learning paradigms.
AINeutralarXiv – CS AI · May 296/10
🧠Researchers present a systematic review of Data-Driven Optimal Control (DDOC), a framework that integrates machine learning with traditional control theory for autonomous driving motion planning. The approach aims to bridge the gap between rule-based systems' safety guarantees and learning-based methods' adaptability, proposing implementation across three dimensions: customization, dynamics adaptation, and self-tuning.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers develop a hybrid neural network approach for solving Hamilton-Jacobi-Bellman equations in continuous-time reinforcement learning, combining physics-informed neural solvers with stabilized finite-difference methods. The work provides rigorous error analysis separating residual, policy, and model-identification errors, with experimental validation across multiple control benchmarks.
AINeutralarXiv – CS AI · May 76/10
🧠Researchers demonstrate that reinforcement learning with overcomplete sparse image codes can efficiently solve optimal control tasks orders of magnitude larger than traditional methods, without requiring deep learning. The work formalizes vision-based control as a reinforcement learning problem and provides theoretical justification for why efficient image representations enable scalable policy learning.
AIBullisharXiv – CS AI · Apr 146/10
🧠Researchers introduce SODACER, a reinforcement learning framework combining dual-buffer experience replay with Control Barrier Functions to enable safe optimal control of nonlinear systems. The approach demonstrates improved convergence and sample efficiency while maintaining safety constraints, with potential applications in robotics, healthcare, and large-scale optimization.