AI · Neutral · arXiv – CS AI · 7h ago · 5/10
Geometrically Averaged Hard Target Updates for Linear Q-Learning
Researchers introduce λ-target updates, a mechanism that geometrically averages periodic hard target updates in linear Q-learning to improve stability. The theoretical work bridges traditional periodic target updates and continuous projected Q-value iteration, with potential applications to optimization in reinforcement learning.