AINeutralarXiv – CS AI · Mar 24/106
🧠
Adaptive Correlation-Weighted Intrinsic Rewards for Reinforcement Learning
Researchers propose ACWI, a new reinforcement learning framework that dynamically balances intrinsic and extrinsic rewards through adaptive scaling coefficients. The system uses a lightweight Beta Network to optimize exploration in sparse reward environments, demonstrating improved sample efficiency and stability in MiniGrid experiments.