AINeutralarXiv – CS AI · 8h ago5/10
🧠
Dynamic Entropy Tuning in Reinforcement Learning Low-Level Quadcopter Control: Stochasticity vs Determinism
Researchers compare dynamic entropy tuning in stochastic reinforcement learning policies versus deterministic policies for quadcopter control, finding that dynamic entropy adjustment in the Soft Actor-Critic algorithm prevents catastrophic forgetting and improves exploration efficiency compared to static entropy or purely deterministic approaches using TD3.