AINeutralarXiv – CS AI · 5h ago6/10
🧠
CHDP: Cooperative Hybrid Diffusion Policies for Reinforcement Learning in Parameterized Action Space
Researchers propose CHDP (Cooperative Hybrid Diffusion Policies), a novel reinforcement learning framework that addresses the challenge of optimizing hybrid action spaces combining discrete and continuous parameters. The method employs two cooperative agents with separate diffusion policies and achieves up to 19.3% performance improvement over existing approaches in robot control and game AI applications.