AI · Neutral · arXiv – CS AI · 18h ago · 6/10
Structure-Conditioned Actor-Critic Branches for Quality-Diversity Reinforcement Learning
Researchers introduce SV-QD-RL, a quality-diversity reinforcement learning framework that generates diverse policy repertoires by conditioning actor networks on learned structural masks and pairing them with branch-specific critics. The authors report improved performance on continuous control tasks, with behavioral diversity maintained through structure-aware archive management.
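
To make the two ideas in the summary concrete, here is a minimal Python sketch of a mask-conditioned actor and a structure-aware archive. It is an illustration under stated assumptions, not the paper's implementation: the summary does not say how masks are learned or applied, so the sketch assumes a fixed binary mask over hidden units, and it assumes the archive keeps one elite per (mask, behavior cell) pair in the style of MAP-Elites. The names `masked_actor` and `StructureAwareArchive` are hypothetical, and branch-specific critics are not modeled.

```python
import math


def masked_actor(obs, w_in, w_out, mask):
    """One-hidden-layer actor; `mask` (0/1 per hidden unit) gates the hidden layer.

    Assumption: the structural mask acts on hidden units. The source does not
    specify where the mask is applied.
    """
    hidden = [
        m * math.tanh(sum(w * o for w, o in zip(row, obs)))
        for row, m in zip(w_in, mask)
    ]
    return [math.tanh(sum(w * h for w, h in zip(row, hidden))) for row in w_out]


class StructureAwareArchive:
    """Keeps the best-scoring entry per (mask, behavior cell) pair.

    Assumption: behavior descriptors lie in [0, 1] per dimension and are
    discretized on a uniform grid, as in a MAP-Elites archive.
    """

    def __init__(self, cells_per_dim):
        self.cells_per_dim = cells_per_dim
        self.elites = {}  # (mask, cell) -> (fitness, params)

    def _key(self, mask, descriptor):
        # Clamp so that a descriptor of exactly 1.0 falls in the last cell.
        cell = tuple(
            min(int(d * self.cells_per_dim), self.cells_per_dim - 1)
            for d in descriptor
        )
        return (tuple(mask), cell)

    def try_insert(self, mask, descriptor, fitness, params):
        """Insert if the cell is empty or the new fitness is strictly higher."""
        key = self._key(mask, descriptor)
        best = self.elites.get(key)
        if best is None or fitness > best[0]:
            self.elites[key] = (fitness, params)
            return True
        return False


# Usage: two masks with the same behavior descriptor occupy separate cells.
archive = StructureAwareArchive(cells_per_dim=4)
archive.try_insert((1, 0), (0.25, 0.75), fitness=1.0, params="policy-a")
archive.try_insert((0, 1), (0.25, 0.75), fitness=0.2, params="policy-b")
```

Keying the archive on the mask as well as the behavior cell is one simple way to stop a single structure from crowding out the others. Whether SV-QD-RL manages its archive this way is not stated in the summary.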