AI · Bullish · arXiv – CS AI · 9h ago · 6/10
Scalable Option Learning in High-Throughput Environments
Facebook Research introduces Scalable Option Learning (SOL), a hierarchical reinforcement learning algorithm that achieves roughly 35x higher throughput than existing hierarchical methods. The system was validated on complex environments including NetHack, where hierarchical agents trained on 30 billion frames of experience outperformed flat (non-hierarchical) agents. The result suggests that hierarchical RL can finally benefit from large-scale training.