AI · Bullish · arXiv – CS AI · 8h ago · 7/10
SPIRAL: Learning to Search and Aggregate
Researchers introduce SPIRAL, a reinforcement learning framework that trains language models to leverage sequential reasoning, parallel sampling, and trace aggregation during inference. The approach demonstrates superior scaling efficiency compared to existing methods, achieving 11× better compute scaling and 15% higher performance on reasoning tasks.
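To make "parallel sampling" and "trace aggregation" concrete, here is a minimal sketch of the generic sample-then-aggregate pattern at inference time. It is not SPIRAL's method: the summary does not describe the training procedure or the aggregator, so majority voting is swapped in as a simple stand-in, and sequential reasoning is not modeled. The names `toy_generate`, `sample_traces`, and `aggregate_majority` are hypothetical and exist only for illustration.

```python
import random
from collections import Counter
from typing import Callable, List


def toy_generate(prompt: str, rng: random.Random) -> str:
    """Hypothetical stand-in for a language model call: a noisy solver
    that answers "42" most of the time and is off by one otherwise."""
    return "42" if rng.random() < 0.6 else rng.choice(["41", "43"])


def sample_traces(
    generate: Callable[[str, random.Random], str],
    prompt: str,
    n: int,
    seed: int = 0,
) -> List[str]:
    """Draw n independent candidate answers for the same prompt.
    A real system would issue these calls in parallel; a loop keeps
    the sketch simple."""
    rng = random.Random(seed)
    return [generate(prompt, rng) for _ in range(n)]


def aggregate_majority(answers: List[str]) -> str:
    """Aggregate by majority vote (an assumption, not the paper's
    aggregator). Ties go to the answer seen first."""
    return Counter(answers).most_common(1)[0][0]


if __name__ == "__main__":
    traces = sample_traces(toy_generate, "What is 6 * 7?", n=16)
    print(traces)
    print("aggregated answer:", aggregate_majority(traces))
```

Raising `n` spends more inference compute on the same question. According to the summary, SPIRAL trains the model to use that extra compute more efficiently than existing methods.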