#adaptive-training News & Analysis

2 articles tagged with #adaptive-training. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · Jun 9 · 6/10

Adaptive Loss Balancing for Noise-Robust GRPO in Generative Recommendation

Researchers introduce AdaGRPO, a reinforcement learning framework for generative recommendation that applies reward signals selectively rather than uniformly, addressing the problem of noisy reward models trained on biased data. The approach combines supervised learning with adaptive gating mechanisms and reports significant improvements in e-commerce recommendation metrics and production performance.

AI · Bullish · arXiv – CS AI · Jun 5 · 6/10

Scalable Reinforcement Learning via Adaptive Batch Scaling

Researchers propose Adaptive Batch Scaling (ABS), a technique that adjusts batch size dynamically during reinforcement learning training by measuring policy stability with a novel 'Behavioral Divergence' metric. The work challenges the conventional belief that large batches are incompatible with RL, reporting that larger networks paired with larger batch sizes can achieve superior performance when the batch size adapts to the stability of each training phase.