y0news

#policy-improvement News & Analysis

2 articles tagged with #policy-improvement. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · Apr 13 · 7/10

Advantage-Guided Diffusion for Model-Based Reinforcement Learning

Researchers propose Advantage-Guided Diffusion (AGD-MBRL), a novel approach that improves model-based reinforcement learning by using advantage estimates to guide diffusion models during trajectory generation. The method addresses the short-horizon myopia problem in existing diffusion-based world models and demonstrates 2x performance improvements over current baselines on MuJoCo control tasks.

AI · Neutral · arXiv – CS AI · 3d ago · 6/10

SPAR: Support-Preserving Action Rectification

Researchers introduce SPAR (Support-Preserving Action Rectification), a new offline reinforcement learning method that addresses the fundamental tension between maximizing value and staying within the support of the training data. By anchoring policy improvements to a frozen behavior-cloning policy and operating in residual space, SPAR achieves state-of-the-art results on D4RL benchmarks while maintaining data distribution fidelity.