AIBullisharXiv – CS AI · 7h ago6/10
🧠
Drift Q-Learning
Researchers propose DriftQL, a new offline reinforcement learning method that combines drift-based behavioral regularization with critic-driven policy improvement to outperform diffusion and flow-based policies. The approach achieves single forward-pass inference while maintaining robustness under degraded data quality, advancing state-of-the-art performance on standard benchmarks.