🧠 AI⚪ NeutralImportance 4/10

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning

arXiv – CS AI|Ruhan Wang, Yu Yang, Zhishuai Liu, Dongruo Zhou, Pan Xu|March 3, 2026 at 05:00 AM|2 views

🤖AI Summary

Researchers introduce Return Augmented (REAG) method for Decision Transformer frameworks to improve offline reinforcement learning when training data comes from different dynamics than the target domain. The method aligns return distributions between source and target domains, with theoretical analysis showing it achieves optimal performance levels despite dynamics shifts.

Key Takeaways

→REAG method addresses dynamics shift problems in offline reinforcement learning by augmenting returns rather than rewards.
→The approach specifically targets Decision Transformer frameworks that predict actions based on desired returns and trajectory history.
→Theoretical analysis proves REAG achieves the same suboptimality level as if there were no dynamics shift.
→Two practical implementations REAG_Dara and REAG_MV were developed and tested.
→Experiments on D4RL datasets show consistent performance improvements across various DT-type baselines.