AIBullisharXiv โ CS AI ยท 6h ago5
๐ง
SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer
Researchers developed Score Matched Actor-Critic (SMAC), a new offline reinforcement learning method that enables smooth transition to online RL algorithms without performance drops. SMAC achieved successful transfer in all 6 D4RL tasks tested and reduced regret by 34-58% in 4 of 6 environments compared to best baselines.