AINeutralarXiv – CS AI · 9h ago5/10
🧠
Drifting Field Policy: A One-Step Generative Policy via Wasserstein Gradient Flow
Researchers introduce Drifting Field Policy (DFP), a one-step generative policy that uses Wasserstein gradient flow to optimize reinforcement learning without ODE-based approaches. DFP demonstrates state-of-the-art performance on robotic manipulation tasks, suggesting a potential shift in how generative models are applied to control problems.