AIBullisharXiv – CS AI · 9h ago7/10
🧠
Sword: Style-Robust World Models as Simulators via Dynamic Latent Bootstrapping for VLA Policy Post-Training
Researchers introduce Sword, a world model framework that improves Vision-Language-Action (VLA) models' ability to simulate environments for policy training. By addressing visual style sensitivity and error accumulation in long-horizon predictions, Sword demonstrates significant performance gains on the LIBERO benchmark, advancing the feasibility of training AI agents entirely within simulated environments.