AI · Bullish · arXiv CS AI · 14h ago · 7/10
Grounded World Model for Semantically Generalizable Planning
Researchers propose the Grounded World Model (GWM), an approach to visuomotor planning that aligns a world model with vision-language embeddings instead of requiring explicit goal images. The method achieves 87% success on unseen tasks, versus 22% for traditional vision-language-action (VLA) models, demonstrating stronger semantic generalization in robotics and embodied AI applications.