AI · Bullish · arXiv – CS AI · 15h ago · 6/10
Olaf-World: Orienting Latent Actions for Video World Modeling
Researchers introduce Olaf-World, an approach to training action-controllable video world models that targets a known weakness: learned latent actions often fail to transfer across contexts. By anchoring latent actions to observable semantic effects rather than relying on scarce labeled data, the method reportedly achieves stronger zero-shot transfer and more efficient adaptation to new control interfaces.