AIBullisharXiv โ CS AI ยท Feb 277/106
๐ง
Dual-IPO: Dual-Iterative Preference Optimization for Text-to-Video Generation
Researchers introduce Dual-Iterative Preference Optimization (Dual-IPO), a new method that iteratively improves both reward models and video generation models to create higher-quality AI-generated videos better aligned with human preferences. The approach enables smaller 2B parameter models to outperform larger 5B models without requiring manual preference annotations.