AIBullisharXiv โ CS AI ยท 7h ago7/10
๐ง
Less is More: Data-Efficient Adaptation for Controllable Text-to-Video Generation
Researchers demonstrate a data-efficient fine-tuning method for text-to-video diffusion models that enables new generative controls using sparse, low-quality synthetic data rather than expensive, photorealistic datasets. Counterintuitively, models trained on simple synthetic data outperform those trained on high-fidelity real data, supported by both empirical results and theoretical justification.