AIBullisharXiv โ CS AI ยท 9h ago7/10
๐ง
UniVid: Pyramid Diffusion Model for High Quality Video Generation
Researchers have developed UniVid, a new pyramid diffusion model that unifies text-to-video and image-to-video generation into a single system. The model uses dual-stream cross-attention mechanisms to process both text prompts and reference images, achieving superior temporal coherence across different video generation tasks.