AI · Neutral · arXiv – CS AI · 9h ago · 6/10
Towards Unified Song Generation and Singing Voice Conversion with Accompaniment Co-Generation
Researchers introduce UniSinger, an AI framework that unifies song generation with singing voice conversion by enabling zero-shot speaker cloning and accompaniment co-generation. The system uses a multimodal diffusion transformer with curriculum learning to simultaneously handle vocal timbre control and musical accompaniment, advancing generative music production capabilities.