11 articles tagged with #music-generation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AI Bearish · arXiv – CS AI · Feb 27 · 7/10
🎧 Researchers discovered a vulnerability in AI music and video generation systems where phonetic prompts can bypass copyright filters. The 'Adversarial PhoneTic Prompting' attack achieves 91% similarity to copyrighted content by using sound-alike phrases that preserve acoustic patterns while evading text-based detection.
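A toy sketch of the general idea: a filter that matches lyric text exactly can be slipped past by a sound-alike rewrite that keeps the acoustic shape. The blocklist, homophone map, and function names here are hypothetical illustrations, not the paper's method.

```python
# Toy illustration (hypothetical, not the paper's attack): a naive text
# filter blocks exact lyric substrings, but a sound-alike rewrite evades
# it while preserving the phonetic pattern a music model would reproduce.

BLOCKLIST = ["we will rock you"]  # assumed example phrase

def naive_filter(prompt: str) -> bool:
    """Return True if the prompt is blocked by exact substring matching."""
    return any(phrase in prompt.lower() for phrase in BLOCKLIST)

# Hypothetical sound-alike substitutions that keep the acoustic pattern.
HOMOPHONES = {"we": "wee", "will": "wil", "rock": "rok", "you": "ewe"}

def phonetic_rewrite(prompt: str) -> str:
    """Replace each word with a homophone when one is available."""
    return " ".join(HOMOPHONES.get(w, w) for w in prompt.lower().split())

original = "we will rock you"
evasive = phonetic_rewrite(original)
print(naive_filter(original))  # True  (blocked by the text filter)
print(naive_filter(evasive))   # False (sound-alike version passes)
```

The point of the demo is that the evasive prompt is textually unrelated to the blocked phrase yet phonetically nearly identical, which is why text-based detection alone is insufficient.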
AI Bearish · The Verge – AI · Apr 5 · 6/10
🎧 AI music platform Suno's copyright filters can be easily bypassed with minimal effort, allowing users to generate AI imitations of popular songs from artists like Beyoncé, Black Sabbath, and Aqua. Despite Suno's policy prohibiting copyrighted material use, the platform's detection system proves inadequate at preventing copyright infringement.
AI Bullish · Google DeepMind Blog · Feb 18 · 6/10
🎧 Google's Gemini app has integrated Lyria 3, its most advanced music generation model, allowing users to create 30-second music tracks from text or image inputs. This feature democratizes music creation by making AI-powered composition accessible to anyone through the Gemini interface.
AI Neutral · OpenAI News · Apr 30 · 6/10
🎧 A new neural network called Jukebox has been introduced that can generate music and rudimentary singing as raw audio across various genres and artist styles. The developers are releasing the model weights, code, and exploration tools to the public.
AI Bullish · OpenAI News · Apr 25 · 6/10
🎧 OpenAI has created MuseNet, a deep neural network capable of generating 4-minute musical compositions using 10 different instruments and combining various musical styles from country to classical to rock. The system uses the same transformer technology as GPT-2, learning musical patterns through unsupervised training on hundreds of thousands of MIDI files rather than explicit musical programming.
AI Neutral · The Verge – AI · Mar 25 · 5/10
🎧 Google has released Lyria 3 Pro, an upgraded AI music generation tool that can create tracks up to three minutes long, six times longer than the previous 30-second limit. The tool allows users to prompt for specific song elements like intros, choruses, and bridges, and can generate both music and lyrics from text prompts or reference photos.
AI Bullish · arXiv – CS AI · Mar 3 · 5/10
🎧 Researchers developed SMDIM, a new diffusion model for symbolic music generation that efficiently handles long sequences by combining global structure construction with local refinement. The model outperforms existing approaches in both generation quality and computational efficiency across various musical styles including Western classical, popular, and folk music.
AI Neutral · arXiv – CS AI · Mar 3 · 4/10
🎧 Researchers propose GACA-DiT, a new AI framework that generates music synchronized with dance movements using diffusion transformers. The system addresses limitations of existing methods by incorporating genre-adaptive rhythm extraction and context-aware temporal alignment for better synchronization between dance and music.
AI Bullish · arXiv – CS AI · Mar 3 · 4/10
🎧 Researchers introduce Depth-Structured Music Recurrence (DSMR), a new AI training method for symbolic music generation that processes complete compositions efficiently. The technique uses stateful recurrent attention with memory distributed across layers, achieving performance similar to full-memory models while using 59% less GPU memory and delivering 36% higher throughput.
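The memory idea behind this class of models can be sketched in a few lines: process a long sequence in fixed-size chunks and carry a small per-layer memory forward, so attention cost stays bounded rather than growing with composition length. This is a minimal single-layer sketch under assumed dimensions, not the paper's architecture or code.

```python
import numpy as np

# Minimal sketch (assumed architecture, not DSMR itself): attend over each
# chunk plus a small carried memory, so a full composition is processed
# without ever materializing attention over the whole sequence.

rng = np.random.default_rng(0)
D, MEM, CHUNK = 16, 8, 32  # model dim, memory slots, chunk length (assumed)

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def chunk_attention(x, memory):
    """Attend over the current chunk concatenated with carried memory."""
    kv = np.concatenate([memory, x], axis=0)   # (MEM + CHUNK, D)
    attn = softmax(x @ kv.T / np.sqrt(D))      # (CHUNK, MEM + CHUNK)
    out = attn @ kv                            # (CHUNK, D)
    new_memory = out[-MEM:]                    # keep latest states as memory
    return out, new_memory

seq = rng.normal(size=(4 * CHUNK, D))          # a "long" piece: 4 chunks
memory = np.zeros((MEM, D))                    # stateful memory, carried along
outputs = []
for i in range(0, len(seq), CHUNK):
    out, memory = chunk_attention(seq[i:i + CHUNK], memory)
    outputs.append(out)
full = np.concatenate(outputs)
print(full.shape)  # (128, 16): whole sequence seen, attention cost per chunk
```

The memory savings reported in the summary come from exactly this trade: each attention call sees only `CHUNK + MEM` keys instead of the full sequence.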
AI Neutral · arXiv – CS AI · Mar 3 · 4/10
🎧 Researchers introduce CMI-RewardBench, a comprehensive evaluation framework for music generation AI models that can process multimodal inputs including text, lyrics, and audio. The system includes a 110k sample preference dataset and reward models that show strong correlation with human judgments for music quality assessment.
AI Neutral · arXiv – CS AI · Mar 3 · 4/10
🎧 Researchers introduce SyncTrack, an AI model for multi-track music generation that addresses rhythmic stability and synchronization issues in existing models. The model uses track-shared modules for common rhythm and track-specific modules for diverse timbres, introducing new metrics to evaluate multi-track music quality.
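The shared/specific split described above can be sketched schematically: one shared module produces a rhythm representation consumed by every track, and lightweight per-track modules layer instrument-specific detail on top, so all tracks stay on the same temporal grid. All names, shapes, and the toy math here are assumptions for illustration, not the released model.

```python
import numpy as np

# Schematic sketch (assumed design, not SyncTrack's implementation): a
# track-shared rhythm module feeds several track-specific timbre modules.

rng = np.random.default_rng(1)
STEPS, D = 64, 8  # time steps and feature dim (assumed)

SHARED_W = rng.normal(size=(D, D)) * 0.1

def shared_rhythm(cond):
    """Track-shared module: one rhythm latent used by every track."""
    return np.tanh(cond @ SHARED_W)

def track_module(rhythm, seed):
    """Track-specific module: timbre detail layered on the shared rhythm."""
    w = np.random.default_rng(seed).normal(size=(D, D)) * 0.1
    return rhythm + np.tanh(rhythm @ w)

cond = rng.normal(size=(STEPS, D))       # e.g. a style-conditioning signal
rhythm = shared_rhythm(cond)             # computed once, shared by all tracks
tracks = {name: track_module(rhythm, s)  # drums/bass/lead share one rhythm
          for s, name in enumerate(["drums", "bass", "lead"])}
print({name: t.shape for name, t in tracks.items()})
```

Because every track is a function of the same `rhythm` tensor, the tracks can diverge in timbre while remaining rhythmically aligned, which is the synchronization property the summary highlights.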