y0news
AnalyticsDigestsSourcesRSSAICrypto
#text-to-audio1 article
1 articles
AINeutralarXiv โ€“ CS AI ยท 9h ago5/10
๐Ÿง 

Evaluating Semantic Fragility in Text-to-Audio Generation Systems Under Controlled Prompt Perturbations

Researchers evaluated the semantic fragility of text-to-audio generation systems, finding that small changes in prompts can lead to substantial variations in generated audio output. While larger models like MusicGen-large showed better semantic consistency, all models exhibited persistent divergence in acoustic and temporal characteristics even when semantic similarity remained high.