AI · Bearish · arXiv – CS AI · 8h ago · 7/10
The Alignment Curse: Modality Alignment Supercharges Audio Attacks via Text Transfer
Researchers describe an 'Alignment Curse': the more tightly a multimodal AI model aligns its text and audio representations, the more effectively text-based jailbreak attacks transfer to the audio channel. The finding exposes a critical safety vulnerability in recent omni-models such as Qwen and suggests that current audio safety evaluations significantly underestimate risks that originate in the text modality.