AI · Neutral · arXiv – CS AI · 5h ago · 6/10
Targeted Speaker Poisoning Framework in Zero-Shot Text-to-Speech
Researchers introduce Speech Generation Speaker Poisoning (SGSP), a framework for removing specific speaker identities from zero-shot text-to-speech models while maintaining utility for other speakers. The study evaluates privacy-utility trade-offs and identifies scalability limitations when attempting to forget more than 15 speakers, highlighting emerging challenges in generative voice privacy.