AI · Neutral · arXiv – CS AI · 6h ago · 6/10
RIVET: Robust Idempotent Voice Attribute Editing
Researchers introduce RIVET, a training framework that uses idempotency constraints to make voice attribute editing models more robust to noisy or inconsistent labels in large-scale speech datasets. By enforcing the property that repeated applications produce the same result as a single application, the method acts as an implicit regularizer that reduces sensitivity to mislabeled training data while preserving speaker identity.
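To make the idempotency property concrete, here is a minimal Python sketch of how one might measure it. This is an illustration only, not RIVET's actual loss: the function `idempotency_penalty`, the toy edits `clamp_edit` and `shift_edit`, and the use of plain lists of floats in place of speech features are all assumptions made for this example.

```python
from typing import Callable, List, Sequence

Vector = Sequence[float]


def idempotency_penalty(edit: Callable[[Vector], List[float]], x: Vector) -> float:
    """Mean squared gap between applying `edit` once and applying it twice.

    Hypothetical helper: the penalty is 0.0 exactly when edit(edit(x)) == edit(x).
    """
    once = edit(x)
    twice = edit(once)
    return sum((a - b) ** 2 for a, b in zip(once, twice)) / len(once)


def clamp_edit(x: Vector) -> List[float]:
    # Toy idempotent edit: clamp every feature into [0, 1].
    return [min(1.0, max(0.0, v)) for v in x]


def shift_edit(x: Vector) -> List[float]:
    # Toy non-idempotent edit: each application shifts features by 0.5.
    return [v + 0.5 for v in x]


clamp_gap = idempotency_penalty(clamp_edit, [-1.0, 0.3, 2.0])  # 0.0
shift_gap = idempotency_penalty(shift_edit, [0.0, 1.0])        # 0.25
```

In a training setup, a term like this would presumably be added to the main editing objective so that the model is penalized whenever a second pass changes its own output; how RIVET weights or formulates that term is not stated in the summary.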