AIBullisharXiv – CS AI · 6h ago6/10
🧠
FlowEdit: Associative Memory for Lifelong Pronunciation Adaptation in Flow-Matching TTS
Researchers introduce FlowEdit, a lifelong adaptation framework for text-to-speech systems that corrects pronunciation errors without retraining the underlying model. Using associative memory and latent conditioning edits, FlowEdit achieves 92.7% error reduction on multilingual proper nouns while maintaining speech quality and completing corrections in ~15 seconds.