y0news
#temporal-masking · 1 article
AI · Bullish · arXiv – CS AI · 10h ago · 6/10

SyncSpeech: Efficient and Low-Latency Text-to-Speech based on Temporal Masked Transformer

Researchers introduce SyncSpeech, a new text-to-speech model that combines autoregressive and non-autoregressive approaches using a Temporal Masked Transformer architecture. The model achieves 5.8x lower first-packet latency and an 8.8x improvement in real-time performance while maintaining speech quality comparable to existing models.