y0news
#research-dataset · 1 article
AI · Bullish · arXiv – CS AI · 6d ago · 7/10 · 3

WAXAL: A Large-Scale Multilingual African Language Speech Corpus

Researchers have released WAXAL, a large-scale multilingual speech dataset covering 24 Sub-Saharan African languages that together have over 100 million speakers. The dataset includes 1,250 hours of transcribed speech for automatic speech recognition (ASR) and 235 hours of high-quality recordings for text-to-speech (TTS). It is released under a CC-BY-4.0 license to advance inclusive AI technologies.