AINeutralarXiv – CS AI · 6h ago6/10
🧠
Systematic Study of Dysarthric Speech Recognition: Spectral Features and Acoustic Models
Researchers have achieved significant improvements in dysarthric speech recognition by systematically combining acoustic features with the Factorized Time Delay Neural Network (F-TDNN) model, demonstrating 4.65% relative improvement in word recognition and 4.63% in sentence recognition. The study identifies pitch features as particularly effective for handling the acoustic variability characteristic of impaired speech, advancing accessibility technology for individuals with speech disorders.