Improving Children Speech Recognition through Feature Learning from Raw Speech Signal

S. Pavankumar Dubagunta,Selen Hande Kabil,Mathew Magimai-Doss

Improving Children Speech Recognition through Feature Learning from Raw Speech Signal

2019

S. Pavankumar Dubagunta
Selen Hande Kabil
Mathew Magimai-Doss

Children speech recognition based on short-term spectral features is a challenging task. One of the reasons is that children speech has high fundamental frequency that is comparable to formant frequency values. Furthermore, as children grow, their vocal apparatus also undergoes changes. This presents difficulties in extracting standard short-term spectral-based features reliably for speech recognition. In recent years, novel acoustic modeling methods have emerged that learn both the feature and phone classifier in an end-to-end manner from the raw speech signal. Through an investigation on PF-STAR corpus we show that children speech recognition can be improved using end-to-end acoustic modeling methods.

Keywords:

Speech recognition
Computer science
Feature learning
Pattern recognition
Data modeling
Convolution
Formant
Phone
Artificial intelligence
Classifier (linguistics)
Feature extraction
Fundamental frequency

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations