Towards the Development of Automatic Speech Recognition for Bikol and Kapampangan

2019 
In this paper, we developed continuous speech recognition for Bikol and Kapampangan language using the CMU Sphinx Toolkit. The speech corpus that was collected by the researchers has duration of 7 hours from 150 native Kapampangan and Bikolano speakers. The speech corpus contains commonly used travelling phrases. This research explains how the researchers developed the speech recognition for local Philippine languages by using the traditional machine learning approach of Gaussian Mixture Model based on Hidden Markov Model. Experimentation of different tied states (senones) is integrated to this study. Experiment shows the highest obtained accuracy of 96.41% for Bikol Language; 96.49% for Kapampangan both hit at 600 tied states. While on the combined languages the highest accuracy gained is 95.39% at 1400 tied states.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    5
    References
    0
    Citations
    NaN
    KQI
    []