A Tool for Making Segmented Speech Corpus for ASR and TTS Modeling

Agung Santosa,Harnum Annisa,Asril Jarin,Gunarso,Lyla Ruslana Aini,Mohammad Teduh Uliniansyah,Made Gunawan,Andi Djalal Latief,Elvira Nurfadhilah,Fara Ayuningtyas

A Tool for Making Segmented Speech Corpus for ASR and TTS Modeling

2019

Agung Santosa
Harnum Annisa
Asril Jarin
Gunarso
Lyla Ruslana Aini
Mohammad Teduh Uliniansyah
Made Gunawan
Andi Djalal Latief
Elvira Nurfadhilah
Fara Ayuningtyas

To develop models of Natural Language Processing (NLP), such as speech recognition and speech synthesis, require the provision of a speech corpus that has been segmented and useful as training data. On the other hand, making a speech corpus costs a lot because it can include studio rent and payment of recorded speaker fees. Therefore, in this paper, we develop a tool for making speech corpus equipped with a function that guides the speaker to record their speech in sentences. This tool can be run independently on a personal computer so that we can do recording anytime and anywhere. To produce a better speech corpus, the recording results of this tool still require to be checked because they could potentially have a signal clip or low amplitudes.

Keywords:

Speech recognition
sentence segmentation
Speech corpus
Speech synthesis
Training set
Computer science
personal computer

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations