Phoneme segmentation using deep learning for speech synthesis

Young Han Lee, Jong Yeol Yang, Choong Sang Cho, Hyedong Jung

2018

In this paper, we propose a deep-learning-based phoneme segmentation method. Phoneme segmentation is one of the basic modules of unit-selection-based speech synthesis. To improve it, we add a cross-entropy loss to a Deep Speech-based speech recognition architecture. This approach yields more accurate phoneme boundaries. In our experiments, the proposed method has a boundary accuracy of 20.91%, which is higher than that of conventional phoneme segmentation.
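The idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the loss formulation or the evaluation protocol, so everything below is an assumption: Deep Speech is normally trained with a CTC loss, and here the extra frame-level cross-entropy term is simply added with a hypothetical weight; the 10 ms frame shift and 20 ms tolerance are likewise illustrative values, and all function names are invented for this example.

```python
import math


def frame_cross_entropy(posteriors, labels):
    """Mean cross-entropy between per-frame phoneme posteriors and frame labels.

    posteriors: list of per-frame probability lists (one entry per phoneme class).
    labels: list of integer phoneme indices, one per frame.
    """
    eps = 1e-12  # guard against log(0)
    total = 0.0
    for probs, label in zip(posteriors, labels):
        total -= math.log(max(probs[label], eps))
    return total / len(labels)


def combined_loss(ctc_loss, ce_loss, weight=1.0):
    """Assumed combination: recognition (CTC) loss plus weighted frame-level CE."""
    return ctc_loss + weight * ce_loss


def boundaries_from_frames(frame_labels, frame_shift_ms=10.0):
    """Return the times (ms) at which the frame-level phoneme label changes."""
    return [
        i * frame_shift_ms
        for i in range(1, len(frame_labels))
        if frame_labels[i] != frame_labels[i - 1]
    ]


def boundary_accuracy(predicted, reference, tolerance_ms=20.0):
    """Fraction of reference boundaries matched by a prediction within tolerance."""
    if not reference:
        return 0.0
    hits = sum(
        1 for r in reference
        if any(abs(r - p) <= tolerance_ms for p in predicted)
    )
    return hits / len(reference)
```

In this sketch, the cross-entropy term pushes each frame toward its aligned phoneme label, which is what makes label changes between consecutive frames usable as boundary estimates. A CTC loss alone does not constrain where in time each label is emitted.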

Keywords:

Computer vision
Deep learning
Artificial intelligence
Cross entropy
Speech recognition
Speech synthesis
Segmentation
Computer science
