Arabic Speech Synthesis using Deep Neural Networks

Aya Hamdy Ali,Mohamed Magdy,Maher Alfawzy,Mikhail Ghaly,Hazem M. Abbas

Arabic Speech Synthesis using Deep Neural Networks

2021

Aya Hamdy Ali
Mohamed Magdy
Maher Alfawzy
Mikhail Ghaly
Hazem M. Abbas

Text-to-speech (TTS) synthesis is a rapidly growing field of research. Deep learning has shown impressive results in speech synthesis and outperformed the older concatenative and parametric methods. In this paper, speech synthesis using deep learning architectures is explored and two models are utilized in an end-to-end Arabic TTS system. The results of the two systems are compared to concatenative TTS system using the Mean Opinion Score (MOS) of the synthesized speech and indicates that deep learning based systems have outperformed the concatenative system when it comes to naturalness and intelligibility; moreover, it reduces system complexity.

Keywords:

Field (computer science)
Artificial neural network
Speech synthesis
Deep learning
Hidden Markov model
Naturalness
Intelligibility (communication)
Mean opinion score
Artificial intelligence
Computer science
Speech recognition

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations