Probability based prosody model for unit selection

Xijun Ma, Wei Zhang, Weibin Zhu, Qin Shi, Ling Jin

2004

Most modern text-to-speech (TTS) systems are based on unit selection. In such systems, the prosody values predicted for each synthesis unit, such as pitch, duration, and energy, are important factors in guiding unit selection. We present a probability-based prosody model in which the distribution of prosody values within a given context-equivalent cluster is described by a Gaussian mixture model (GMM), and the distance between a candidate unit and that cluster is defined by the GMM's probability output. A novel framework for unit-selection TTS systems is derived from the model, and a series of experiments is conducted on the framework.
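The idea of scoring a candidate unit by a cluster's GMM can be illustrated with a minimal sketch. The abstract does not give the exact distance formula, so the sketch assumes the distance is the negative log-likelihood of the candidate's prosody vector under a diagonal-covariance GMM. The function names, the feature layout (pitch, duration, energy), and all parameter values are hypothetical and are not taken from the paper.

```python
import math


def gaussian_logpdf(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_logpdf(x, weights, means, variances):
    """Log density of a GMM at x, computed with log-sum-exp for stability."""
    terms = [
        math.log(w) + gaussian_logpdf(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    peak = max(terms)
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def prosody_distance(x, gmm):
    """Assumed distance: negative GMM log-likelihood (lower means closer)."""
    weights, means, variances = gmm
    return -gmm_logpdf(x, weights, means, variances)


def best_candidate(candidates, gmm):
    """Return the candidate prosody vector with the smallest distance."""
    return min(candidates, key=lambda c: prosody_distance(c, gmm))


# Hypothetical GMM for one context-equivalent cluster.
# Feature order (assumed): pitch in Hz, duration in ms, energy in dB.
cluster_gmm = (
    [0.6, 0.4],                                   # mixture weights
    [[200.0, 120.0, 60.0], [230.0, 150.0, 65.0]],  # component means
    [[100.0, 225.0, 9.0], [144.0, 400.0, 16.0]],   # component variances
)

# Hypothetical candidate units drawn from the speech database.
candidates = [
    [202.0, 118.0, 61.0],  # close to the first component
    [300.0, 40.0, 45.0],   # far from both components
]

chosen = best_candidate(candidates, cluster_gmm)
```

In a full unit-selection system this score would normally be combined with a concatenation cost between adjacent units. The sketch covers only the per-unit prosody term.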

Keywords:

Probability distribution
Mixture model
Gaussian process
Pattern recognition
Artificial intelligence
Speech synthesis
Prosody
Machine learning
Computer science
Speech recognition
