Implicit trajectory modeling through Gaussian Transition Models for speech recognition

Hua Yu, Tanja Schultz

2003

It is well known that the frame independence assumption is a fundamental limitation of current HMM-based speech recognition systems. By treating each speech frame independently, HMMs fail to capture trajectory information in the acoustic signal. This paper introduces Gaussian Transition Models (GTM) to model trajectories implicitly. Compared with alternative approaches, such as segment modeling and parallel-path HMMs, GTM has three advantages: it integrates seamlessly with the HMM framework, it can model a large number of trajectories, and it requires no topology to be defined a priori. Preliminary experiments on Switchboard, a large-vocabulary conversational speech recognition task, have shown promising results.

Keywords:

Trajectory
Machine learning
Speech recognition
Natural language processing
Artificial intelligence
Vocabulary
Gaussian
Computer science
Hidden Markov model
Statistical assumption
Conversational speech
A priori and a posteriori
