Structural design of hidden Markov model speech recognizer using multivalued phonetic features: Comparison with segmental speech units

1992 
A novel approach to speech recognition, on the basis of a multidimensional, multivalued phonetic‐feature description of speech signals, is presented and evaluated. The hidden Markov model (HMM) framework is used to provide the recognition algorithm, which assumes that the underlying Markov chain tracks the temporal evolution of the features. It is shown that this approach can naturally accommodate such coarticulatory effects as feature spreading and formant transition in the functionality of the recognizer, and can provide a high degree of acoustic data sharing that makes effective use of training data. Use of phonetic features as the basic speech units creates a framework where the Markov model’s state topology in the recognizer can be designed with guidance of detailed speech knowledge. Details of such a design for a stop consonant–vowel vocabulary are described. Experimental results on the task of speaker‐dependent stop consonant discrimination, evaluated from speech data from a total of ten male and f...
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    43
    Citations
    NaN
    KQI
    []