Collection of phoneme samples using time alignment and spectral stationarity of speech signals
1985
An automatic method for collecting a large number of phoneme samples to be used as training data for speech recognition is described. Time alignment and spectral stationarity of speech signals are used to transfer phoneme labels from a hand labeled utterance of a standard speaker to a similar utterance of another speaker for whom training data are needed. Experimental results based on speech data obtained from eight male speakers show that automatically obtained training data almost yield the same phoneme recognition accuracy as hand labeled training data.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
10
References
1
Citations
NaN
KQI