Constrained Subword Units for Speaker Recognition

Doris Baum,Daniel Schneider,Timo Mertens,Joachim Köhler

Constrained Subword Units for Speaker Recognition

2010

Phonetic features have been proposed to overcome performance degradation in spectral speaker recognition in difficult acoustic conditions. The harmful effect of those conditions, however, is not restricted to spectral systems but also affects the performance of the open-loop phone recognisers on which phonetic systems are based. In automatic speech recognition, larger subword units and the use of additional constraints from language models have been employed to improve robustness against adverse acoustic conditions. This paper evaluates the performance of more constrained phone recognition and different subword units for speaker recognition on heterogeneous broadcast data from German parliamentary speeches. Using phone clusters and a strong language model instead of phones obtained from unconstrained recognition improves the equal error rate from 14.3% to 8.6% on the given data.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations