X-vectors based Urdu Speaker Identification for short utterances

Muhammad Farooq, Farah Adeeba, Sarmad Hussain

2019

In commercial applications, the robustness of a Speaker Identification (SI) system is adversely affected by short utterances. The performance of SI systems depends substantially on the extracted feature sets. This paper investigates the effect of various feature extraction techniques on the performance of i-vector and x-vector based Urdu speaker identification models. The scope of this paper is restricted to text-independent speaker identification for short utterances (up to 4 seconds). SI systems require a large amount of data covering sufficient inter-speaker and intra-speaker variability. An available Urdu speech corpus is used to measure the performance of various feature sets on SI systems. A minimum percentage Equal Error Rate (%EER) of 0.113 is achieved using x-vectors with the Linear Frequency Cepstral Coefficients (LFCCs) feature set.
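
The abstract reports results as a percentage Equal Error Rate. As a reminder of what that metric measures, the following is a minimal sketch of one common way to estimate it from trial scores; it is not the authors' scoring pipeline, which is not described here. The function name `equal_error_rate`, the toy scores, and the convention that a higher score means "more likely the same speaker" are all assumptions made for illustration.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Estimate the EER (as a fraction) from two lists of trial scores.

    Assumption: higher scores mean "more likely the same speaker".
    The EER is taken at the threshold where the false acceptance rate
    and the false rejection rate are closest to each other.
    """
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best_gap, eer = float("inf"), None
    for t in thresholds:
        # False rejection: a genuine trial scored below the threshold.
        frr = sum(s < t for s in target_scores) / len(target_scores)
        # False acceptance: an impostor trial scored at or above the threshold.
        far = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Hypothetical toy scores, not data from the paper.
genuine = [0.9, 0.8, 0.7, 0.4]
impostor = [0.6, 0.3, 0.2, 0.1]
percent_eer = 100 * equal_error_rate(genuine, impostor)
print(f"%EER = {percent_eer:.1f}")
```

With only a handful of trials the estimate is coarse; on a real evaluation set the threshold sweep runs over many more scores, and toolkits typically interpolate between neighbouring operating points.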

Keywords:

Speech recognition
speaker identification
Computer science
Urdu
feature set
Feature extraction
Speech corpus
Mel-frequency cepstrum
Robustness (computer science)
Word error rate
