ASV-SUBTOOLS: Open Source Toolkit for Automatic Speaker Verification

Fuchuan Tong, Miao Zhao, Jianfeng Zhou, Hao Lu, Zheng Li, Lin Li, Qingyang Hong

2021

In this paper, we introduce a new open source toolkit for automatic speaker verification (ASV), named ASV-Subtools. Adopting PyTorch as the main deep learning engine and the Kaldi toolkit for data processing, ASV-Subtools allows users to develop modern speaker recognizers flexibly and efficiently. The toolkit prioritizes efficiency, modularity, and extensibility, with the goal of supporting state-of-the-art technologies in speaker recognition. In addition to commonly used networks, such as time delay neural networks (TDNN), factorized TDNN (F-TDNN), and ResNet, ASV-Subtools integrates an upgraded version of the SpecAugment data augmentation method, named Inverted SpecAugment, designed to be more appropriate for speaker recognition subtasks. Furthermore, to alleviate the domain mismatch between training and test data, ASV-Subtools provides multiple domain adaptation methods for Probabilistic Linear Discriminant Analysis (PLDA). Experimental results show that state-of-the-art techniques implemented in ASV-Subtools achieve performance competitive with other implementations.
