ASVtorch toolkit: Speaker verification with deep neural networks

Kong Aik Lee,Ville Vestman,Tomi Kinnunen

ASVtorch toolkit: Speaker verification with deep neural networks

2021

Kong Aik Lee
Ville Vestman
Tomi Kinnunen

Abstract The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) — recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non-experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework.

Keywords:

speaker verification
Artificial intelligence
Human voice
Python (programming language)
deep neural networks
Human–computer interaction
Deep learning
Computer science

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations