ASVtorch toolkit: Speaker verification with deep neural networks
Author(s) -
Kong Aik Lee,
Ville Vestman,
Tomi Kinnunen
Publication year - 2021
Publication title -
softwarex
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.528
H-Index - 21
ISSN - 2352-7110
DOI - 10.1016/j.softx.2021.100697
Subject(s) - computer science , python (programming language) , speaker verification , deep learning , deep neural networks , artificial intelligence , open source , artificial neural network , machine learning , programming language , speaker recognition , software
The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) — recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non-experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom