ASVtorch toolkit: Speaker verification with deep neural networks | Zendy

Kong Aik Lee | Zendy; Ville Vestman | Zendy; Tomi Kinnunen | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

ASVtorch toolkit: Speaker verification with deep neural networks

Author(s) -

Kong Aik Lee,

Ville Vestman,

Tomi Kinnunen

Publication year - 2021

Publication title -

softwarex

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.528

H-Index - 21

ISSN - 2352-7110

DOI - 10.1016/j.softx.2021.100697

Subject(s) - computer science , python (programming language) , speaker verification , deep learning , deep neural networks , artificial intelligence , open source , artificial neural network , machine learning , programming language , speaker recognition , software

The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) — recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non-experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research