z-logo
open-access-imgOpen Access
Speaker Identification Based On Ivector And Xvector
Author(s) -
Xinyu Yuan,
Guanyu Li,
Jiao Han,
Di Wang,
Zhi Tiankai
Publication year - 2021
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1827/1/012133
Subject(s) - speaker recognition , computer science , speaker identification , speech recognition , artificial neural network , artificial intelligence , identification (biology) , speaker diarisation , feature extraction , feature (linguistics) , key (lock) , pattern recognition (psychology) , linguistics , philosophy , botany , computer security , biology
As an important branch of AI (Artificial Intelligence), speaker recogniti-on technology has developed rapidly in recent years. At present, speaker recognition technologies based on traditional methods and deep learning methods are very mature and have achieved good results. As a key technology in human-machine voice intera-ction, speaker recognition has changed human life in many ways. However, there is almost no research on speaker recognition in Tibetan. This paper uses Gaussian mix-ture model and deep neural network TDNN as the theoretical basis. On the Tibetan language corpus, the speaker identification on the ivector of GMM-UBM and the speaker identification on the xvector of TDNN are performed respectively, and the experimental results are compared. The experimental system is divided into prepro-cessing, feature extraction, model training and decision.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here