Noise Robust Speaker Recognition using Reduced Multiconditional Gaussian Mixture Models
Author(s) -
Frederico D’Almeida,
Francisco Assis de Oliveira Nascimento,
Pedro Berger,
Lúcio Martins da Silva
Publication year - 2008
Publication title -
the international journal of forensic computer science
Language(s) - English
Resource type - Journals
eISSN - 1980-7333
pISSN - 1809-9807
DOI - 10.5769/j200801006
Subject(s) - speech recognition , speaker recognition , mixture model , noise (video) , computer science , acoustics , gaussian noise , pattern recognition (psychology) , artificial intelligence , physics , image (mathematics)
Multiconditional Modeling is widely used to create noise-robust speaker recognition systems. However, the approach is computationally intensive. An alternative is to optimize the training condition set in order to achieve maximum noise robustness while using the smallest possible number of noise conditions during training. This paper establishes the optimal conditions for a noise-robust training model by considering audio material at different sampling rates and with different coding methods. Our results demonstrate that using approximately four training noise conditions is sufficient to guarantee robust models in the 60 dB to 10 dB Signal-to-Noise Ratio (SNR) range
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom