z-logo
open-access-imgOpen Access
Noise Robust Speaker Recognition using Reduced Multiconditional Gaussian Mixture Models
Author(s) -
Frederico D’Almeida,
Francisco Assis de Oliveira Nascimento,
Pedro Berger,
Lúcio Martins da Silva
Publication year - 2008
Publication title -
the international journal of forensic computer science
Language(s) - English
Resource type - Journals
eISSN - 1980-7333
pISSN - 1809-9807
DOI - 10.5769/j200801006
Subject(s) - speech recognition , speaker recognition , mixture model , noise (video) , computer science , acoustics , gaussian noise , pattern recognition (psychology) , artificial intelligence , physics , image (mathematics)
Multiconditional Modeling is widely used to create noise-robust speaker recognition systems. However, the approach is computationally intensive. An alternative is to optimize the training condition set in order to achieve maximum noise robustness while using the smallest possible number of noise conditions during training. This paper establishes the optimal conditions for a noise-robust training model by considering audio material at different sampling rates and with different coding methods. Our results demonstrate that using approximately four training noise conditions is sufficient to guarantee robust models in the 60 dB to 10 dB Signal-to-Noise Ratio (SNR) range

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom