
Usable speech detection based on empirical mode decomposition
Author(s) -
Ghezaiel W.,
Slimanne A. Ben,
Braiek E. Ben
Publication year - 2013
Publication title -
electronics letters
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.375
H-Index - 146
ISSN - 1350-911X
DOI - 10.1049/el.2012.3639
Subject(s) - usable , speech recognition , computer science , hilbert–huang transform , channel (broadcasting) , speech processing , mode (computer interface) , timit , artificial intelligence , hidden markov model , telecommunications , white noise , world wide web , operating system
Recently, usable speech criteria have been proposed to extract minimally corrupted speech for speaker identification in co‐channel speech. Proposed is a new usable speech extraction method based on the pitch information obtained from a multi‐resolution analysis by empirical mode decomposition. The idea is to retain the speech segments that have only one pitch detected and remove the others. Evaluation of this method is performed on the TIMIT database referring to the target to interferer ratio measure. Co‐channel speech is constructed by mixing all possible gender speakers. Results do not show much difference for different mixtures. For the overall mixtures 94.97% of usable speech is correctly detected with false alarms of 16.52%.