z-logo
open-access-imgOpen Access
Ideal neighbourhood mask for speech enhancement
Author(s) -
Arcos C.D.,
Vellasco M.,
Alcaim A.
Publication year - 2018
Publication title -
electronics letters
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.375
H-Index - 146
eISSN - 1350-911X
pISSN - 0013-5194
DOI - 10.1049/el.2017.2935
Subject(s) - computer science , ideal (ethics) , binary number , speech enhancement , neighbourhood (mathematics) , speech recognition , word error rate , artificial intelligence , pattern recognition (psychology) , noise reduction , mathematics , arithmetic , mathematical analysis , philosophy , epistemology
A novel approach for speech enhancement applications by applying spectral mask estimation is introduced. The new application uses the local binary patterns to estimate an ideal neighbourhood mask. This will indicate which time–frequency units of the noisy speech are dominated by the noise. The performance assessment of the proposed application in conjunction with the traditional mask techniques, i.e. ideal binary mask and ideal ratio mask, are carried out under various environments in terms of the objective speech quality measures, as well as word error rate performance in speech recognition systems using deep neural networks. Results indicated that the proposed mask yielded significantly better performance than the conventional techniques.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here