z-logo
open-access-imgOpen Access
SINGLE CHANNEL SPEECH ENHANCEMENT USING EVOLUTIONARY ALGORITHM WITH LOG-MMSE
Author(s) -
Kalpana Ghorpade,
Arti Khaparde
Publication year - 2022
Publication title -
asean engineering journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.135
H-Index - 3
ISSN - 2586-9159
DOI - 10.11113/aej.v12.16770
Subject(s) - pesq , speech enhancement , intelligibility (philosophy) , particle swarm optimization , speech recognition , computer science , minimum mean square error , algorithm , noise (video) , noise reduction , mathematics , artificial intelligence , statistics , epistemology , estimator , image (mathematics) , philosophy
Additive noise degrades speech quality and intelligibility. Speech enhancement reduces this noise to make speech more pleasant and intelligible. It plays a significant role in speech recognition or speech-operated systems. In this paper, we propose a single-channel speech enhancement method in which the log-minimum mean square error method (log-MMSE) and modified accelerated particle swarm optimization algorithm are used to design a filter for improving the quality and intelligibility of noisy speech. Accelerated particle swarm optimization (APSO) algorithm is modified in which a single dimension of particle position is changed in a single iteration while obtaining the particle’s new position. Using this algorithm, a filter is designed with multiple passbands and notches for speech enhancement. The modified algorithm converges faster compared with standard particle swarm optimization algorithm (PSO) and APSO giving optimum filter coefficients. The designed filter is used to enhance the speech. The proposed speech enhancement method improves the perceptual estimation of speech quality (PESQ) by 17.05% for 5dB babble noise, 33.92 %  for 5dB car noise, 14.96 % for 5dB airport noise, and 39.13 % for 5dB exhibition noise. The average output PESQ for these four types of noise is improved compared to conventional methods of speech enhancement. There is an average of 7.58 dB improvement in segmental SNR for these noise types. The proposed method improves speech intelligibility with minimum speech distortion.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here