SINGLE CHANNEL SPEECH ENHANCEMENT USING EVOLUTIONARY ALGORITHM WITH LOG-MMSE | Zendy

Kalpana Ghorpade | Zendy; Arti Khaparde | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

SINGLE CHANNEL SPEECH ENHANCEMENT USING EVOLUTIONARY ALGORITHM WITH LOG-MMSE

Author(s) -

Kalpana Ghorpade,

Arti Khaparde

Publication year - 2022

Publication title -

asean engineering journal

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.135

H-Index - 3

ISSN - 2586-9159

DOI - 10.11113/aej.v12.16770

Subject(s) - pesq , speech enhancement , intelligibility (philosophy) , particle swarm optimization , speech recognition , computer science , minimum mean square error , algorithm , noise (video) , noise reduction , mathematics , artificial intelligence , statistics , epistemology , estimator , image (mathematics) , philosophy

Additive noise degrades speech quality and intelligibility. Speech enhancement reduces this noise to make speech more pleasant and intelligible. It plays a significant role in speech recognition or speech-operated systems. In this paper, we propose a single-channel speech enhancement method in which the log-minimum mean square error method (log-MMSE) and modified accelerated particle swarm optimization algorithm are used to design a filter for improving the quality and intelligibility of noisy speech. Accelerated particle swarm optimization (APSO) algorithm is modified in which a single dimension of particle position is changed in a single iteration while obtaining the particle’s new position. Using this algorithm, a filter is designed with multiple passbands and notches for speech enhancement. The modified algorithm converges faster compared with standard particle swarm optimization algorithm (PSO) and APSO giving optimum filter coefficients. The designed filter is used to enhance the speech. The proposed speech enhancement method improves the perceptual estimation of speech quality (PESQ) by 17.05% for 5dB babble noise, 33.92 % for 5dB car noise, 14.96 % for 5dB airport noise, and 39.13 % for 5dB exhibition noise. The average output PESQ for these four types of noise is improved compared to conventional methods of speech enhancement. There is an average of 7.58 dB improvement in segmental SNR for these noise types. The proposed method improves speech intelligibility with minimum speech distortion.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research