Cuckoo search based optimal mask generation for noise suppression and enhancement of speech signal
Author(s) -
Anil Garg,
Omprakash Sahu
Publication year - 2015
Publication title -
journal of king saud university - computer and information sciences
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.617
H-Index - 33
eISSN - 2213-1248
pISSN - 1319-1578
DOI - 10.1016/j.jksuci.2014.04.006
Subject(s) - spectrogram , cuckoo search , waveform , speech enhancement , computer science , speech recognition , signal (programming language) , noise (video) , signal to noise ratio (imaging) , set (abstract data type) , pattern recognition (psychology) , algorithm , artificial intelligence , noise reduction , telecommunications , radar , particle swarm optimization , image (mathematics) , programming language
In this paper, an effective noise suppression technique for enhancement of speech signals using optimized mask is proposed. Initially, the noisy speech signal is broken down into various time–frequency (TF) units and the features are extracted by finding out the Amplitude Magnitude Spectrogram (AMS). The signals are then classified based on quality ratio into different classes to generate the initial set of solutions. Subsequently, the optimal mask for each class is generated based on Cuckoo search algorithm. Subsequently, in the waveform synthesis stage, filtered waveforms are windowed and then multiplied by the optimal mask value and summed up to get the enhanced target signal. The experimentation of the proposed technique was carried out using various datasets and the performance is compared with the previous techniques using SNR. The results obtained proved the effectiveness of the proposed technique and its ability to suppress noise and enhance the speech signal
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom