
Noise reduction optimization of sound sensor based on a Conditional Generation Adversarial Network
Author(s) -
Xi Lin,
Dongru Yang,
Yadong Mao,
Lei Zhou,
Xiaobo Zhao,
ShengGuo Lu
Publication year - 2021
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1873/1/012034
Subject(s) - pesq , computer science , noise reduction , speech recognition , speech enhancement , intelligibility (philosophy) , noise (video) , artificial neural network , reduction (mathematics) , noise measurement , artificial intelligence , mathematics , image (mathematics) , philosophy , geometry , epistemology
To address the problems in the traditional speech signal noise elimination methods, such as the residual noise, poor real-time performance and narrow applications a new method is proposed to eliminate network voice noise based on deep learning of conditional generation adversarial network. In terms of the perceptual evaluation of speech quality (PESQ) and shorttime objective intelligibility measure (STOI) functions used as the loss function in the neural network, which were used as the loss function in the neural network, the flexibility of the whole network was optimized, and the training process of the model simplified. The experimental results indicate that, under the noisy environment, especially in a restaurant, the proposed noise reduction scheme improves the STOI score by 26.23% and PESQ score by 17.18%, respectively, compared with the traditional Wiener noise reduction algorithm. Therefore, the sound sensor’s noise reduction scheme through our approach has achieved a remarkable noise reduction effect, more useful information transmission, and stronger practicability.