z-logo
open-access-imgOpen Access
Real-Time Single-Channel Deep Neural Network-Based Speech Enhancement on Edge Devices
Author(s) -
Nikhil Shankar,
Gautam Shreedhar Bhat,
Issa Panahi
Publication year - 2020
Publication title -
interspeech 2022
Language(s) - English
Resource type - Conference proceedings
SCImago Journal Rank - 0.689
H-Index - 100
pISSN - 2308-457X
DOI - 10.21437/interspeech.2020-1901
Subject(s) - pesq , computer science , speech recognition , recurrent neural network , convolutional neural network , deep learning , speech enhancement , artificial neural network , time delay neural network , artificial intelligence , intelligibility (philosophy) , frame (networking) , noise reduction , computer network , philosophy , epistemology
In this paper, we present a deep neural network architecture comprising of both convolutional neural network (CNN) and recurrent neural network (RNN) layers for real-time single-channel speech enhancement (SE). The proposed neural network model focuses on enhancing the noisy speech magnitude spectrum on a frame-by-frame process. The developed model is implemented on the smartphone (edge device), to demonstrate the real-time usability of the proposed method. Perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility (STOI) test results are used to compare the proposed algorithm to previously published conventional and deep learning-based SE methods. Subjective ratings show the performance improvement of the proposed model over the other baseline SE methods.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom