z-logo
open-access-imgOpen Access
Enhancing direct‐path relative transfer function using deep neural network for robust sound source localization
Author(s) -
Yang Bing,
Ding Runwei,
Ban Yutong,
Li Xiaofei,
Liu Hong
Publication year - 2022
Publication title -
caai transactions on intelligence technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.613
H-Index - 15
ISSN - 2468-2322
DOI - 10.1049/cit2.12024
Subject(s) - reverberation , microphone , transfer function , computer science , noise (video) , context (archaeology) , acoustics , channel (broadcasting) , artificial neural network , path (computing) , speech recognition , artificial intelligence , engineering , physics , sound pressure , telecommunications , image (mathematics) , geology , paleontology , electrical engineering , programming language
This article proposes a deep neural network (DNN)‐based direct‐path relative transfer function (DP‐RTF) enhancement method for robust direction of arrival (DOA) estimation in noisy and reverberant environments. The DP‐RTF refers to the ratio between the direct‐path acoustic transfer functions of the two microphone channels. First, the complex‐value DP‐RTF is decomposed into the inter‐channel intensity difference, and sinusoidal functions of the inter‐channel phase difference in the time‐frequency domain. Then, the decomposed DP‐RTF features from a series of temporal context frames are utilized to train a DNN model, which maps the DP‐RTF features contaminated by noise and reverberation to the clean ones, and meanwhile provides a time‐frequency (TF) weight to indicate the reliability of the mapping. The DP‐RTF enhancement network can help to enhance the DP‐RTF against noise and reverberation. Finally, the DOA of a sound source can be estimated by integrating the weighted matching between the enhanced DP‐RTF features and the DP‐RTF templates. Experimental results on simulated data show the superiority of the proposed DP‐RTF enhancement network for estimating the DOA of the sound source in the environments with various levels of noise and reverberation.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here