
Binaural sound source localization based on weighted template matching
Author(s) -
Liu Hong,
Sun Yongheng,
Yang Ge,
Chen Yang
Publication year - 2021
Publication title -
caai transactions on intelligence technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.613
H-Index - 15
ISSN - 2468-2322
DOI - 10.1049/cit2.12009
Subject(s) - binaural recording , weighting , sound localization , signal (programming language) , a weighting , acoustic source localization , computer science , interaural time difference , speech recognition , feature (linguistics) , matching (statistics) , pattern recognition (psychology) , template matching , sound (geography) , acoustics , algorithm , mathematics , artificial intelligence , statistics , physics , linguistics , philosophy , image (mathematics) , programming language
In robot binaural sound source localization (SSL), locating the direction of the sound source accurately in the shortest time is important. It refers to the algorithm complexity, but even more to the shortest duration of the required signal. A novel binaural SSL method based on feature and frequency weighting is proposed. More specifically, in the training stage, the direction‐related interaural cross‐correlation function(CCF) and interaural intensity difference(IID) in each frequency band are calculated under noiseless conditions, which are considered the templates. In the testing stage, first the cosine similarities between the CCF and IID of the test signal and templates are calculated in all features and frequency bands. Then, the direction likelihood can be obtained by weighting the similarities. Finally, the direction with maximum likelihood is specified as the direction of the sound source. Experiments were carried out on CIPIC dataset subject 003 with different noises in the noisex‐92 dataset and demonstrated that the method can accurately locate the sound source with a short signal duration.