Practically Efficient Blind Speech Separation Using Frequency Band Selection Based on Magnitude Squared Coherence and a Small Dodecahedral Microphone Array
Author(s) -
Kazunobu Kondo,
Yusuke Mizuno,
Takanori Nishino,
Kazuya Takeda
Publication year - 2012
Publication title -
journal of electrical and computer engineering
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.318
H-Index - 25
eISSN - 2090-0155
pISSN - 2090-0147
DOI - 10.1155/2012/324398
Subject(s) - microphone , coherence (philosophical gambling strategy) , frequency band , microphone array , computer science , blind signal separation , computational complexity theory , distortion (music) , reduction (mathematics) , acoustics , frequency domain , signal processing , speech recognition , algorithm , mathematics , loudspeaker , physics , telecommunications , computer vision , amplifier , channel (broadcasting) , statistics , geometry , radar , bandwidth (computing)
Small agglomerative microphone array systems have been proposed for use with speech communication and recognition systems. Blind source separation methods based on frequency domain independent component analysis have shown significant separation performance, and the microphone arrays are small enough to make them portable. However, the level of computational complexity involved is very high because the conventional signal collection and processing method uses 60 microphones. In this paper, we propose a band selection method based on magnitude squared coherence. Frequency bands are selected based on the spatial and geometric characteristics of the microphone array device which is strongly related to the dodecahedral shape, and the selected bands are nonuniformly spaced. The estimated reduction in the computational complexity is 90% with a 68% reduction in the number of frequency bands. Separation performance achieved during our experimental evaluation was 7.45 (dB) (signal-to-noise ratio) and 2.30 (dB) (cepstral distortion). These results show improvement in performance compared to the use of uniformly spaced frequency band
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom