Decision-directed speech power spectral density matrix estimation for multichannel speech enhancement
Author(s) -
Yu Jin,
Jong Won Shin,
Nam Soo Kim
Publication year - 2017
Publication title -
the journal of the acoustical society of america
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.619
H-Index - 187
eISSN - 1520-8524
pISSN - 0001-4966
DOI - 10.1121/1.4977098
Subject(s) - speech enhancement , computer science , estimator , spectral density , noise (video) , speech recognition , noise reduction , reduction (mathematics) , noise power , power (physics) , matrix (chemical analysis) , acoustics , mathematics , artificial intelligence , statistics , physics , telecommunications , materials science , geometry , composite material , quantum mechanics , image (mathematics)
In this letter, a multichannel decision-directed approach to estimate the speech power spectral density (PSD) matrix for multichannel speech enhancement is proposed. There have been attempts to build multichannel speech enhancement filters which depend only on the speech and noise PSD matrices, for which the accurate estimate of the clean speech PSD matrix is crucial for a successful noise reduction. In contrast to the maximum likelihood estimator which has been applied conventionally, the proposed decision-directed method is capable of tracking the time-varying speech characteristics more robustly and improves the noise reduction performance under various noise environments.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom