Decision-directed speech power spectral density matrix estimation for multichannel speech enhancement | Zendy

Yu Jin | Zendy; Jong Won Shin | Zendy; Nam Soo Kim | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Decision-directed speech power spectral density matrix estimation for multichannel speech enhancement

Author(s) -

Yu Jin,

Jong Won Shin,

Nam Soo Kim

Publication year - 2017

Publication title -

the journal of the acoustical society of america

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.619

H-Index - 187

eISSN - 1520-8524

pISSN - 0001-4966

DOI - 10.1121/1.4977098

Subject(s) - speech enhancement , computer science , estimator , spectral density , noise (video) , speech recognition , noise reduction , reduction (mathematics) , noise power , power (physics) , matrix (chemical analysis) , acoustics , mathematics , artificial intelligence , statistics , physics , telecommunications , materials science , geometry , composite material , quantum mechanics , image (mathematics)

In this letter, a multichannel decision-directed approach to estimate the speech power spectral density (PSD) matrix for multichannel speech enhancement is proposed. There have been attempts to build multichannel speech enhancement filters which depend only on the speech and noise PSD matrices, for which the accurate estimate of the clean speech PSD matrix is crucial for a successful noise reduction. In contrast to the maximum likelihood estimator which has been applied conventionally, the proposed decision-directed method is capable of tracking the time-varying speech characteristics more robustly and improves the noise reduction performance under various noise environments.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research