A Modified Algorithm for Multiple Input Spectrogram Inversion | Zendy

Dongxiao Wang | Zendy; Hirokazu Kameoka | Zendy; Koichi Shinoda | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

A Modified Algorithm for Multiple Input Spectrogram Inversion

Author(s) -

Dongxiao Wang,

Hirokazu Kameoka,

Koichi Shinoda

Publication year - 2019

Publication title -

interspeech 2022

Language(s) - English

Resource type - Conference proceedings

DOI - 10.21437/interspeech.2019-3242

Subject(s) - spectrogram , waveform , inversion (geology) , algorithm , constraint (computer aided design) , computer science , signal (programming language) , source separation , speech recognition , mathematics , telecommunications , paleontology , structural basin , radar , geometry , biology , programming language

We propose a new algorithm to estimate the phase of speech signal in the mixture of audio sources under the assumption that the magnitude spectrum of each source is given. The previous method, multiple input spectrogram inversion algorithm (MISI), often performs poorly when the magnitude spectrograms estimated are not accurate. This may be because it imposes a strict constraint that the summation of source waveforms should be exactly the same as the mixture waveform. Our proposing algorithm employs a new objective function in which this constraint is relaxed. In this objective function, the difference between the summation of source waveforms and the mixture waveform is the target to be minimized. The performance of our method, modified MISI is evaluated on two different experimental settings. In both settings it improves the audio source separation performance compared to MISI.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research