An Improved Unsupervised Single‐Channel Speech Separation Algorithm for Processing Speech Sensor Signals | Zendy

Dazhi Jiang | Zendy; Zhihui He | Zendy; Yingqing Lin | Zendy; Yifei Chen | Zendy; Linyan Xu | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

An Improved Unsupervised Single‐Channel Speech Separation Algorithm for Processing Speech Sensor Signals

Author(s) -

Dazhi Jiang,

Zhihui He,

Yingqing Lin,

Yifei Chen,

Linyan Xu

Publication year - 2021

Publication title -

wireless communications and mobile computing

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.42

H-Index - 64

eISSN - 1530-8677

pISSN - 1530-8669

DOI - 10.1155/2021/6655125

Subject(s) - computer science , speech recognition , speech processing , non negative matrix factorization , voice activity detection , speech enhancement , timit , blind signal separation , signal processing , channel (broadcasting) , source separation , noise (video) , signal (programming language) , artificial intelligence , matrix decomposition , hidden markov model , digital signal processing , noise reduction , computer network , eigenvalues and eigenvectors , physics , image (mathematics) , quantum mechanics , computer hardware , programming language

As network supporting devices and sensors in the Internet of Things are leaping forward, countless real-world data will be generated for human intelligent applications. Speech sensor networks, an important part of the Internet of Things, have numerous application needs. Indeed, the sensor data can further help intelligent applications to provide higher quality services, whereas this data may involve considerable noise data. Accordingly, speech signal processing method should be urgently implemented to acquire low-noise and effective speech data. Blind source separation and enhancement technique refer to one of the representative methods. However, in the unsupervised complex environment, in the only presence of a single-channel signal, many technical challenges are imposed on achieving single-channel and multiperson mixed speech separation. For this reason, this study develops an unsupervised speech separation method CNMF+JADE, i.e., a hybrid method combined with Convolutional Non-Negative Matrix Factorization and Joint Approximative Diagonalization of Eigenmatrix. Moreover, an adaptive wavelet transform-based speech enhancement technique is proposed, capable of adaptively and effectively enhancing the separated speech signal. The proposed method is aimed at yielding a general and efficient speech processing algorithm for the data acquired by speech sensors. As revealed from the experimental results, in the TIMIT speech sources, the proposed method can effectively extract the target speaker from the mixed speech with a tiny training sample. The algorithm is highly general and robust, capable of technically supporting the processing of speech signal acquired by most speech sensors.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research