A Spectra-Based Equalization-Generation Combined Framework for Throat Microphone Speech Enhancement | Zendy

Changyan Zheng | Zendy; Jibin Yang | Zendy; Xiongwei Zhang | Zendy; Meng Sun | Zendy; Tieyong Cao | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

A Spectra-Based Equalization-Generation Combined Framework for Throat Microphone Speech Enhancement

Author(s) -

Changyan Zheng,

Jibin Yang,

Xiongwei Zhang,

Meng Sun,

Tieyong Cao

Publication year - 2018

Publication title -

ieee access

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.587

H-Index - 127

ISSN - 2169-3536

DOI - 10.1109/access.2018.2879689

Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation

Non-acoustic sensors are widely used in speech signal processing tasks, and their immunity to the background acoustic noise shows great benefits to traditional speech enhancement. To avoid using acoustic speech disturbed by strong noise, spectra mapping from throat microphone (TM) speech to acoustic microphone (AM) speech has been studied. However, there is a distinguished difference between the spectra of the two kinds of speech, and the mapping relationship is different in the low-band and high-band spectra, which limits the performance of the traditional full-band spectra mapping model. In this paper, to improve the perceived quality and intelligibility of TM speech, we investigate the low-band and high-band spectral structure between TM and AM speech, respectively, and propose a spectra-based band-division mapping framework for TM speech enhancement based on the investigation. In the framework, the low-band target spectra of AM speech are mapped based on the equalization method, and the high-band spectra are mapped from the full-band TM speech spectra, which are lack of high-frequency components. The overall framework can be viewed as a combination of spectra equalization in the low band and spectra generation in the high band. Both the objective and subjective evaluation results show clear advantages over the existing TM speech enhancement method based on the full-band spectra mapping.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research