Open Access
Inverse filter based excitation model for HMM‐based speech synthesis system
Author(s) - Reddy Mittapalle Kiran, Rao Krothapalli Sreenivasa
Publication year - 2018
Publication title - IET Signal Processing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.384
H-Index - 42
eISSN - 1751-9683
pISSN - 1751-9675
DOI - 10.1049/iet-spr.2017.0546
Subject(s) - hidden Markov model, computer science, excitation, signal (programming language), residual, inverse filter, speech recognition, speech synthesis, filter (signal processing), inverse, quality (philosophy), algorithm, artificial intelligence, mathematics, physics, geometry, quantum mechanics, computer vision, programming language
Speech generated by hidden Markov model (HMM)‐based speech synthesis systems (HTS) still sounds buzzy because the excitation signal is modelled poorly. This study proposes an efficient excitation modelling approach to improve the quality of HTS. In the proposed method, the residual signal obtained by inverse filtering is parameterised as excitation features, and HMMs are used to model these excitation parameters. During synthesis, the excitation signal is constructed by overlap-adding natural residual segments and is then modified to match the target source features generated from the HMMs. The proposed approach is incorporated into HTS. Performance evaluation results indicate that the proposed method improves synthesis quality and outperforms the state‐of‐the‐art approaches used for modelling the excitation signal.
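
The two signal-processing steps named in the abstract, inverse filtering to obtain a residual and overlap-adding residual segments, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: it assumes a linear-prediction (LPC) inverse filter estimated with the autocorrelation method, and a fixed hop between segments rather than whatever segment placement the paper uses. The function names (`lpc_coefficients`, `inverse_filter`, `overlap_add`) are hypothetical, and the paper's parameterisation of the residual and its HMM-driven modification step are not shown.

```python
def autocorrelation(frame, max_lag):
    """Autocorrelation of a frame for lags 0..max_lag."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc_coefficients(frame, order):
    """Levinson-Durbin recursion (assumed estimator, not from the paper).

    Returns [1, a1, ..., ap], the coefficients of the inverse filter
    A(z) = 1 + a1*z^-1 + ... + ap*z^-p.
    """
    r = autocorrelation(frame, order)
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # frame is silent or already perfectly predicted
            break
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        updated = a[:]
        for j in range(1, i):
            updated[j] = a[j] + k * a[i - j]
        updated[i] = k
        a = updated
        err *= 1.0 - k * k
    return a


def inverse_filter(signal, a):
    """Apply A(z) as an FIR filter: e[n] = sum_j a[j] * x[n - j]."""
    return [sum(a[j] * signal[n - j] for j in range(len(a)) if n - j >= 0)
            for n in range(len(signal))]


def overlap_add(segments, hop):
    """Sum segments placed `hop` samples apart (fixed hop is an assumption)."""
    if not segments:
        return []
    total = max(i * hop + len(seg) for i, seg in enumerate(segments))
    out = [0.0] * total
    for i, seg in enumerate(segments):
        for j, value in enumerate(seg):
            out[i * hop + j] += value
    return out


# Toy signal: impulse response of the all-pole filter 1 / (1 - 0.9 z^-1).
toy_signal = [0.9 ** n for n in range(200)]
toy_a = lpc_coefficients(toy_signal, order=1)
toy_residual = inverse_filter(toy_signal, toy_a)
toy_excitation = overlap_add([[1.0] * 4, [1.0] * 4], hop=2)
```

On the toy signal, the estimated inverse filter is close to `1 - 0.9 z^-1`, so the residual collapses to roughly a single impulse. In a real system, segments passed to `overlap_add` would normally be windowed first so that the overlapping regions sum smoothly.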