A Comparison between Syllable, Di-Phone, and Phoneme-based Myanmar Speech Synthesis

Open Access

Author(s) - Aye Thida, Chaw Su Hlaing

Publication year - 2018

Publication title - International Journal of Information Technology and Computer Science

Language(s) - English

Resource type - Journals

eISSN - 2074-9015

pISSN - 2074-9007

DOI - 10.5815/ijitcs.2018.11.06

Subject(s) - computer science, concatenation (mathematics), phone, speech recognition, speech synthesis, syllable, natural language processing, quality (philosophy), word (group theory), speech corpus, speech processing, linguistics, mathematics, arithmetic, philosophy, epistemology

Among speech synthesis approaches, the concatenative method is one of the most popular because it can produce more natural-sounding speech output. The most important challenge in this method is choosing an appropriate unit for building the speech database. The speech units currently in use are the word, syllable, di-phone, tri-phone, and phoneme. Speech quality involves a trade-off among these candidate units. This paper presents three speech synthesis systems for the Myanmar language, based respectively on syllable, di-phone, and phoneme speech units, each using the concatenation method. We then compare the speech quality of the three systems using subjective tests.
