A Comparison between Syllable, Di-Phone, and Phoneme-based Myanmar Speech Synthesis
Author(s) -
Aye Thida,
Chaw Su Hlaing
Publication year - 2018
Publication title -
international journal of information technology and computer science
Language(s) - English
Resource type - Journals
eISSN - 2074-9015
pISSN - 2074-9007
DOI - 10.5815/ijitcs.2018.11.06
Subject(s) - computer science , concatenation (mathematics) , phone , speech recognition , speech synthesis , syllable , natural language processing , quality (philosophy) , word (group theory) , speech corpus , speech processing , linguistics , mathematics , arithmetic , philosophy , epistemology
Among the speech synthesis approach, concatenative method is one of the most popular method which can produce more natural sounding speech output. The most important challenge in this method is choosing an appropriate unit for creating a database. The present used speech units are word, syllable, di-phone, tri-phone and phoneme. The speech quality may be trade-off between the selected speech units. This paper presents the three speech synthesis system of Myanmar language, respectively based on syllable, di-phone and phoneme speech units by using concatenation method. Then, we compare the speech quality of the three systems, using the subjective tests.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom