Efficient Model for Numerical Text-To-Speech Synthesis System in Marathi, Hindi and English Languages
Author(s) -
G. D. Ramteke,
R. J. Ramteke
Publication year - 2017
Publication title -
international journal of image graphics and signal processing
Language(s) - English
Resource type - Journals
eISSN - 2074-9082
pISSN - 2074-9074
DOI - 10.5815/ijigsp.2017.03.01
Subject(s) - computer science , marathi , hindi , concatenation (mathematics) , speech recognition , intelligibility (philosophy) , speech synthesis , natural language processing , active listening , artificial intelligence , linguistics , arithmetic , mathematics , philosophy , communication , epistemology , sociology
The paper proposes a numerical TTSsynthesis system in Marathi, Hindi and English languages. The system is in audible forms based on the sounds generated from several numeric units. A hybrid technique is newly launched for a numerical text-to-speech technology. The technique is divided into different phases. These numerical phases include pre-processing, numeric unit detection, numeric and speech unit matching; speech unit concatenation and speech generation. In order to enhance the syntactic generation of audible forms in three languages, two discipline tests were performed. The prosodic test has been obtained for evaluating on the statistical readings. Overall quality issue (OQI) test is a subjective test which is performed by various persons who are aware of three mentioned languages. On the basis of two distinct parameters of OQI test, all scores are positively provided. Initial parameter compromises with listening quality. The second parameter, awareness rate improves a level of the intelligibility. The ultimate satisfactory results of artificial sound generation in three unrelated languages were touched to humankind voice.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom