
Text to Speech Synthesis using Fraction Based Waveform Concatenation and Optimal Coupling Smoothing Technique
Author(s) - S. Saranya, Dr. A. Rathinavelu, C. Jayashree
Publication year - 2020
Publication title - International Journal of Recent Technology and Engineering
Language(s) - English
Resource type - Journals
ISSN - 2277-3878
DOI - 10.35940/ijrte.a2530.059120
Subject(s) - speech synthesis, computer science, speech recognition, concatenation (mathematics), naturalness, smoothing, intelligibility (philosophy), waveform, segmentation, speech segmentation, artificial intelligence, natural language processing, mathematics, arithmetic, telecommunications, philosophy, radar, physics, epistemology, quantum mechanics, computer vision
A Text to Speech (TTS) system is a speech synthesis application that converts text into speech. This project develops a TTS system for the Tamil language using unit selection synthesis. Letter-level segmentation of the input text reduces the corpus size compared with syllable-level segmentation. The segmented units are retrieved by their Unicode values and concatenated to produce the synthesized speech. Smoothing techniques can improve the intelligibility and naturalness of the spoken output. The Optimal Coupling smoothing technique is implemented to smooth the transitions between concatenated speech segments, producing continuous speech output that resembles a human voice. A fraction-based waveform concatenation method is used to produce intelligible speech segments from the pre-recorded speech database.
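The pipeline described in the abstract (segment the text into letters, look units up by Unicode value, join them with a smoothed boundary) can be illustrated with a minimal Python sketch. It is not the authors' implementation, and every name in it is hypothetical: `segment_letters` assumes a Tamil letter is a base character plus any following combining marks (vowel signs, pulli), `unit_key` assumes units are indexed by hexadecimal code points, and `optimal_coupling_join` uses a simple time-domain squared difference in place of the spectral distance that optimal coupling is usually described with. The fraction-based concatenation step is not detailed in the abstract and is not modeled here.

```python
import unicodedata


def segment_letters(text):
    """Split text into letter units: a base character plus its combining marks."""
    units = []
    for ch in text:
        # Unicode categories Mn/Mc cover Tamil vowel signs and the pulli.
        if units and unicodedata.category(ch).startswith("M"):
            units[-1] += ch
        else:
            units.append(ch)
    return units


def unit_key(unit):
    """Hypothetical database key: the unit's code points in hexadecimal."""
    return "-".join(f"{ord(c):04X}" for c in unit)


def frame_distance(x, y):
    """Squared difference between two equal-length sample frames."""
    return sum((p - q) ** 2 for p, q in zip(x, y))


def optimal_coupling_join(left, right, search=4, frame=3):
    """Join two waveforms at the pair of cut points with the smallest mismatch.

    Instead of cutting at fixed boundaries, candidate cut points near the end
    of `left` and the start of `right` are searched, and the pair whose
    following frames are most similar is chosen (assumed time-domain measure).
    """
    if len(left) < frame or len(right) < frame:
        return left + right  # too short to search; plain concatenation
    best = None
    for i in range(max(0, len(left) - frame - search), len(left) - frame + 1):
        for j in range(0, min(search, len(right) - frame) + 1):
            cost = frame_distance(left[i:i + frame], right[j:j + frame])
            if best is None or cost < best[0]:
                best = (cost, i, j)
    _, i, j = best
    return left[:i] + right[j:]


if __name__ == "__main__":
    word = "\u0ba4\u0bae\u0bbf\u0bb4\u0bcd"  # the word "Tamil" in Tamil script
    for unit in segment_letters(word):
        print(unit, unit_key(unit))
```

In a real system the waveforms would be arrays of recorded samples, and the search window and frame length would be tuned to the sampling rate; the small defaults here are only for illustration.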