z-logo
open-access-imgOpen Access
Building a natural sounding Text-To-Speech system for the Nepali language: research and development challenges and solutions
Author(s) -
Roop Shree Ratna Bajracharya,
Santosh Regmi,
Bal Krishna Bal,
Balaram Prasain
Publication year - 2019
Publication title -
gipan
Language(s) - English
Resource type - Journals
eISSN - 2795-1561
pISSN - 2594-3456
DOI - 10.3126/gipan.v4i0.35461
Subject(s) - nepali , depth sounding , natural (archaeology) , speech synthesis , computer science , natural language , process (computing) , linguistics , speech recognition , natural language processing , artificial intelligence , geography , cartography , philosophy , archaeology , operating system
Text-to-Speech (TTS) synthesis has come far from its primitive synthetic monotone voices to more natural and intelligible sounding voices. One of the direct applications of a natural sounding TTS systems is the screen reader applications for the visually impaired and the blind community. The Festival Speech Synthesis System uses a concatenative speech synthesis method together with the unit selection process to generate a natural sounding voice. This work primarily gives an account of the efforts put towards developing a Natural sounding TTS system for Nepali using the Festival system. We also shed light on the issues faced and the solutions derived which can be quite overlapping across other similar under-resourced languages in the region.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here