Building a natural sounding Text-To-Speech system for the Nepali language: research and development challenges and solutions

Author(s) - Roop Shree Ratna Bajracharya, Santosh Regmi, Bal Krishna Bal, Balaram Prasain

Open Access

Publication year - 2019

Publication title - Gipan

Language(s) - English

Resource type - Journals

eISSN - 2795-1561

pISSN - 2594-3456

DOI - 10.3126/gipan.v4i0.35461

Subject(s) - Nepali, speech synthesis, computer science, natural language, linguistics, speech recognition, natural language processing, artificial intelligence

Text-to-Speech (TTS) synthesis has come a long way from its early monotone synthetic voices to voices that sound more natural and intelligible. One direct application of natural-sounding TTS systems is screen readers for the visually impaired and the blind community. The Festival Speech Synthesis System uses concatenative speech synthesis together with unit selection to generate a natural-sounding voice. This work primarily gives an account of the efforts made towards developing a natural-sounding TTS system for Nepali using the Festival system. We also shed light on the issues faced and the solutions derived, many of which are likely to apply to other similarly under-resourced languages in the region.
