z-logo
open-access-imgOpen Access
A Method for Automatic Vietnamese Speech Segmentation
Author(s) -
Phd. Student. Tuan Anh Tran,
Mong Nguyen Huu,
Khanh Nguyen Trong
Publication year - 2019
Publication title -
international journal of innovative technology and exploring engineering
Language(s) - English
Resource type - Journals
ISSN - 2278-3075
DOI - 10.35940/ijitee.k2427.0981119
Subject(s) - vietnamese , segmentation , computer science , speech recognition , speech segmentation , sliding window protocol , set (abstract data type) , process (computing) , voice activity detection , window (computing) , speech processing , artificial intelligence , linguistics , philosophy , programming language , operating system
In speech synthesis and recognition, the segmentation is an important step. The result of further steps depend completely on this process. There are several effective segmentation method in the literature, but for Vietnamese speech, researchers usually base on their experience to set the length while using sliding window. It causes an inefficient segmentation; and they need to try with the other value (length of voice). In this paper, we propose a method supporting in segmentation for Vietnamese speech and automatically determine the suitable length of voices and silent pause. We firstly estimate, by experimenting, the min and average length of a voice and a silent pause for Vietnamese speech in three main type speaking (slow, normal and fast). Then, based on these values, we start to segment the voice and pause by sliding window with proposed algorithm. Experiment results show that the proposed method can be used to effectively segment the Vietnamese speech.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here