z-logo
open-access-imgOpen Access
A Hidden Conditional Random Field-Based Approach for Thai Tone Classification
Author(s) -
Natthawut Kertkeidkachorn,
Proadpran Punyabukkana,
Atiwong Suchato
Publication year - 2014
Publication title -
engineering journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.246
H-Index - 20
ISSN - 0125-8281
DOI - 10.4186/ej.2014.18.3.99
Subject(s) - conditional random field , tone (literature) , field (mathematics) , computer science , speech recognition , pattern recognition (psychology) , artificial intelligence , mathematics , linguistics , pure mathematics , philosophy
In Thai, tonal information is a crucial component for identifying the lexical meaning of a word. Consequently, Thai tone classification can obviously improve performance of Thai speech recognition system. In this article, we therefore reported our study of Thai tone classification. Based on our investigation, most of Thai tone classification studies relied on statistical machine learning approaches, especially the Artificial Neural Network (ANN)-based approach and the Hidden Markov Model (HMM)-based approach. Although both approaches gave reasonable performances, they had some limitations due to their mathematical models. We therefore introduced a novel approach for Thai tone classification using a Hidden Conditional Random Field (HCRF)- based approach. In our study, we also investigated tone configurations involving tone features, frequency scaling and normalization techniques in order to fine-tune performances of Thai tone classification. Experiments were conducted in both isolated word scenario and continuous speech scenario. Results showed that the HCRF-based approach with the feature F_dF_aF, ERB-rate scaling and a z-score normalization technique yielded the highest performance and outperformed a baseline using the ANN- based approach, which had been reported as the best for the Thai tone classification, in both scenarios. The best performance of HCRF-based approach provided the error rate reduction of 10.58% and 12.02% for isolated word scenario and continuous speech scenario respectively when comparing with the best result of baselines.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom