A Discriminative Segmental Speech Model and Its Application to Hungarian Number Recognition
Author(s) -
László Tóth,
András Kocsor,
Kornél Kovács
Publication year - 2000
Publication title -
lecture notes in computer science
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.249
H-Index - 400
eISSN - 1611-3349
pISSN - 0302-9743
ISBN - 3-540-41042-2
DOI - 10.1007/3-540-45323-7_52
Subject(s) - computer science , speech recognition , discriminative model , hidden markov model , artificial intelligence , pruning , pattern recognition (psychology) , utterance , segmentation , maximum a posteriori estimation , word (group theory) , artificial neural network , maximum likelihood , linguistics , statistics , philosophy , mathematics , agronomy , biology
This paper presents a stochastic segmental speech recogniser that models the a posteriori probabilities directly. The main issues concerning the system are segmental phoneme classification, utterance-level aggregation and the pruning of the search space. For phoneme classification, artificial neural networks and support vector machines are applied. Phonemic segmentation and utterance-level aggregation is performed with the aid of anti-phoneme modelling. At the phoneme level, the system convincingly outperforms the HMM system trained on the same corpus, while at the word level it attains the performance of the HMM system trained without embedded training.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom