z-logo
open-access-imgOpen Access
The Application of Hidden Markov Models in Speech Recognition
Author(s) -
Mark Gales,
Steve Young
Publication year - 2007
Publication title -
foundations and trends® in signal processing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 4.611
H-Index - 18
eISSN - 1932-8354
pISSN - 1932-8346
DOI - 10.1561/2000000004
Subject(s) - hidden markov model , speech recognition , computer science , markov model , artificial intelligence , pattern recognition (psychology) , markov chain , machine learning
Hidden Markov Models (HMMs) provide a simple and effective framework for modelling time-varying spectral vector sequences. As a consequence, almost all present day large vocabulary continuous speech recognition (LVCSR) systems are based on HMMs. Whereas the basic principles underlying HMM-based LVCSR are rather straightforward, the approximations and simplifying assumptions involved in a direct implementation of these principles would result in a system which has poor accuracy and unacceptable sensitivity to changes in operating environment. Thus, the practical application of HMMs in modern systems involves considerable sophistication. The aim of this review is first to present the core architecture of a HMM-based LVCSR system and then describe the various refinements which are needed to achieve state-of-the-art performance. These refinements include feature projection, improved covariance modelling, discriminative parameter estimation, adaptation and normalisation, noise compensation and multi-pass system combination. The review concludes with a case study of LVCSR for Broadcast News and Conversation transcription in order to illustrate the techniques described.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom