A Temporal Network of Support Vector Machine Classifiers for the Recognition of Visual Speech
Author(s) -
Mihaela Gordan,
Constantine Kotropoulos,
Ioannis Pitas
Publication year - 2002
Publication title -
Lecture Notes in Computer Science
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.249
H-Index - 400
eISSN - 1611-3349
pISSN - 0302-9743
DOI - 10.1007/3-540-46014-4_32
Subject(s) - computer science, speech recognition, viseme, support vector machine, artificial intelligence, generalization, pattern recognition (psychology), viterbi algorithm, field (mathematics), hidden markov model, speech processing, acoustic model, mathematical analysis, mathematics, pure mathematics
Speech recognition based on visual information is an emerging research field. We propose a new system for the recognition of visual speech based on support vector machines, which have proved to be powerful classifiers in other visual tasks. We use support vector machines to recognize the mouth shapes corresponding to the different phones produced. To model the temporal character of speech, we employ Viterbi decoding in a network of support vector machines. The recognition rate obtained is higher than those reported earlier when the same features were used. Because it relies on viseme models rather than entire-word models, the proposed solution generalizes easily to large-vocabulary recognition tasks.
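
The abstract does not give the details of the decoding step, so the following is only a minimal sketch of generic Viterbi decoding over per-frame classifier scores, not the authors' implementation. It assumes that each viseme classifier's output has already been converted to a log-score per frame (the conversion method is not stated in the abstract) and that the word model is a simple left-to-right chain of two visemes. The function name `viterbi`, the transition values, and the toy scores are all hypothetical and chosen purely for illustration.

```python
import math

NEG_INF = float("-inf")


def viterbi(log_emis, log_trans, log_init):
    """Return (best_log_score, best_state_path) for one state network.

    log_emis[t][s]  : log-score of state (viseme) s at frame t,
                      assumed to come from a calibrated classifier output
    log_trans[p][s] : log-probability of moving from state p to state s
    log_init[s]     : log-probability of starting in state s
    """
    n_states = len(log_init)
    # Scores of the best partial paths ending in each state at frame 0.
    score = [log_init[s] + log_emis[0][s] for s in range(n_states)]
    backptrs = []
    for t in range(1, len(log_emis)):
        new_score, ptr = [], []
        for s in range(n_states):
            # Best predecessor of state s at this frame.
            best_prev = max(range(n_states),
                            key=lambda p: score[p] + log_trans[p][s])
            new_score.append(score[best_prev] + log_trans[best_prev][s]
                             + log_emis[t][s])
            ptr.append(best_prev)
        score = new_score
        backptrs.append(ptr)
    # Trace back from the best final state.
    last = max(range(n_states), key=lambda s: score[s])
    path = [last]
    for ptr in reversed(backptrs):
        path.append(ptr[path[-1]])
    path.reverse()
    return score[last], path


# Toy example (all numbers are illustrative assumptions): a word made of
# two visemes, observed over four video frames.
frame_probs = [[0.9, 0.1], [0.8, 0.2], [0.3, 0.7], [0.1, 0.9]]
log_emis = [[math.log(p) for p in frame] for frame in frame_probs]
log_init = [0.0, NEG_INF]                       # must start in viseme 0
log_trans = [[math.log(0.5), math.log(0.5)],    # stay in 0 or advance to 1
             [NEG_INF, 0.0]]                    # viseme 1 cannot go back

best_score, best_path = viterbi(log_emis, log_trans, log_init)
print(best_path, best_score)
```

In a word recognizer built this way, one such network would be scored per vocabulary word and the word with the highest path score selected; adding a new word then only requires a new chain over the existing viseme classifiers, which is the generalization advantage the abstract describes.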
