z-logo
open-access-imgOpen Access
Enhancing robustness of speech recognizers by bimodal features
Author(s) -
Inge Gavăt,
Gabriel Costache,
Claudia Iancu
Publication year - 2006
Publication title -
facta universitatis - series electronics and energetics
Language(s) - English
Resource type - Journals
eISSN - 2217-5997
pISSN - 0353-3670
DOI - 10.2298/fuee0602287g
Subject(s) - robustness (evolution) , speech recognition , computer science , concatenation (mathematics) , pattern recognition (psychology) , artificial intelligence , speech processing , feature extraction , feature (linguistics) , mathematics , linguistics , philosophy , combinatorics , gene , biochemistry , chemistry
In this paper a robust speech recognizer is presented based on features ob- tained from the speech signal and also from the image of the speaker. The features were combined by simple concatenation, resulting composed feature vectors to train the models corresponding to each class. For recognition, the classification process relies on a very effective algorithm, namely the multiclass SVM. Under additive noise conditions the bimodal system based on combined features acts better than the uni- modal system, based only on the speech features, the added information obtained from the image playing an important role in robustness improvement.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom