Enhancing robustness of speech recognizers by bimodal features
Author(s) -
Inge Gavăt,
Gabriel Costache,
Claudia Iancu
Publication year - 2006
Publication title -
facta universitatis - series electronics and energetics
Language(s) - English
Resource type - Journals
eISSN - 2217-5997
pISSN - 0353-3670
DOI - 10.2298/fuee0602287g
Subject(s) - robustness (evolution) , speech recognition , computer science , concatenation (mathematics) , pattern recognition (psychology) , artificial intelligence , speech processing , feature extraction , feature (linguistics) , mathematics , linguistics , philosophy , combinatorics , gene , biochemistry , chemistry
In this paper a robust speech recognizer is presented based on features ob- tained from the speech signal and also from the image of the speaker. The features were combined by simple concatenation, resulting composed feature vectors to train the models corresponding to each class. For recognition, the classification process relies on a very effective algorithm, namely the multiclass SVM. Under additive noise conditions the bimodal system based on combined features acts better than the uni- modal system, based only on the speech features, the added information obtained from the image playing an important role in robustness improvement.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom