Enhancing robustness of speech recognizers by bimodal features | Zendy

Inge Gavăt | Zendy; Gabriel Costache | Zendy; Claudia Iancu | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Enhancing robustness of speech recognizers by bimodal features

Author(s) -

Inge Gavăt,

Gabriel Costache,

Claudia Iancu

Publication year - 2006

Publication title -

facta universitatis - series electronics and energetics

Language(s) - English

Resource type - Journals

eISSN - 2217-5997

pISSN - 0353-3670

DOI - 10.2298/fuee0602287g

Subject(s) - robustness (evolution) , speech recognition , computer science , concatenation (mathematics) , pattern recognition (psychology) , artificial intelligence , speech processing , feature extraction , feature (linguistics) , mathematics , linguistics , philosophy , combinatorics , gene , biochemistry , chemistry

In this paper a robust speech recognizer is presented based on features ob- tained from the speech signal and also from the image of the speaker. The features were combined by simple concatenation, resulting composed feature vectors to train the models corresponding to each class. For recognition, the classification process relies on a very effective algorithm, namely the multiclass SVM. Under additive noise conditions the bimodal system based on combined features acts better than the uni- modal system, based only on the speech features, the added information obtained from the image playing an important role in robustness improvement.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research