z-logo
open-access-imgOpen Access
Double sparse learning model for speech emotion recognition
Author(s) -
Zong Yuan,
Zheng Wenming,
Cui Zhen,
Li Qiang
Publication year - 2016
Publication title -
electronics letters
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.375
H-Index - 146
ISSN - 1350-911X
DOI - 10.1049/el.2016.1211
Subject(s) - digital subscriber line , computer science , pyramid (geometry) , novelty , scheme (mathematics) , task (project management) , key (lock) , feature extraction , speech recognition , artificial intelligence , feature (linguistics) , pattern recognition (psychology) , machine learning , engineering , mathematics , psychology , social psychology , telecommunications , mathematical analysis , linguistics , philosophy , geometry , computer security , systems engineering
A double sparse learning (DSL) model with a pyramid structure‐based feature extraction scheme to handle speech emotion recognition (SER) problem is proposed. The key novelty of the method is that the proposed DSL model is able to take into consideration two scales of the pyramid structure‐based features for selecting the features which have great contributions to SER. Extensive experiments on eNTERFACE and AFEW emotion databases to evaluate the method are conducted. The experimental results demonstrate that, compared with some recent competitive methods, DSL with the pyramid structure‐based feature extraction scheme has a more promising performance in dealing with the SER task.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here