z-logo
open-access-imgOpen Access
Learning Salient Features for Multimodal Emotion Recognition with Recurrent Neural Networks and Attention Based Fusion
Author(s) -
Darshana Priyasad,
Tharindu Fernando,
Simon Denman,
Sridha Sridharan,
Clinton Fookes
Publication year - 2019
Publication title -
qut eprints (queensland university of technology)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.21437/avsp.2019-5
Subject(s) - salient , computer science , emotion recognition , artificial intelligence , artificial neural network , recurrent neural network , pattern recognition (psychology) , speech recognition , machine learning
Automatic emotion recognition is a challenging task since emotion is communicated through different modalities. Deep Convolution Neural Networks (DCNN) and transfer learning have shown success in automatic emotion recognition using different modalities. However significant improvement in accuracy is still required for practical applications. Existing methods are still not effective in modelling the temporal relationships within emotional expressions or in identifying the salient features from different modes and fusing them to improve accuracies. In this paper, we present an automatic emotion recognition system using audio and visual modalities. VGG19 models are used to capture frame level facial features followed by a Long Short Term Memory (LSTM) to capture their temporal distribution at a segment level. A separate VGG19 model captures auditory features from Mel Frequency Cepstral Coefficients (MFCC). The extracted auditory and visual features are fused together and a Deep Neural Network (DNN) with attention is used in classification using majority voting. Voice Activity Detection (VAD) on the audio stream improves performance by reducing the outliers in learning. The system is evaluated using Leave One Subject Out (LOSO) and K-fold cross-validation and our system outperforms state of the art methods on two challenging databases.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom