z-logo
open-access-imgOpen Access
The Investigation of Different Loss Functions with Capsule Networks for Speech Emotion Recognition
Author(s) -
Anfernee Joan B. Ng,
Kunhong Liu
Publication year - 2021
Publication title -
scientific programming
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.269
H-Index - 36
eISSN - 1875-919X
pISSN - 1058-9244
DOI - 10.1155/2021/9916915
Subject(s) - spectrogram , computer science , speech recognition , capsule , encode , pattern recognition (psychology) , image (mathematics) , emotion recognition , artificial intelligence , biochemistry , chemistry , botany , biology , gene
Speech emotion recognition (SER) is an important research topic. Image features like spectrograms are one of the common ways of extracting information from speech. In the area of image recognition, a relatively novel type of network called capsule networks has shown good and promising results. This study aims to use capsule networks to encode spatial information from spectrograms and analyse its performance when paired with different loss functions. Experiments comparing the capsule network with models from previous works show that the capsule network performs better than them.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom