The Investigation of Different Loss Functions with Capsule Networks for Speech Emotion Recognition
Author(s) -
Anfernee Joan B. Ng,
Kunhong Liu
Publication year - 2021
Publication title -
scientific programming
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.269
H-Index - 36
eISSN - 1875-919X
pISSN - 1058-9244
DOI - 10.1155/2021/9916915
Subject(s) - spectrogram , computer science , speech recognition , capsule , encode , pattern recognition (psychology) , image (mathematics) , emotion recognition , artificial intelligence , biochemistry , chemistry , botany , biology , gene
Speech emotion recognition (SER) is an important research topic. Image features like spectrograms are one of the common ways of extracting information from speech. In the area of image recognition, a relatively novel type of network called capsule networks has shown good and promising results. This study aims to use capsule networks to encode spatial information from spectrograms and analyse its performance when paired with different loss functions. Experiments comparing the capsule network with models from previous works show that the capsule network performs better than them.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom