
Speech Emotion Recognition Using an Enhanced Kernel Isomap for Human-Robot Interaction
Author(s) -
Shiqing Zhang,
Xiaoming Zhao,
Bicheng Lei
Publication year - 2013
Publication title -
international journal of advanced robotic systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.394
H-Index - 46
eISSN - 1729-8814
pISSN - 1729-8806
DOI - 10.5772/55403
Subject(s) - computer science , human–robot interaction , isomap , speech recognition , robot , emotion recognition , artificial intelligence , human–computer interaction , dimensionality reduction , nonlinear dimensionality reduction
Speech emotion recognition is currently an active research subject and has attracted extensive interest in the science community due to its vital application to human-robot interaction. Most speech emotion recognition systems employ high-dimensional speech features, indicating human emotion expression, to improve emotion recognition performance. To effectively reduce the size of speech features, in this paper, a new nonlinear dimensionality reduction method, called ‘enhanced kernel isometric mapping’ (EKIsomap), is proposed and applied for speech emotion recognition in human-robot interaction. The proposed method is used to nonlinearly extract the low-dimensional discriminating embedded data representations from the original high-dimensional speech features with a striking improvement of performance on the speech emotion recognition tasks. Experimental results on the popular Berlin emotional speech corpus demonstrate the effectiveness of the proposed method