z-logo
open-access-imgOpen Access
Emotion Recognition of Manipuri Speech using Convolution Neural Network
Author(s) -
Gurumayum Robert Michael,
Dr Aditya Bihar Kandali.
Publication year - 2020
Publication title -
international journal of recent technology and engineering
Language(s) - English
Resource type - Journals
ISSN - 2277-3878
DOI - 10.35940/ijrte.f9896.059120
Subject(s) - computer science , sadness , mel frequency cepstrum , speech recognition , happiness , human voice , anger , artificial intelligence , convolutional neural network , field (mathematics) , artificial neural network , feature extraction , human–computer interaction , psychology , social psychology , mathematics , psychiatry , pure mathematics
over the recent years much advancement are made in terms of artificial intelligence, machine learning, human-machine interaction etc. Voice interaction with the machine or giving command to it to perform a specific task is increasingly popular. Many consumer electronics are integrated with SIRI, Alexa, cortana, Google assist etc. But machines have limitation that they cannot interact with a person like a human conversational partner. It cannot recognize Human Emotion and react to them. Emotion Recognition from speech is a cutting edge research topic in the Human machines Interaction field. There is a demand to design a more rugged man-machine communication system, as machines are indispensable to our lives. Many researchers are working currently on speech emotion recognition(SER) to improve the man machines interaction. To achieve this goal, a computer should be able to recognize emotional states and react to them in the same way as we humans do. The effectiveness of the speech emotion recognition(SER) system depends on quality of extracted features and the type of classifiers used . In this paper we tried to identify four basic emotions: anger, sadness, neutral, happiness from speech. Here we used audio file of short Manipuri speech taken from movies as training and testing dataset . This paper use CNN to identify four different emotions using MFCC (Mel Frequency Cepstral Coefficient )as features extraction technique from speech.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here