Binary Classifiers and Latent Sequence Models for Emotion Detection in Suicide Notes
Author(s) -
Colin Cherry,
Saif M. Mohammad,
Berry de Bruijn
Publication year - 2012
Publication title -
biomedical informatics insights
Language(s) - English
Resource type - Journals
ISSN - 1178-2226
DOI - 10.4137/bii.s8933
Subject(s) - sentence , computer science , classifier (uml) , artificial intelligence , task (project management) , margin (machine learning) , binary classification , sequence (biology) , natural language processing , sentiment analysis , machine learning , support vector machine , management , biology , economics , genetics
This paper describes the National Research Council of Canada's submission to the 2011 i2b2 NLP challenge on the detection of emotions in suicide notes. In this task, each sentence of a suicide note is annotated with zero or more emotions, making it a multi-label sentence classification task. We employ two distinct large-margin models capable of handling multiple labels. The first uses one classifier per emotion, and is built to simplify label balance issues and to allow extremely fast development. This approach is very effective, scoring an F-measure of 55.22 and placing fourth in the competition, making it the best system that does not use web-derived statistics or re-annotated training data. Second, we present a latent sequence model, which learns to segment the sentence into a number of emotion regions. This model is intended to gracefully handle sentences that convey multiple thoughts and emotions. Preliminary work with the latent sequence model shows promise, resulting in comparable performance using fewer features.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom