Using Ensemble Models to Classify the Sentiment Expressed in Suicide Notes
Author(s) -
James A. McCart,
Dezon Finch,
Jay Jarman,
Edward J. Hickling,
Jason Lind,
Matthew R. Richardson,
Donald J. Berndt,
Stephen L. Luther
Publication year - 2012
Publication title -
biomedical informatics insights
Language(s) - English
Resource type - Journals
ISSN - 1178-2226
DOI - 10.4137/bii.s8931
Subject(s) - task (project management) , informatics , natural language processing , computer science , artificial intelligence , sentiment analysis , psychology , machine learning , engineering , electrical engineering , systems engineering
In 2007, suicide was the tenth leading cause of death in the U.S. Given the significance of this problem, suicide was the focus of the 2011 Informatics for Integrating Biology and the Bedside (i2b2) Natural Language Processing (NLP) shared task competition (track two). Specifically, the challenge concentrated on sentiment analysis, predicting the presence or absence of 15 emotions (labels) simultaneously in a collection of suicide notes spanning over 70 years. Our team explored multiple approaches combining regular expression-based rules, statistical text mining (STM), and an approach that applies weights to text while accounting for multiple labels. Our best submission used an ensemble of both rules and STM models to achieve a micro-averaged F(1) score of 0.5023, slightly above the mean from the 26 teams that competed (0.4875).
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom