Overcoming Language Variation in Sentiment Analysis with Social Attention
Author(s) -
Yi Yang,
Jacob Eisenstein
Publication year - 2017
Publication title -
transactions of the association for computational linguistics
Language(s) - English
Resource type - Journals
ISSN - 2307-387X
DOI - 10.1162/tacl_a_00062
Subject(s) - homophily , computer science , variation (astronomy) , exploit , metadata , social media , smoothing , social network (sociolinguistics) , artificial intelligence , sentiment analysis , social network analysis , language model , natural language processing , data science , world wide web , sociology , social science , physics , computer security , astrophysics , computer vision
Variation in language is ubiquitous, particularly in newer forms of writing such as social media. Fortunately, variation is not random; it is often linked to social properties of the author. In this paper, we show how to exploit social networks to make sentiment analysis more robust to social language variation. The key idea is emph{linguistic homophily}: the tendency of socially linked individuals to use language in similar ways. We formalize this idea in a novel attention-based neural network architecture, in which attention is divided among several basis models, depending on the authoru0027s position in the social network. This has the effect of smoothing the classification function across the social network, and makes it possible to induce personalized classifiers even for authors for whom there is no labeled data or demographic metadata. This model significantly improves the accuracies of sentiment analysis on Twitter and review data.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom