z-logo
open-access-imgOpen Access
Multi-Label Prediction for Political Text-as-Data
Author(s) -
Aaron Erlich,
Stefano G. Dantas,
Benjamin E. Bagozzi,
Daniel Berliner,
Brian PalmerRubin
Publication year - 2021
Publication title -
political analysis
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.953
H-Index - 69
eISSN - 1476-4989
pISSN - 1047-1987
DOI - 10.1017/pan.2021.15
Subject(s) - computer science , machine learning , artificial intelligence , code (set theory) , set (abstract data type) , supervised learning , training set , source code , government (linguistics) , data set , association (psychology) , natural language processing , artificial neural network , psychology , linguistics , philosophy , programming language , operating system , psychotherapist
Political scientists increasingly use supervised machine learning to code multiple relevant labels from a single set of texts. The current “best practice” of individually applying supervised machine learning to each label ignores information on inter-label association(s), and is likely to under-perform as a result. We introduce multi-label prediction as a solution to this problem. After reviewing the multi-label prediction framework, we apply it to code multiple features of (i) access to information requests made to the Mexican government and (ii) country-year human rights reports. We find that multi-label prediction outperforms standard supervised learning approaches, even in instances where the correlations among one’s multiple labels are low.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom