Who Voted in 2016? Using Fuzzy Forests to Understand Voter Turnout
Author(s) - Kim Seoyoung Silvia, Alvarez R. Michael, Ramirez Christina M.
Publication year - 2020
Publication title - Social Science Quarterly
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.482
H-Index - 90
eISSN - 1540-6237
pISSN - 0038-4941
DOI - 10.1111/ssqu.12777
Subject(s) - turnout, random forest, voting, fuzzy logic, computer science, curse of dimensionality, machine learning, variable (mathematics), political science, artificial intelligence, econometrics, politics, psychology, economics, mathematics, mathematical analysis, law
Objective - What can machine learning tell us about who voted in 2016? There are numerous competing voter turnout theories, and a large number of covariates are required to assess which theory best explains turnout. This article is a proof of concept that machine learning can help overcome this curse of dimensionality and reveal important insights in studies of political phenomena.
Methods - We use fuzzy forests, an extension of random forests, to screen variables for a parsimonious but accurate prediction. Fuzzy forests achieve accurate variable importance measures in the face of high‐dimensional and highly correlated data. The data that we use are from the 2016 Cooperative Congressional Election Study.
Results - Fuzzy forests chose only a small number of covariates as major correlates of 2016 turnout and still boasted high predictive performance.
Conclusion - Our analysis provides three important conclusions about turnout in 2016: registration and voting procedures were important, political issues were important (especially Obamacare, climate change, and fiscal policy), but few demographic variables other than age were strongly associated with turnout. We conclude that fuzzy forests is an important methodology for studying overdetermined questions in social sciences.