Identification of Factors Associated With Variation in US County-Level Obesity Prevalence Rates Using Epidemiologic vs Machine Learning Models
Author(s) -
David Scheinker,
Arelí Valencia,
Fátima Rodríguez
Publication year - 2019
Publication title -
jama network open
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.278
H-Index - 39
ISSN - 2574-3805
DOI - 10.1001/jamanetworkopen.2019.2884
Subject(s) - obesity , socioeconomic status , demography , multivariate statistics , medicine , population , gerontology , environmental health , multivariate analysis , health care , geography , statistics , mathematics , sociology , economics , economic growth
Key Points Question Which factors are associated with county-level variation in obesity prevalence, and how can they be identified using epidemiologic and machine learning methods? Findings This cross-sectional study of 3138 US counties found significant county-level variation in obesity prevalence, with US Census region, median household income, and percentage of population with some college education being most strongly associated with obesity prevalence. Machine learning models explain two-thirds more variation in obesity but were less interpretable than multivariate linear regression models. Meaning Machine learning models of county-level demographic, socioeconomic, health care, and environmental factors explain significantly more variation in obesity prevalence while being less interpretable.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom