Detecting Spatial Patterns of Disease in Large Collections of Electronic Medical Records Using Neighbor-Based Bootstrapping
Author(s) -
Maria T. Patterson,
Robert L. Grossman
Publication year - 2017
Publication title -
big data
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.774
H-Index - 27
eISSN - 2167-647X
pISSN - 2167-6461
DOI - 10.1089/big.2017.0028
Subject(s) - geospatial analysis , bootstrapping (finance) , spatial analysis , spatial epidemiology , computer science , data mining , geography , statistics , cartography , econometrics , mathematics , epidemiology , medicine
We introduce a method called neighbor-based bootstrapping (NB2) that can be used to quantify the geospatial variation of a variable. We applied this method to an analysis of the incidence rates of disease from electronic medical record data (International Classification of Diseases, Ninth Revision codes) for ∼100 million individuals in the United States over a period of 8 years. We considered the incidence rate of disease in each county and its geospatially contiguous neighbors and rank ordered diseases in terms of their degree of geospatial variation as quantified by the NB2 method. We show that this method yields results in good agreement with established methods for detecting spatial autocorrelation (Moran's I method and kriging). Moreover, the NB2 method can be tuned to identify both large area and small area geospatial variations. This method also applies more generally in any parameter space that can be partitioned to consist of regions and their neighbors.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom