z-logo
Premium
Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices
Author(s) -
Wang Wei,
Xiao Xiong,
Qian Jian,
Chen Shiqi,
Liao Fang,
Yin Fei,
Zhang Tao,
Li Xiaosong,
Ma Yue
Publication year - 2022
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.9395
Subject(s) - cluster analysis , spatial dependence , spatial analysis , independence (probability theory) , computer science , data mining , econometrics , autoregressive model , spatial econometrics , spatial epidemiology , contrast (vision) , statistic , statistics , mathematics , artificial intelligence , medicine , epidemiology
Most spatial models include a spatial weights matrix ( W ) derived from the first law of geography to adjust the spatial dependence to fulfill the independence assumption. In various fields such as epidemiological and environmental studies, the spatial dependence often shows clustering (or geographic discontinuity) due to natural or social factors. In such cases, adjustment using the first‐law‐of‐geography‐based W might be inappropriate and leads to inaccuracy estimations and loss of statistical power. In this work, we propose a series of data‐driven W s (DDWs) built following the spatial pattern identified by the scan statistic, which can be easily carried out using existing tools such as SaTScan software. The DDWs take both the clustering (or discontinuous) and the intuitive first‐law‐of‐geographic‐based spatial dependence into consideration. Aiming at two common purposes in epidemiology studies (ie, estimating the effect value of explanatory variable X and estimating the risk of each spatial unit in disease mapping), the common spatial autoregressive models and the Leroux‐prior‐based conditional autoregressive (CAR) models were selected to evaluate performance of DDWs, respectively. Both simulation and case studies show that our DDWs achieve considerably better performance than the classic W in datasets with clustering (or discontinuous) spatial dependence. Furthermore, the latest published density‐based spatial clustering models, aiming at dealing with such clustering (or discontinuity) spatial dependence in disease mapping, were also compared as references. The DDWs, incorporated into the CAR models, still show considerable advantage, especially in the datasets for common diseases.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here