z-logo
Premium
Automatic Region Building for Spatial Analysis
Author(s) -
Guo Diansheng,
Wang Hu
Publication year - 2011
Publication title -
transactions in gis
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.721
H-Index - 63
eISSN - 1467-9671
pISSN - 1361-1682
DOI - 10.1111/j.1467-9671.2011.01269.x
Subject(s) - spurious relationship , data mining , computer science , exploratory data analysis , spatial analysis , exploratory analysis , homogeneity (statistics) , bayes' theorem , cartography , geography , bayesian probability , data science , artificial intelligence , machine learning , remote sensing
High‐resolution spatial data have become increasingly available with modern data collection techniques and efforts. However, it is often inappropriate to use the default geographic units to perform spatial analysis due to unstable estimates with small areas (e.g. cancer rates for census blocks or tracts). Regionalization is aggregating small units into relatively larger areas while optimizing a homogeneity measure (such as the sum of squared differences). For exploratory spatial analysis, regionalization may help remove spurious data variation through aggregation and discover hidden patterns in data (such as areas of unusually high cancer rates). Towards this goal, this research introduces several improvements to a recent group of regionalization methods – REDCAP (Guo 2008) and conducts evaluation experiments with synthetic data sets to assess and compare the capability of regionalization methods for exploratory spatial analysis. One of the major improvements is the integration of a local empirical Bayes smoother (EBS) with the regionalization methods. We generate a large number of synthetic data sets with controlled spatial patterns to evaluate the performance of both new and existing methods. Evaluation results show that the new methods (integrated with EBS) perform significantly better than their original versions and other methods (including the EBS method on its own) in terms of detecting the true patterns in the synthetic data sets.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here