Premium
Predictive mapping with small field sample data using semi‐supervised machine learning
Author(s) -
Du Fei,
Zhu AXing,
Liu Jing,
Yang Lin
Publication year - 2020
Publication title -
transactions in gis
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.721
H-Index - 63
eISSN - 1467-9671
pISSN - 1361-1682
DOI - 10.1111/tgis.12598
Subject(s) - sample (material) , field (mathematics) , representativeness heuristic , computer science , data mining , cluster analysis , artificial intelligence , support vector machine , machine learning , data set , decision tree , supervised learning , pattern recognition (psychology) , statistics , mathematics , artificial neural network , chemistry , chromatography , pure mathematics
Abstract Existing predictive mapping methods usually require a large number of field samples with good representativeness as input to build reliable predictive models. In mapping practice, however, we often face situations when only small sample data are available. In this article, we present a semi‐supervised machine learning approach for predictive mapping in which the natural aggregation (clustering) patterns of environmental covariate data are used to supplement limited samples in prediction. This approach was applied to two soil mapping case studies. Compared with field sample only approaches (decision trees, logistic regression, and support vector machines), maps using the proposed approach can better capture the spatial variation of soil types and achieve higher accuracy with limited samples. A cross validation shows further that the proposed approach is less sensitive to the specific field sample set used and thus more robust when field sample data are small.