z-logo
open-access-imgOpen Access
Comparison performance of machine learning and geostatistical methods for the interpolation of monthly air temperature over Costa Rica
Author(s) -
Mauro Mendez,
Luis-Alexander Calvo-Valverde
Publication year - 2020
Publication title -
iop conference series. earth and environmental science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.179
H-Index - 26
eISSN - 1755-1307
pISSN - 1755-1315
DOI - 10.1088/1755-1315/432/1/012011
Subject(s) - kriging , homoscedasticity , mean squared error , mathematics , correlation coefficient , statistics , digital elevation model , cross validation , interpolation (computer graphics) , multivariate interpolation , mean absolute error , elevation (ballistics) , heteroscedasticity , algorithm , computer science , artificial intelligence , remote sensing , geography , geometry , bilinear interpolation , motion (physics)
The performance of three machine learning (ML) methods; cubist regression (CR), random forest (RF) and generalized additive model using splines (GAM) in generating monthly air temperature grids over Costa Rica was evaluated against two heavily used geostatistical methods; ordinary kriging (OK) and kriging with external drift (KED). The skill of the interpolation methods was evaluated using a 10-fold cross-validation technique; selecting the root-mean square error (RMSE), the mean absolute error (MAE) and the Pearson correlation-coefficient (R) as agreement metrics. To this purpose, data from an irregularly-distributed observational-network comprised by 73 weather-stations were selected for the period 1950-1987. Several spatial fields derived from a high-resolution digital elevation model (DEM) were tested as covariants. Results from the 10-fold cross-validation test show that CR yielded the best individual performance followed by KED, whereas GAM performed worst. Elevation on the other hand, was the only covariant ultimately incorporated in the interpolation process, since the remaining spatial fields exhibited poor correlation with temperature or resulted in data redundancy. While the quantitative and qualitative evaluation of CR and KED can be said to be comparable, CR is considered the best approach since the method is unaffected by assumptions on data normality and homoscedasticity.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here