
Combining Datasets of Satellite-Retrieved Products. Part I: Methodology and Water Budget Closure
Author(s) -
Filipe Aires
Publication year - 2014
Publication title -
journal of hydrometeorology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.733
H-Index - 123
eISSN - 1525-755X
pISSN - 1525-7541
DOI - 10.1175/jhm-d-13-0148.1
Subject(s) - water cycle , weighting , computer science , evapotranspiration , data integration , satellite , data mining , remote sensing , engineering , medicine , ecology , biology , geology , radiology , aerospace engineering
This study addresses in general terms the problem of the optimal combination of multiple observation datasets. Only satellite-retrieved geophysical parameter datasets are considered here (not the raw satellite observations). This study focuses on the terrestrial water cycle and presents methodologies to obtain a coherent dataset of four water cycle key components: precipitation, evapotranspiration, runoff, and terrestrial water storage. Various innovative “integration” methodologies are introduced: simple weighting (SW), constrained linear (CL), optimal interpolation (OI), and neural networks (NN). The term “integration” will be used here, not “assimilation,” as no model will be included in the data fusion process. A simple postprocessing filtering (PF) step can be used to impose the water cycle budget closure after the integration method. It is shown that this constraint actually improves the estimation of the water cycle components. The integration techniques are tested using real observation data over the Mississippi and Niger basins from satellite and in situ measurements. A Monte Carlo experiment with a synthetic uncertainty perturbation model is used to measure the ability of the SW, OI, and NN, with or without the PF step, to retrieve the four water cycle components. Once the PF closure constraint is added, the methodologies have equivalent accuracies. The need for these types of methodologies should increase in the future since multiple observation datasets are now available and the climate community needs to combine them into a unique, optimal, and coherent dataset of multiple parameters. A companion paper will test these methodologies on satellite observation datasets at the basin and global scales.