Premium
A Brief Analysis of Conceptual Model Structure Uncertainty Using 36 Models and 559 Catchments
Author(s) -
Knoben W. J. M.,
Freer J. E.,
Peel M. C.,
Fowler K. J. A.,
Woods R. A.
Publication year - 2020
Publication title -
water resources research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.863
H-Index - 217
eISSN - 1944-7973
pISSN - 0043-1397
DOI - 10.1029/2019wr025975
Subject(s) - equifinality , streamflow , parametrization (atmospheric modeling) , overfitting , benchmark (surveying) , conceptual model , model selection , computer science , calibration , econometrics , environmental science , hydrology (agriculture) , drainage basin , mathematics , statistics , geology , machine learning , geography , artificial intelligence , physics , cartography , geodesy , geotechnical engineering , quantum mechanics , radiative transfer , database , artificial neural network
Abstract The choice of hydrological model structure, that is, a model's selection of states and fluxes and the equations used to describe them, strongly controls model performance and realism. This work investigates differences in performance of 36 lumped conceptual model structures calibrated to and evaluated on daily streamflow data in 559 catchments across the United States. Model performance is compared against a benchmark that accounts for the seasonality of flows in each catchment. We find that our model ensemble struggles to beat the benchmark in snow‐dominated catchments. In most other catchments model structure equifinality (i.e., cases where different models achieve similar high efficiency scores) can be very high. We find no relation between the number of model parameters and performance during either calibration or evaluation periods nor evidence of increased risk of overfitting for models with more parameters. Instead, the choice of model parametrization (i.e., which equations are used and how parameters are used within them) dictates the model's strengths and weaknesses. Results suggest that certain model structures are inherently better suited for certain objective functions and thus for certain study purposes. We find no clear relationships between the catchments where any model performs well and descriptors of those catchments' geology, topography, soil, and vegetation characteristics. Instead, model suitability seems to relate strongest to the streamflow regime each catchment generates, and we have formulated several tentative hypotheses that relate commonalities in model structure to similarities in model performance. Modeling results are made publicly available for further investigation.