A Primer for Model Selection: The Decisive Role of Model Complexity | Zendy

Höge Marvin | Zendy; Wöhling Thomas | Zendy; Nowak Wolfgang | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

A Primer for Model Selection: The Decisive Role of Model Complexity

Author(s) -

Höge Marvin,

Wöhling Thomas,

Nowak Wolfgang

Publication year - 2018

Publication title -

water resources research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.863

H-Index - 217

eISSN - 1944-7973

pISSN - 0043-1397

DOI - 10.1002/2017wr021902

Subject(s) - model selection , selection (genetic algorithm) , computer science , machine learning , context (archaeology) , task (project management) , data mining , bayesian probability , statistical model , artificial intelligence , bayesian information criterion , biology , paleontology , management , economics

Selecting a “best” model among several competing candidate models poses an often encountered problem in water resources modeling (and other disciplines which employ models). For a modeler, the best model fulfills a certain purpose best (e.g., flood prediction), which is typically assessed by comparing model simulations to data (e.g., stream flow). Model selection methods find the “best” trade‐off between good fit with data and model complexity. In this context, the interpretations of model complexity implied by different model selection methods are crucial, because they represent different underlying goals of modeling. Over the last decades, numerous model selection criteria have been proposed, but modelers who primarily want to apply a model selection criterion often face a lack of guidance for choosing the right criterion that matches their goal. We propose a classification scheme for model selection criteria that helps to find the right criterion for a specific goal, i.e., which employs the correct complexity interpretation. We identify four model selection classes which seek to achieve high predictive density, low predictive error, high model probability, or shortest compression of data. These goals can be achieved by following either nonconsistent or consistent model selection and by either incorporating a Bayesian parameter prior or not. We allocate commonly used criteria to these four classes, analyze how they represent model complexity and what this means for the model selection task. Finally, we provide guidance on choosing the right type of criteria for specific model selection tasks. (A quick guide through all key points is given at the end of the introduction.)

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research