The devil is in the detail: unstable response functions in species distribution models challenge bulk ensemble modelling

Author(s) - Hannemann Henrik, Willis Katherine J., Macias-Fauria Marc

Publication year - 2016

Publication title - Global Ecology and Biogeography

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.164

H-Index - 152

eISSN - 1466-8238

pISSN - 1466-822X

DOI - 10.1111/geb.12381

Subject(s) - climate change, principal component analysis, stability (learning theory), climate model, truncation (statistics), variable (mathematics), species distribution, function (biology), biodiversity, environmental science, ecology, computer science, mathematics, statistics, biology, machine learning, habitat, mathematical analysis, evolutionary biology

Aim: Species distribution models (SDMs) are commonly used to determine threats to biodiversity and opportunities under climate change. Although SDMs assume complete knowledge of the climate space of the modelled species, truncated occurrence datasets (and hence truncated climate spaces) such as national inventories are often employed. This may lead to prediction errors, which have been proposed to stem from: (1) the degree of climate space truncation and/or (2) instability of the modelling algorithms. Our aim was to explore the potential causes of prediction errors in SDMs using truncated training datasets.

Location: Europe, 11° W–32° E, 34°–72° N.

Methods: SDMs employing commonly used bioclimatic variables were applied to seven forest tree species. We created two model training datasets covering: (1) Germany only (significantly truncated climate space) and (2) Europe (minimally truncated climate space). Differences between the climate space represented by Germany‐only and European data were measured on two‐dimensional climate spaces obtained through principal component analysis of the bioclimatic variables. Seven SDM algorithms were run, and the stability of the response function and variable selection for each species and model type were analysed.

Results: The degree of climate space truncation was less important for model performance than the instability of model algorithms and indiscriminate variable selection. The latter led to irrelevant relationships of species occurrence with bioclimatic variables. These instabilities caused pronounced prediction errors.

Main conclusions: Our results strongly suggest that erroneous model predictions stem from instability and ecological irrelevance of the statistical functions relating the probability of a species' occurrence to bioclimatic variables, compounded by a lack of consistency in variable selection. Models displaying these characteristics showed lower overall performance when trained with truncated datasets. Further, commonly used ensemble approaches do not compensate for the shortfalls of individual models. Detailed model‐by‐model and species‐by‐species analysis of response functions and variable importance is recommended.
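
The climate-space comparison described in the abstract (a two-dimensional space from principal component analysis, compared between a national subset and the full continental extent) can be illustrated with a minimal sketch. This is not the authors' code: the data are synthetic, the names `pca_2d`, `occupied_cells` and `truncation` are hypothetical, and the grid-cell coverage measure is an assumed stand-in, since the abstract does not say how the difference between climate spaces was quantified.

```python
import numpy as np

def pca_2d(X):
    """Project the rows of X onto the first two principal components."""
    # Standardise each variable so that units do not dominate the PCA.
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    # Rows of Vt are the principal axes of the standardised data.
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    return Z @ Vt[:2].T

def occupied_cells(scores, edges_x, edges_y):
    """Boolean grid: True where a cell of the 2-D space holds >= 1 point."""
    counts, _, _ = np.histogram2d(scores[:, 0], scores[:, 1],
                                  bins=[edges_x, edges_y])
    return counts > 0

def truncation(scores, subset_mask, n_bins=20):
    """Share of the occupied full climate space that the subset misses.

    Assumed measure for illustration: 0.0 means the subset covers every
    occupied cell, values near 1.0 mean it covers almost none.
    """
    edges_x = np.linspace(scores[:, 0].min(), scores[:, 0].max(), n_bins + 1)
    edges_y = np.linspace(scores[:, 1].min(), scores[:, 1].max(), n_bins + 1)
    full = occupied_cells(scores, edges_x, edges_y)
    sub = occupied_cells(scores[subset_mask], edges_x, edges_y)
    return 1.0 - (full & sub).sum() / full.sum()

# Synthetic stand-in for six bioclimatic variables at 1000 grid cells,
# all driven by one hypothetical continental climate gradient plus noise.
rng = np.random.default_rng(0)
n = 1000
gradient = rng.uniform(0.0, 1.0, n)
loadings = np.array([1.0, -0.8, 0.6, 0.9, -0.5, 0.7])
X = gradient[:, None] * loadings + rng.normal(scale=0.1, size=(n, 6))

# Hypothetical "national" subset: only the middle of the gradient.
national = (gradient > 0.4) & (gradient < 0.6)

scores = pca_2d(X)
t = truncation(scores, national)
print(f"share of continental climate space missed by the subset: {t:.2f}")
```

A value close to 0 would indicate that the subset samples nearly the whole climate space; the abstract's finding is that this degree of truncation mattered less than the stability of the fitted response functions, so a measure like this is only one part of the diagnosis.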
