z-logo
Premium
Fast and flexible Bayesian species distribution modelling using Gaussian processes
Author(s) -
Golding Nick,
Purse Bethan V.
Publication year - 2016
Publication title -
methods in ecology and evolution
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.425
H-Index - 105
ISSN - 2041-210X
DOI - 10.1111/2041-210x.12523
Subject(s) - hyperparameter , prior probability , bayesian probability , markov chain monte carlo , computer science , gaussian process , laplace's method , machine learning , gaussian , algorithm , artificial intelligence , mathematics , statistics , physics , quantum mechanics
Summary Species distribution modelling ( SDM ) is widely used in ecology, and predictions of species distributions inform both policy and ecological debates. Therefore, methods with high predictive accuracy and those that enable biological interpretation are preferable. Gaussian processes ( GP s) are a highly flexible approach to statistical modelling and have recently been proposed for SDM . GP models fit smooth, but potentially complex response functions that can account for high‐dimensional interactions between predictors. We propose fitting GP SDM s using deterministic numerical approximations, rather than Markov chain Monte Carlo methods in order to make GP s more computationally efficient and easy to use. We introduce GP models and their application to SDM , illustrate how ecological knowledge can be incorporated into GP SDM s via Bayesian priors and formulate a simple GP SDM that can be fitted efficiently. This model can be fitted either by learning the hyperparameters or by using a fixed approximation to them. Using a subset of the North American Breeding Bird Survey data set, we compare the out‐of‐sample predictive accuracy of these models with several commonly used SDM approaches for both presence/absence and presence‐only data. Predictive accuracy of GP SDM s fitted by Laplace approximation was greater than boosted regression trees, generalized additive models ( GAM s) and logistic regression when trained on presence/absence data and greater than all of these models plus MaxEnt when trained on presence‐only data. GP SDM s fitted using a fixed approximation to hyperparameters were no less accurate than those with MAP estimation and on average 70 times faster, equivalent in speed to GAMs. As well as having strong predictive power for this data set, GP SDM s offer a convenient method for incorporating prior knowledge of the species' ecology. By fitting these methods using efficient numerical approximations, they may easily be applied to large data sets and automatically for many species. An r package, GRaF, is provided to enable SDM users to fit GP models.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here