z-logo
Premium
Selecting thresholds for the prediction of species occurrence with presence‐only data
Author(s) -
Liu Canran,
White Matt,
Newell Graeme
Publication year - 2013
Publication title -
journal of biogeography
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.7
H-Index - 158
eISSN - 1365-2699
pISSN - 0305-0270
DOI - 10.1111/jbi.12058
Subject(s) - selection (genetic algorithm) , environmental niche modelling , sensitivity (control systems) , computer science , species distribution , set (abstract data type) , data set , threshold model , statistics , model selection , data mining , ecology , mathematics , machine learning , artificial intelligence , biology , engineering , ecological niche , electronic engineering , habitat , programming language
Aim Species distribution models have been widely used to tackle ecological, evolutionary and conservation problems. Most species distribution modelling techniques produce continuous suitability predictions, but many real applications (e.g. reserve design, species invasion and climate change impact assessment) and model evaluations require binary outputs, and thresholds are needed for these transformations. Although there are many threshold selection methods for presence/absence data, it is unclear whether these are suitable for presence‐only data. In this paper, we investigate mathematically and empirically which of the existing threshold selection methods can be used confidently with presence‐only data. Location We used real spatially explicit environmental data derived from the western part of the state of V ictoria, south‐eastern A ustralia, and simulated species distributions within this area. Methods Thirteen existing threshold selection methods were investigated mathematically to see whether the same threshold can be produced using either presence/absence data or presence‐only data. We further adopted a simulation approach, created many virtual species with differing prevalences in a real landscape in south‐eastern A ustralia, generated data sets with different proportions of pseudo‐absences, built eight types of models with four modelling techniques, and investigated the behaviours of four threshold selection methods in these situations. Results Three threshold selection methods were not affected by pseudo‐absences, including max SSS (which is based on maximizing the sum of sensitivity and specificity), the prevalence of model training data and the mean predicted value of a set of random points. Max SSS produced higher sensitivity in most cases and higher true skill statistic and kappa in many cases than the other methods. The other methods produced different thresholds from presence‐only data to those determined from presence/absence data. Main conclusions Max SSS is a promising method for threshold selection when only presence data are available.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here