z-logo
open-access-imgOpen Access
Disentangling good from bad practices in the selection of spatial or phylogenetic eigenvectors
Author(s) -
Bauman David,
Drouet Thomas,
Dray Stéphane,
Vleminckx Jason
Publication year - 2018
Publication title -
ecography
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.973
H-Index - 128
eISSN - 1600-0587
pISSN - 0906-7590
DOI - 10.1111/ecog.03380
Subject(s) - akaike information criterion , selection (genetic algorithm) , type i and type ii errors , spatial analysis , statistics , mathematics , spatial ecology , eigenvalues and eigenvectors , computer science , data mining , ecology , artificial intelligence , biology , physics , quantum mechanics
Eigenvector mapping techniques are widely used by ecologists and evolutionary biologists to describe and control for spatial and/or phylogenetic patterns in their data. The selection of an appropriate subset of eigenvectors is a critical step (misspecification can lead to highly biased results and interpretations), and there is no consensus yet on how to proceed. We conducted a ten‐year review of the practices of eigenvector selection and highlighted three main procedures: selecting the subset of descriptors minimising the Akaike information criterion (AIC), using a forward selection with double stopping criterion after testing the global model significance (FWD), and selecting the subset minimising the autocorrelation in the model residuals (MIR). We compared the type I error rates, statistical power, and R ² estimation accuracy of these methods using simulated data. Finally, a real dataset was analysed using variation partitioning analysis to illustrate to what extent the different selection approaches affected the ecological interpretation of the results. We show that, while the FWD and MIR approaches presented a correct type I error rate and were accurate, the AIC approach displayed extreme type I error rates (100%), and strongly overestimated the R ². Moreover, the AIC approach resulted in wrong ecological interpretations, as it overestimated the pure spatial fraction (and the joint spatial‐environmental fraction to a lesser extent) of the variation partitioning. Both the FWD and MIR methods performed well at broad and medium scales but had a very low power to detect fine‐scale patterns. The FWD approach selected more eigenvectors than the MIR approach but also returned more accurate R ² estimates. Hence, we discourage any future use of the AIC approach, and advocate choosing between the MIR and FWD approaches depending on the objective of the study: controlling for spatial or phylogenetic autocorrelation (MIR) or describing the patterns as accurately as possible (FWD).

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here