z-logo
Premium
GIVING MEANINGFUL INTERPRETATION TO ORDINATION AXES: ASSESSING LOADING SIGNIFICANCE IN PRINCIPAL COMPONENT ANALYSIS
Author(s) -
Peres-Neto Pedro R.,
Jackson Donald A.,
Somers Keith M.
Publication year - 2003
Publication title -
ecology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.144
H-Index - 294
eISSN - 1939-9170
pISSN - 0012-9658
DOI - 10.1890/00-0634
Subject(s) - principal component analysis , resampling , ordination , statistics , multivariate statistics , dimensionality reduction , mathematics , confidence interval , curse of dimensionality , eigenvalues and eigenvectors , dimension (graph theory) , data set , range (aeronautics) , econometrics , computer science , artificial intelligence , physics , materials science , quantum mechanics , pure mathematics , composite material
Principal component analysis (PCA) is one of the most commonly used tools in the analysis of ecological data. This method reduces the effective dimensionality of a multivariate data set by producing linear combinations of the original variables (i.e., components) that summarize the predominant patterns in the data. In order to provide meaningful interpretations for principal components, it is important to determine which variables are associated with particular components. Some data analysts incorrectly test the statistical significance of the correlation between original variables and multivariate scores using standard statistical tables. Others interpret eigenvector coefficients larger than an arbitrary absolute value (e.g., 0.50). Resampling, randomization techniques, and parallel analysis have been applied in a few cases. In this study, we compared the performance of a variety of approaches for assessing the significance of eigenvector coefficients in terms of type I error rates and power. Two novel approaches based on the broken‐stick model were also evaluated. We used a variety of simulated scenarios to examine the influence of the number of real dimensions in the data; unique versus complex variables; the magnitude of eigenvector coefficients; and the number of variables associated with a particular dimension. Our results revealed that bootstrap confidence intervals and a modified bootstrap confidence interval for the broken‐stick model proved to be the most reliable techniques.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here