Premium
Stopping Rules in Principal Components Analysis: A Comparison of Heuristical and Statistical Approaches
Author(s) -
Jackson Donald A.
Publication year - 1993
Publication title -
ecology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.144
H-Index - 294
eISSN - 1939-9170
pISSN - 0012-9658
DOI - 10.2307/1939574
Subject(s) - mathematics , statistics , principal component analysis , sphericity , guttman scale , homogeneity (statistics) , eigenvalues and eigenvectors , statistical hypothesis testing , geometry , physics , quantum mechanics
Approaches to determining the number of components to interpret from principal components analysis were compared. Heuristic procedures included: retaining components with eigenvalues (λ) > 1 (i.e., Kaiser—Guttman criterion); components with bootstrapped λ > 1 (bootstrapped Kaiser—Guttman); the scree plot; the broken—stick model; and components with λ totalling to a fixed amount of the total variance. Statistical approaches included: Bartlett's test of sphericity; Bartlett's test of homogeneity of the correlation matrix, Lawley's test of the second λ bootstrapped confidence limits on successive λ (i.e., significant differences between λ); and bootstrapped confidence limits on eigenvector coefficients (i.e., coefficients that differ significantly from zero). All methods were compared using simulated data matrices of uniform correlation structure, patterned matrices of varying correlation structure and data sets of lake morphometry, water chemistry, and benthic invertebrate abundance. The most consistent results were obtained from the broken—stick model and a combined measure using bootstrapped λ and associated eigenvector coefficients. The traditional and bootstrapped Kaiser—Guttman approaches over—estimated the number of nontrivial dimensions as did the fixed—amount—of—variance model. The scree plot consistently estimated one dimension more than the number of simulated dimensions. Barlett's test of sphericity showed inconsistent results. Both Bartlett's test of homogeneity of the correlation matrix and Lawley's test are limited to testing for only one and two dimensions, respectively.