z-logo
Premium
Control of Population Stratification by Correlation‐Selected Principal Components
Author(s) -
Lee Seunggeun,
Wright Fred A.,
Zou Fei
Publication year - 2011
Publication title -
biometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.298
H-Index - 130
eISSN - 1541-0420
pISSN - 0006-341X
DOI - 10.1111/j.1541-0420.2010.01520.x
Subject(s) - population stratification , principal component analysis , population , principal (computer security) , eigenvalues and eigenvectors , statistics , stratification (seeds) , correlation , mathematics , econometrics , computer science , data mining , biology , genotype , genetics , medicine , seed dormancy , physics , germination , environmental health , botany , geometry , quantum mechanics , dormancy , gene , single nucleotide polymorphism , operating system
Summary In genome‐wide association studies, population stratification is recognized as producing inflated type I error due to the inflation of test statistics. Principal component‐based methods applied to genotypes provide information about population structure, and have been widely used to control for stratification. Here we explore the precise relationship between genotype principal components and inflation of association test statistics, thereby drawing a connection between principal component‐based stratification control and the alternative approach of genomic control. Our results provide an inherent justification for the use of principal components, but call into question the popular practice of selecting principal components based on significance of eigenvalues alone. We propose a new approach, called EigenCorr, which selects principal components based on both their eigenvalues and their correlation with the (disease) phenotype. Our approach tends to select fewer principal components for stratification control than does testing of eigenvalues alone, providing substantial computational savings and improvements in power. Analyses of simulated and real data demonstrate the usefulness of the proposed approach.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here