z-logo
Premium
Genetic background comparison using distance‐based regression, with applications in population stratification evaluation and adjustment
Author(s) -
Li Qizhai,
Wacholder Sholom,
Hunter David J.,
Hoover Robert N.,
Chanock Stephen,
Thomas Gilles,
Yu Kai
Publication year - 2009
Publication title -
genetic epidemiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.301
H-Index - 98
eISSN - 1098-2272
pISSN - 0741-0395
DOI - 10.1002/gepi.20396
Subject(s) - population stratification , covariate , genome wide association study , multivariate statistics , statistics , genetic association , statistic , regression , population , principal component analysis , regression analysis , computer science , parametric statistics , marginal model , statistical power , mathematics , biology , genetics , medicine , environmental health , gene , genotype , single nucleotide polymorphism
Population stratification (PS) can lead to an inflated rate of false‐positive findings in genome‐wide association studies (GWAS). The commonly used approach of adjustment for a fixed number of principal components (PCs) could have a deleterious impact on power when selected PCs are equally distributed in cases and controls, or the adjustment of certain covariates, such as self‐identified ethnicity or recruitment center, already included in the association analyses, correctly maps to major axes of genetic heterogeneity. We propose a computationally efficient procedure, PC‐Finder, to identify a minimal set of PCs while permitting an effective correction for PS. A general pseudo F statistic, derived from a non‐parametric multivariate regression model, can be used to assess whether PS exists or has been adequately corrected by a set of selected PCs. Empirical data from two GWAS conducted as part of the Cancer Genetic Markers of Susceptibility (CGEMS) project demonstrate the application of the procedure. Furthermore, simulation studies show the power advantage of the proposed procedure in GWAS over currently used PS correction strategies, particularly when the PCs with substantial genetic variation are distributed similarly in cases and controls and therefore do not induce PS. Genet. Epidemiol . 33:432–441, 2009. © 2009 Wiley‐Liss, Inc.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here