z-logo
open-access-imgOpen Access
Maximizing the Power of Principal-Component Analysis of Correlated Phenotypes in Genome-wide Association Studies
Author(s) -
Hugues Aschard,
Bjarni J. Vilhjálmsson,
Nicolas Greliche,
PierreEmmanuel Morange,
DavidAlexandre Trégouët,
Peter Kraft
Publication year - 2014
Publication title -
the american journal of human genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 6.661
H-Index - 302
eISSN - 1537-6605
pISSN - 0002-9297
DOI - 10.1016/j.ajhg.2014.03.016
Subject(s) - principal component analysis , trait , genetic association , correlation , robustness (evolution) , pleiotropy , genome wide association study , genetic correlation , multivariate statistics , biology , quantitative trait locus , snp , statistics , statistical power , single nucleotide polymorphism , computational biology , computer science , genetics , genetic variation , phenotype , mathematics , genotype , gene , geometry , programming language
Many human traits are highly correlated. This correlation can be leveraged to improve the power of genetic association tests to identify markers associated with one or more of the traits. Principal component analysis (PCA) is a useful tool that has been widely used for the multivariate analysis of correlated variables. PCA is usually applied as a dimension reduction method: the few top principal components (PCs) explaining most of total trait variance are tested for association with a predictor of interest, and the remaining components are not analyzed. In this study we review the theoretical basis of PCA and describe the behavior of PCA when testing for association between a SNP and correlated traits. We then use simulation to compare the power of various PCA-based strategies when analyzing up to 100 correlated traits. We show that contrary to widespread practice, testing only the top PCs often has low power, whereas combining signal across all PCs can have greater power. This power gain is primarily due to increased power to detect genetic variants with opposite effects on positively correlated traits and variants that are exclusively associated with a single trait. Relative to other methods, the combined-PC approach has close to optimal power in all scenarios considered while offering more flexibility and more robustness to potential confounders. Finally, we apply the proposed PCA strategy to the genome-wide association study of five correlated coagulation traits where we identify two candidate SNPs that were not found by the standard approach.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom