Premium
Efficient Generalized Least Squares Method for Mixed Population and Family‐based Samples in Genome‐wide Association Studies
Author(s) -
Li Jia,
Yang James,
Levin Albert M.,
Montgomery Courtney G.,
Datta Indrani,
Trudeau Sheri,
Adrianto Indra,
McKeigue Paul,
Iannuzzi Michael C.,
Rybicki Benjamin A.
Publication year - 2014
Publication title -
genetic epidemiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.301
H-Index - 98
eISSN - 1098-2272
pISSN - 0741-0395
DOI - 10.1002/gepi.21811
Subject(s) - genome wide association study , genetic association , type i and type ii errors , categorical variable , generalized estimating equation , covariate , statistics , logistic regression , sample size determination , genetics , biology , single nucleotide polymorphism , mathematics , genotype , gene
Genome‐wide association studies (GWAS) that draw samples from multiple studies with a mixture of relationship structures are becoming more common. Analytical methods exist for using mixed‐sample data, but few methods have been proposed for the analysis of genotype‐by‐environment (G×E) interactions. Using GWAS data from a study of sarcoidosis susceptibility genes in related and unrelated African Americans, we explored the current analytic options for genotype association testing in studies using both unrelated and family‐based designs. We propose a novel method—generalized least squares (GLX)—to estimate both SNP and G×E interaction effects for categorical environmental covariates and compared this method to generalized estimating equations (GEE), logistic regression, the Cochran–Armitage trend test, and the W QLS and M QLS methods. We used simulation to demonstrate that the GLX method reduces type I error under a variety of pedigree structures. We also demonstrate its superior power to detect SNP effects while offering computational advantages and comparable power to detect G×E interactions versus GEE. Using this method, we found two novel SNPs that demonstrate a significant genome‐wide interaction with insecticide exposure—rs10499003 and rs7745248, located in the intronic and 3' UTR regions of the FUT9 gene on chromosome 6q16.1.