Premium
Powerful Set‐Based Gene‐Environment Interaction Testing Framework for Complex Diseases
Author(s) -
Jiao Shuo,
Peters Ulrike,
Berndt Sonja,
Bézieau Stéphane,
Brenner Hermann,
Campbell Peter T.,
Chan Andrew T.,
ChangClaude Jenny,
Lemire Mathieu,
Newcomb Polly A.,
Potter John D.,
Slattery Martha L.,
Woods Michael O.,
Hsu Li
Publication year - 2015
Publication title -
genetic epidemiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.301
H-Index - 98
eISSN - 1098-2272
pISSN - 0741-0395
DOI - 10.1002/gepi.21908
Subject(s) - set (abstract data type) , computer science , identification (biology) , test set , independence (probability theory) , variance (accounting) , data mining , aggregate (composite) , computational biology , machine learning , biology , mathematics , statistics , botany , materials science , accounting , business , composite material , programming language
ABSTRACT Identification of gene‐environment interaction (G × E) is important in understanding the etiology of complex diseases. Based on our previously developed Set Based gene EnviRonment InterAction test (SBERIA), in this paper we propose a powerful framework for enhanced set‐based G × E testing (eSBERIA). The major challenge of signal aggregation within a set is how to tell signals from noise. eSBERIA tackles this challenge by adaptively aggregating the interaction signals within a set weighted by the strength of the marginal and correlation screening signals. eSBERIA then combines the screening‐informed aggregate test with a variance component test to account for the residual signals. Additionally, we develop a case‐only extension for eSBERIA (coSBERIA) and an existing set‐based method, which boosts the power not only by exploiting the G‐E independence assumption but also by avoiding the need to specify main effects for a large number of variants in the set. Through extensive simulation, we show that coSBERIA and eSBERIA are considerably more powerful than existing methods within the case‐only and the case‐control method categories across a wide range of scenarios. We conduct a genome‐wide G × E search by applying our methods to Illumina HumanExome Beadchip data of 10,446 colorectal cancer cases and 10,191 controls and identify two novel interactions between nonsteroidal anti‐inflammatory drugs (NSAIDs) and MINK1 and PTCHD3 .