Premium
Application of DNA pooling to large studies of disease
Author(s) -
Law Graham R.,
Rollinson Sara,
Feltbower Richard,
Allan James M.,
Morgan Gareth J.,
Roman Eve
Publication year - 2004
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.1996
Subject(s) - pooling , biobank , computer science , type i and type ii errors , statistics , logistic regression , confidence interval , set (abstract data type) , computational biology , data mining , bioinformatics , biology , machine learning , artificial intelligence , mathematics , programming language
Large collections of individuals are required to investigate the association of commonly occurring genetic variation with disease. The laboratory assessment of one form of variation, single nucleotide polymorphisms, is costly in time and DNA. Robust statistical approaches are developed to allow the successful implementation of a recently described laboratory method for rapidly estimating allele frequency using pools of DNA. A substantial reduction in Type I error is demonstrated using simulation, through the incorporation of measurement error into confidence limits for a case–control study, illustrated on a case–control study of acute leukaemia in adults. A method for creating multiple sub‐pools is described which will allow large studies, such as the proposed U.K. Biobank, to take advantage of this method. Furthermore, a set‐based logistic regression is presented which allows the investigation of joint effects and interactions with other genes or environmental factors. Copyright © 2004 John Wiley & Sons, Ltd.