Accounting for haplotype uncertainty in matched association studies: A comparison of simple and flexible techniques | Zendy

Kraft Peter | Zendy; Cox David G. | Zendy; Paynter Randi A. | Zendy; Hunter David | Zendy; De Vivo Immaculata | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Accounting for haplotype uncertainty in matched association studies: A comparison of simple and flexible techniques

Author(s) -

Kraft Peter,

Cox David G.,

Paynter Randi A.,

Hunter David,

De Vivo Immaculata

Publication year - 2005

Publication title -

genetic epidemiology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.301

H-Index - 98

eISSN - 1098-2272

pISSN - 0741-0395

DOI - 10.1002/gepi.20061

Subject(s) - haplotype , haplotype estimation , imputation (statistics) , covariate , single nucleotide polymorphism , genetic association , biology , genetics , multiple comparisons problem , population stratification , population , null hypothesis , statistics , genotype , missing data , mathematics , gene , medicine , environmental health

Population‐based case‐control studies measuring associations between haplotypes of single nucleotide polymorphisms (SNPs) are increasingly popular, in part because haplotypes of a few “tagging” SNPs may serve as surrogates for variation in relatively large sections of the genome. Due to current technological limitations, haplotypes in cases and controls must be inferred from unphased genotypic data. Using individual‐specific inferred haplotypes as covariates in standard epidemiologic analyses (e.g., conditional logistic regression) is an attractive analysis strategy, as it allows adjustment for nongenetic covariates, provides omnibus and haplotype‐specific tests of association, and can estimate haplotype and haplotype × environment interaction effects. In principle, some adjustment for the uncertainty in inferred haplotypes should be made. Via simulation, we compare the performance (bias and mean squared error of haplotype and haplotype × environment interaction effect estimates) of several analytic strategies using inferred haplotypes in the context of matched case‐control data. These strategies include using only the most likely haplotype assignment, the expectation substitution approach described by Stram et al. ([2003b] Hum. Hered. 55:179–190) and others, and an improper version of multiple imputation. For relatively uncomplicated haplotype structures and moderate haplotype relative risks (≤2), all methods performed comparably well (small bias with appropriately‐sized confidence intervals). For larger relative risks, the most likely haplotype and multiple imputation strategies showed noticeable bias towards the null; the expectation substitution strategy still performed well. When there was more uncertainty in the inferred haplotypes, the most likely and multiple imputation strategies showed even more bias towards the null, while the expectation substitution method had slightly smaller than nominal confidence intervals for larger relative risks (≥5). An application to progesterone‐receptor haplotypes and endometrial cancer further illustrates that the performance of all these methods depends on how well the observed haplotypes “tag” the unobserved causal variant. Genet. Epidemiol. © 2005 Wiley‐Liss, Inc.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research