A Likelihood-Based Approach for Missing Genotype Data | Zendy

Gina D’Angelo | Zendy; M. Ilyas Kamboh | Zendy; Eleanor Feingold | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

A Likelihood-Based Approach for Missing Genotype Data

Author(s) -

Gina D’Angelo,

M. Ilyas Kamboh,

Eleanor Feingold

Publication year - 2010

Publication title -

human heredity

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.423

H-Index - 62

eISSN - 1423-0062

pISSN - 0001-5652

DOI - 10.1159/000273732

Subject(s) - missing data , imputation (statistics) , estimator , expectation–maximization algorithm , sample size determination , statistics , computer science , context (archaeology) , data mining , mathematics , maximum likelihood , biology , paleontology

Missing genotype data in a candidate gene association study can make it difficult to model the effects of multiple genetic variants simultaneously. In particular, when regression models are used to model phenotype as a function of SNP genotypes in several different genes, the most common approach is a complete case analysis, in which only individuals with no missing genotypes are included. But this can lead to substantial reduction in sample size and thus potential bias and loss in efficiency. A number of other methods for handling missing data are applicable, but have rarely been used in this context. The purpose of this paper is to describe how several standard methods for handling missing data can be applied or adapted to this problem, and to compare their performance using a simulation study. We demonstrate these techniques using an Alzheimer's disease association study. We show that the expectation-maximization algorithm and multiple imputation with a bootstrapped expectation-maximization sampling algorithm have the best properties of all the estimators studied.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research