z-logo
Premium
Using missing data methods in genetic studies with missing mutation status
Author(s) -
Leong Traci,
Lipsitz Stuart R.,
Ibrahim Joseph G.
Publication year - 1999
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/(sici)1097-0258(19990228)18:4<473::aid-sim21>3.0.co;2-h
Subject(s) - mutation , missing data , genetics , gene mutation , gene , biology , statistics , mathematics
Because of current techniques of determining gene mutation, investigators are now interested in estimating the odds ratio between genetic status (mutation, no mutation) and an outcome variable such as disease cell type (A,B). In this paper we consider the mutation of the RAS genetic family. To determine if the genes have mutated, investigators look at five specific locations on the RAS gene. RAS mutated is a mutation in at least one of the five gene locations and RAS non‐mutated is no mutation in any of the five locations. Owing to limited time and financial resources, one cannot obtain a complete genetic evaluation of all five locations on the gene for all patients. We propose the use of maximum likelihood (ML) with a 2 6 multinomial distribution formed by cross‐classifying the binary mutation status at five locations by binary disease cell type. This ML method includes all patients regardless of completeness of data, treats the locations not evaluated as missing data, and uses the EM algorithm to estimate the odds ratio between genetic mutation status and the disease type. We compare the ML method to complete case estimates, and a method used by clinical investigators, which excludes patients with data on less than five locations who have no mutations on these sites. Copyright © 1999 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here