z-logo
Premium
Preliminary Implementation of New Data Mining Techniques for the Analysis of Simulation Data from Genetic Analysis Workshop 12: Problem 2
Author(s) -
Flodman Pamela,
Macula Anthony J.,
Spence M. Anne,
Torney David C.
Publication year - 2001
Publication title -
genetic epidemiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.301
H-Index - 98
eISSN - 1098-2272
pISSN - 0741-0395
DOI - 10.1002/gepi.2001.21.s1.s390
Subject(s) - pedigree chart , genetics , data set , genotype , biology , set (abstract data type) , computational biology , data mining , computer science , gene , artificial intelligence , programming language
We introduce a new data mining method applicable to complex disease genetics. Our approach is suited to a broad spectrum of diseases, identifying the noteworthy sharing of combinations of alleles in unrelated affected individuals. Furthermore, this approach may be extended to comprise the common types of genotype data, including single‐nucleotide polymorphisms, candidate‐gene sequences, etc. Using a method derived from data‐mining computer algorithms, we analyze a data set of unrelated affected individuals chosen from the simulated pedigrees of problem 2 of the Genetics Analysis Workshop 12. We observe that most marker subsets containing a flanking marker for each of six or seven of the disease‐gene loci yield significant numbers of individuals manifesting substantially similar genotypes. However, initial attempts (blind to the generating model) to identify the predisposing loci have not been successful. Refining our methods so that such loci may routinely be found and validated is underway. © 2001 Wiley‐Liss, Inc.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here