z-logo
Premium
Accounting for haplotype phase uncertainty in linkage disequilibrium estimation
Author(s) -
Kulle B.,
Frigessi A.,
Edvardsen H.,
Kristensen V.,
Wojnowski L.
Publication year - 2008
Publication title -
genetic epidemiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.301
H-Index - 98
eISSN - 1098-2272
pISSN - 0741-0395
DOI - 10.1002/gepi.20273
Subject(s) - linkage disequilibrium , haplotype , linkage (software) , expectation–maximization algorithm , algorithm , disequilibrium , computer science , population , weighting , genotyping , data mining , genetics , statistics , biology , mathematics , maximum likelihood , allele , physics , medicine , demography , sociology , acoustics , ophthalmology , genotype , gene
Abstract The characterization of linkage disequilibrium (LD) is applied in a variety of studies including the identification of molecular determinants of the local recombination rate, the migration and population history of populations, and the role of positive selection in adaptation. LD suffers from the phase uncertainty of the haplotypes used in its calculation, which reflects limitations of the algorithms used for haplotype estimation. We introduce a LD calculation method, which deals with phase uncertainty by weighting all possible haplotype pairs according to their estimated probabilities as evaluated by PHASE. In contrast to the expectation‐maximization (EM) algorithm as implemented in the HAPLOVIEW and GENETICS packages, our method considers haplotypes based on the entire genetic information available for the candidate region. We tested the method using simulated and real genotyping data. The results show that, for all practical purposes, the new method is advantageous in comparison with algorithms that calculate LD using only the most probable haplotype or bilocus haplotypes based on the EM algorithm. The new method deals especially well with low LD regions, which contribute strongly to phase uncertainty. Altogether, the method is an attractive alternative to standard LD calculation procedures, including those based on the EM algorithm. We implemented the method in the software suite R, together with an interface to the popular haplotype calculation package PHASE. Genet. Epidemiol . 2008. ©2007 Wiley‐Liss, Inc.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here