z-logo
Premium
Accounting for Genotyping Errors in Tagging SNP Selection
Author(s) -
Liu W.,
Yang T.,
Zhao W.,
Chase G. A.
Publication year - 2007
Publication title -
annals of human genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.537
H-Index - 77
eISSN - 1469-1809
pISSN - 0003-4800
DOI - 10.1111/j.1469-1809.2007.00354.x
Subject(s) - genotyping , haplotype , selection (genetic algorithm) , snp , snp genotyping , statistics , biology , genetics , computational biology , computer science , genotype , algorithm , single nucleotide polymorphism , mathematics , artificial intelligence , gene
One limitation of the existing tagging SNP selection algorithms is that they assume the reported genotypes are error free. However, genotyping errors are often unavoidable in practice. Many tagging SNP selection methods depend heavily on the estimated haplotype frequencies. Recent studies have demonstrated that even slight genotyping errors can lead to serious consequences with regard to haplotype reconstruction and frequency estimation. Here we present a tagging SNP selection method that allows for genotyping errors. Our method is a modification of the pair-wise r(2) tagging SNP selection algorithm proposed by Carlson et al. (2004). We have replaced the standard EM algorithm in Carlson's method with an EM that accounts for genotyping errors, in an attempt to obtain better estimates of the haplotype frequencies and r(2) measure. Through simulation studies we compared the performance of our modified algorithm with that of the original algorithm. We found that the number of tags selected by both methods increased with increasing genotyping errors, though our method led to smaller increase. The power of haplotype association tests using the selected tags decreased dramatically with increasing genotyping errors. The power of single marker tests also decreased, but the reduction was not as much as the reduction in power of haplotype tests. When restricting the mean number of tags selected by both methods to be similar to the baseline number, Carlson's method and our method led to similar power for the subsequent haplotype and single marker tests. Our results showed that, by accounting for random genotyping errors, our method can select tagging SNPs more efficiently than Carlson's method. The computer program that implements our modified tagging SNP selection algorithm is available at our web site: http://www.personal.psu.edu/tuy104/.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here