Accounting For Alignment Uncertainty in Phylogenomics
Author(s) -
Martin Wu,
Sourav Chatterji,
Jonathan A. Eisen
Publication year - 2012
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0030288
Subject(s) - multiple sequence alignment , phylogenetic tree , masking (illustration) , probabilistic logic , phylogenomics , sequence (biology) , computer science , tree (set theory) , confidence interval , sequence alignment , statistics , artificial intelligence , data mining , biology , computational biology , mathematics , genetics , art , clade , mathematical analysis , gene , peptide sequence , visual arts
Uncertainty in multiple sequence alignments has a large impact on phylogenetic analyses. Little has been done to evaluate the quality of individual positions in protein sequence alignments, which directly impact the accuracy of phylogenetic trees. Here we describe ZORRO, a probabilistic masking program that accounts for alignment uncertainty by assigning confidence scores to each alignment position. Using the BALIBASE database and in simulation studies, we demonstrate that masking by ZORRO significantly reduces the alignment uncertainty and improves the tree accuracy.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom