Premium
The performance of phylogenetic algorithms in estimating haplotype genealogies with migration
Author(s) -
SALZBURGER WALTER,
EWING GREG B.,
Von HAESELER ARNDT
Publication year - 2011
Publication title -
molecular ecology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.619
H-Index - 225
eISSN - 1365-294X
pISSN - 0962-1083
DOI - 10.1111/j.1365-294x.2011.05066.x
Subject(s) - phylogenetic tree , biology , coalescent theory , maximum parsimony , phylogenetic network , tree (set theory) , phylogenetics , haplotype , phylogeography , evolutionary biology , population , algorithm , genetics , computer science , clade , mathematics , gene , combinatorics , demography , sociology , genotype
Genealogies estimated from haplotypic genetic data play a prominent role in various biological disciplines in general and in phylogenetics, population genetics and phylogeography in particular. Several software packages have specifically been developed for the purpose of reconstructing genealogies from closely related, and hence, highly similar haplotype sequence data. Here, we use simulated data sets to test the performance of traditional phylogenetic algorithms, neighbour‐joining, maximum parsimony and maximum likelihood in estimating genealogies from nonrecombining haplotypic genetic data. We demonstrate that these methods are suitable for constructing genealogies from sets of closely related DNA sequences with or without migration. As genealogies based on phylogenetic reconstructions are fully resolved, but not necessarily bifurcating, and without reticulations, these approaches outperform widespread ‘network’ constructing methods. In our simulations of coalescent scenarios involving panmictic, symmetric and asymmetric migration, we found that phylogenetic reconstruction methods performed well, while the statistical parsimony approach as implemented in TCS performed poorly. Overall, parsimony as implemented in the PHYLIP package performed slightly better than other methods. We further point out that we are not making the case that widespread ‘network’ constructing methods are bad, but that traditional phylogenetic tree finding methods are applicable to haplotypic data and exhibit reasonable performance with respect to accuracy and robustness. We also discuss some of the problems of converting a tree to a haplotype genealogy, in particular that it is nonunique.