
Evaluating the clade size effect in alternative measures of branch support
Author(s) -
Amelia Chemisquy María,
Prevosti Francisco J.
Publication year - 2013
Publication title -
journal of zoological systematics and evolutionary research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.769
H-Index - 50
eISSN - 1439-0469
pISSN - 0947-5745
DOI - 10.1111/jzs.12024
Subject(s) - jackknife resampling , resampling , clade , maximum parsimony , biology , bayesian probability , statistics , posterior probability , mathematics , phylogenetics , gene , genetics , estimator
The clade size effect refers to a bias that causes middle‐sized clades to be less supported than small or large‐sized clades. This bias is present in resampling measures of support calculated under maximum likelihood and maximum parsimony and in B ayesian posterior probabilities. Previous analyses indicated that the clade size effect is worst in maximum parsimony, followed by maximum likelihood, while Bayesian inference is the least affected. Homoplasy was interpreted as the main cause of the effect. In this study, we explored the presence of the clade size effect in alternative measures of branch support under maximum parsimony: B remer support and symmetric resampling, expressed as absolute frequencies and frequency differences. Analyses were performed using 50 molecular and morphological matrices. Symmetric resampling showed the same tendency that bootstrap and jackknife did for maximum parsimony and maximum likelihood. Few matrices showed a significant bias using B remer support, presenting a better performance than resampling measures of support and comparable to B ayesian posterior probabilities. Our results indicate that the problem is not maximum parsimony, but resampling measures of support. We corroborated the role of homoplasy as a possible cause of the clade size effect, increasing the number of random trees during the resampling, which together with the higher chances that medium‐sized clades have of being contradicted generates the bias during the perturbation of the original matrix, making it stronger in resampling measures of support.