z-logo
Premium
Incorporation of gap characters and lineage‐specific regions into phylogenetic analyses of gene families from divergent clades: an example from the kinesin superfamily across eukaryotes
Author(s) -
Simmons Mark P.,
Richardson Dale,
Reddy Anireddy S. N.
Publication year - 2008
Publication title -
cladistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.323
H-Index - 92
eISSN - 1096-0031
pISSN - 0748-3007
DOI - 10.1111/j.1096-0031.2007.00183.x
Subject(s) - phylogenetic tree , clade , biology , lineage (genetic) , jackknife resampling , evolutionary biology , gene , context (archaeology) , gene family , tree (set theory) , coalescent theory , maximum parsimony , phylogenetics , inference , genetics , phylogenetic network , computational biology , artificial intelligence , computer science , genome , paleontology , combinatorics , statistics , mathematics , estimator
The kinesin superfamily across eukaryotes was used to examine how incorporation of gap characters scored from conserved regions shared by all members of a gene family and incorporation of amino acid and gap characters scored from lineage‐specific regions affect gene‐tree inference of the gene family as a whole. We addressed these two questions in the context of two different densities of sequence sampling, four alignment programs, and two methods of tree construction. Taken together, our findings suggest the following. First, gap characters should be incorporated into gene‐tree inference, even for divergent sequences. Second, gene regions that are not conserved among all or most sequences sampled should not be automatically discarded without evaluation of potential phylogenetic signal that may be contained in gap and/or sequence characters. Third, among the four alignment programs evaluated using their default alignment parameters, Clustal may be expected to output alignments that result in the greatest gene‐tree resolution and support. Yet, this high resolution and support should be regarded as optimistic, rather than conservative, estimates. Fourth, this same conclusion regarding resolution and support holds for Bayesian gene‐tree analyses relative to parsimony‐jackknife gene‐tree analyses. We suggest that a more conservative approach, such as aligning the sequences using DIALIGN‐T or MAFFT, analyzing the appropriate characters using parsimony, and assessing branch support using the jackknife, is more appropriate for inferring gene trees of divergent gene families. © The Willi Hennig Society 2007.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here