z-logo
open-access-imgOpen Access
PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences
Author(s) -
Gaurav Bhardwaj,
Kyung Dae Ko,
Yoojin Hong,
Zhenhai Zhang,
Ngai Lam Ho,
Sree V. Chintapalli,
Lindsay A. Kline,
Matthew Gotlin,
D. Nicholas Hartranft,
Morgen E. Patterson,
Foram Dave,
Evan J. Smith,
Edward C. Holmes,
Randen L. Patterson,
Damian B. van Rossum
Publication year - 2012
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0034261
Subject(s) - phylogenetic tree , multiple sequence alignment , alignment free sequence analysis , sequence alignment , pairwise comparison , biology , indel , maximum parsimony , sequence (biology) , evolutionary biology , computational biology , bayesian probability , sequence analysis , phylogenetics , divergence (linguistics) , tree (set theory) , similarity (geometry) , inference , clade , genetics , artificial intelligence , computer science , peptide sequence , gene , mathematics , combinatorics , image (mathematics) , linguistics , philosophy , single nucleotide polymorphism , genotype
Both multiple sequence alignment and phylogenetic analysis are problematic in the “twilight zone” of sequence similarity (≤25% amino acid identity). Herein we explore the accuracy of phylogenetic inference at extreme sequence divergence using a variety of simulated data sets. We evaluate four leading multiple sequence alignment (MSA) methods (MAFFT, T-COFFEE, CLUSTAL, and MUSCLE) and six commonly used programs of tree estimation (Distance-based: Neighbor-Joining; Character-based: PhyML, RAxML, GARLI, Maximum Parsimony, and Bayesian) against a novel MSA-independent method (PHYRN) described here. Strikingly, at “midnight zone” genetic distances (∼7% pairwise identity and 4.0 gaps per position), PHYRN returns high-resolution phylogenies that outperform traditional approaches. We reason this is due to PHRYN's capability to amplify informative positions, even at the most extreme levels of sequence divergence. We also assess the applicability of the PHYRN algorithm for inferring deep evolutionary relationships in the divergent DANGER protein superfamily, for which PHYRN infers a more robust tree compared to MSA-based approaches. Taken together, these results demonstrate that PHYRN represents a powerful mechanism for mapping uncharted frontiers in highly divergent protein sequence data sets.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom