z-logo
Premium
SINGLE‐PASS CLASSIFICATION OF ALL NON‐CODING SEQUENCES IN A BACTERIAL GENOME USING PHYLOGENETIC PROFILES
Author(s) -
Marchais Antonin,
Naville Magali,
Bohn Chantal,
Bouloc Philippe,
Gautheret Daniel
Publication year - 2009
Publication title -
the faseb journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.709
H-Index - 277
eISSN - 1530-6860
pISSN - 0892-6638
DOI - 10.1096/fasebj.23.1_supplement.841.2
Subject(s) - genome , computational biology , biology , phylogenetic tree , non coding rna , bacterial genome size , genetics , gene , coding region , rna , evolutionary biology
Identification and characterization of functional elements in the non‐coding regions of genomes is an elusive and time consuming activity whose output does not keep up with the pace of genome sequencing. Hundreds of bacterial genomes lay unexploited in terms of non‐coding sequence analysis although they may conceal a wide diversity of novel RNA genes, riboswitches or other regulatory elements. We describe a strategy that exploits the entirety of available bacterial genomes to classify all non‐coding elements in a reference species in a single pass. This method clusters non‐coding elements based on their profile of presence among species. Most non‐coding RNAs display specific signatures that enable their grouping in distinct clusters, away from sequence conservation noise and other elements such as promoters. We submitted 24 ncRNA candidates from Staphilococcus aureus to experimental validation and confirmed the presence of 7 novel small RNAs or riboswitches. Besides offering a powerful method for de novo ncRNA identification, the analysis of phylogenetic profiles opens a new path towards the identification of functional relationships between co‐evolving coding and non‐coding elements.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here