The Chaperonin-60 Universal Target Is a Barcode for Bacteria That Enables De Novo Assembly of Metagenomic Sequence Data
Author(s) -
Matthew G. Links,
Tim Dumonceaux,
Sean M. Hemmingsen,
Janet E. Hill
Publication year - 2012
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0049755
Subject(s) - barcode , metagenomics , biology , dna barcoding , 16s ribosomal rna , computational biology , phylum , pyrosequencing , genetics , genome , ribosomal rna , gene , bacteria , evolutionary biology , operating system , computer science
Barcoding with molecular sequences is widely used to catalogue eukaryotic biodiversity. Studies investigating the community dynamics of microbes have relied heavily on gene-centric metagenomic profiling using two genes (16S rRNA and cpn 60) to identify and track Bacteria. While there have been criteria formalized for barcoding of eukaryotes, these criteria have not been used to evaluate gene targets for other domains of life. Using the framework of the International Barcode of Life we evaluated DNA barcodes for Bacteria. Candidates from the 16S rRNA gene and the protein coding cpn 60 gene were evaluated. Within complete bacterial genomes in the public domain representing 983 species from 21 phyla, the largest difference between median pairwise inter- and intra-specific distances (“barcode gap”) was found from cpn 60. Distribution of sequence diversity along the ∼555 bp cpn 60 target region was remarkably uniform. The barcode gap of the cpn 60 universal target facilitated the faithful de novo assembly of full-length operational taxonomic units from pyrosequencing data from a synthetic microbial community. Analysis supported the recognition of both 16S rRNA and cpn 60 as DNA barcodes for Bacteria. The cpn 60 universal target was found to have a much larger barcode gap than 16S rRNA suggesting cpn 60 as a preferred barcode for Bacteria. A large barcode gap for cpn 60 provided a robust target for species-level characterization of data. The assembly of consensus sequences for barcodes was shown to be a reliable method for the identification and tracking of novel microbes in metagenomic studies.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom