Open Access
Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life
Author(s) -
Donovan H. Parks,
Christian Rinke,
Maria Chuvochina,
Pierre-Alain Chaumeil,
Ben J. Woodcroft,
Paul N. Evans,
Philip Hugenholtz,
Gene W. Tyson
Publication year - 2017
Publication title -
nature microbiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 7.305
H-Index - 79
ISSN - 2058-5276
DOI - 10.1038/s41564-017-0012-7
Subject(s) - genome , metagenomics , phylum , biology , phylogenetic tree , phylogenetic diversity , bacterial genome size , evolutionary biology , computational biology , tree of life (biology) , phylogenetics , genetics , gene
Challenges in cultivating microorganisms have limited the phylogenetic diversity of currently available microbial genomes. This is being addressed by advances in sequencing throughput and computational techniques that allow for the cultivation-independent recovery of genomes from metagenomes. Here, we report the reconstruction of 7,903 bacterial and archaeal genomes from >1,500 public metagenomes. All genomes are estimated to be ≥50% complete and nearly half are ≥90% complete with ≤5% contamination. These genomes increase the phylogenetic diversity of bacterial and archaeal genome trees by >30% and provide the first representatives of 17 bacterial and three archaeal candidate phyla. We also recovered 245 genomes from the Patescibacteria superphylum (also known as the Candidate Phyla Radiation) and find that the relative diversity of this group varies substantially with different protein marker sets. The scale and quality of this data set demonstrate that recovering genomes from metagenomes provides an expedient path forward to exploring microbial dark matter.