EBI Metagenomics in 2017: enriching the analysis of microbial communities, from sequence reads to assemblies
Author(s) -
Alex Mitchell,
Maxim Scheremetjew,
Hubert Denise,
Simon Potter,
Aleksandra Tarkowska,
Matloob Qureshi,
Gustavo A Salazar,
Sebastien Pesseat,
Miguel Boland,
Fiona Hunter,
Petra ten Hoopen,
Blaise Alako,
Clara Amid,
Darren J. Wilkinson,
Thomas P. Curtis,
Guy Cochrane,
ROBERT FINN
Publication year - 2017
Publication title -
nucleic acids research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 9.008
H-Index - 537
eISSN - 1362-4954
pISSN - 0305-1048
DOI - 10.1093/nar/gkx967
Subject(s) - metagenomics , biology , workflow , metadata , computational biology , pipeline (software) , sample (material) , bioinformatics , database , computer science , world wide web , genetics , gene , chemistry , chromatography , programming language
EBI metagenomics (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the analysis and archiving of sequence data derived from the microbial populations found in a particular environment. Over the past two years, EBI metagenomics has increased the number of datasets analysed 10-fold. In addition to increased throughput, the underlying analysis pipeline has been overhauled to include both new or updated tools and reference databases. Of particular note is a new workflow for taxonomic assignments that has been extended to include assignments based on both the large and small subunit RNA marker genes and to encompass all cellular micro-organisms. We also describe the addition of metagenomic assembly as a new analysis service. Our pilot studies have produced over 2400 assemblies from datasets in the public domain. From these assemblies, we have produced a searchable, non-redundant protein database of over 50 million sequences. To provide improved access to the data stored within the resource, we have developed a programmatic interface that provides access to the analysis results and associated sample metadata. Finally, we have integrated the results of a series of statistical analyses that provide estimations of diversity and sample comparisons.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom