z-logo
open-access-imgOpen Access
CoMA – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis
Author(s) -
Sebastian Hupfauf,
Mohammad Etemadi,
Marina Fernández-Delgado Juárez,
María Gómez-Brandòn,
Heribert Insam,
Sabine Marie Podmirseg
Publication year - 2020
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0243241
Subject(s) - computer science , data mining , pipeline (software) , amplicon sequencing , amplicon , workflow , data visualization , visualization , metagenomics , data science , biology , database , operating system , 16s ribosomal rna , polymerase chain reaction , bacteria , gene , genetics , biochemistry
In recent years, there has been a veritable boost in next-generation sequencing (NGS) of gene amplicons in biological and medical studies. Huge amounts of data are produced and need to be analyzed adequately. Various online and offline analysis tools are available; however, most of them require extensive expertise in computer science or bioinformatics, and often a Linux-based operating system. Here, we introduce “CoMA–Comparative Microbiome Analysis” as a free and intuitive analysis pipeline for amplicon-sequencing data, compatible with any common operating system. Moreover, the tool offers various useful services including data pre-processing, quality checking, clustering to operational taxonomic units (OTUs), taxonomic assignment, data post-processing, data visualization, and statistical appraisal. The workflow results in highly esthetic and publication-ready graphics, as well as output files in standardized formats (e.g. tab-delimited OTU-table, BIOM, NEWICK tree) that can be used for more sophisticated analyses. The CoMA output was validated by a benchmark test, using three mock communities with different sample characteristics (primer set, amplicon length, diversity). The performance was compared with that of Mothur, QIIME and QIIME2-DADA2, popular packages for NGS data analysis. Furthermore, the functionality of CoMA is demonstrated on a practical example, investigating microbial communities from three different soils (grassland, forest, swamp). All tools performed well in the benchmark test and were able to reveal the majority of all genera in the mock communities. Also for the soil samples, the results of CoMA were congruent to those of the other pipelines, in particular when looking at the key microbial players.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here