z-logo
open-access-imgOpen Access
Genomic diversity affects the accuracy of bacterial single-nucleotide polymorphism–calling pipelines
Author(s) -
Stephen J. Bush,
Dona Foster,
David W. Eyre,
Emily L. Clark,
Nicola De Maio,
Liam P. Shaw,
Nicole Stoesser,
Tim Peto,
Derrick W. Crook,
Timothy M. Walker
Publication year - 2020
Publication title -
gigascience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.947
H-Index - 54
ISSN - 2047-217X
DOI - 10.1093/gigascience/giaa007
Subject(s) - biology , genome , genetics , single nucleotide polymorphism , computational biology , genomics , snp , bacterial genome size , reference genome , gene , genotype
Accurately identifying single-nucleotide polymorphisms (SNPs) from bacterial sequencing data is an essential requirement for using genomics to track transmission and predict important phenotypes such as antimicrobial resistance. However, most previous performance evaluations of SNP calling have been restricted to eukaryotic (human) data. Additionally, bacterial SNP calling requires choosing an appropriate reference genome to align reads to, which, together with the bioinformatic pipeline, affects the accuracy and completeness of a set of SNP calls obtained. This study evaluates the performance of 209 SNP-calling pipelines using a combination of simulated data from 254 strains of 10 clinically common bacteria and real data from environmentally sourced and genomically diverse isolates within the genera Citrobacter, Enterobacter, Escherichia, and Klebsiella.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom