z-logo
open-access-imgOpen Access
A genomic data resource for predicting antimicrobial resistance from laboratory-derived antimicrobial susceptibility phenotypes
Author(s) -
Margo VanOeffelen,
Marcus Nguyen,
Derya Aytan-Aktug,
Thomas Brettin,
Emily Dietrich,
Ronald W. Kenyon,
Dustin Machi,
Chunhong Mao,
Robert Olson,
Gordon D. Pusch,
Maulik Shukla,
Rick Stevens,
Veronika Vonstein,
Andrew Warren,
Alice R. Wattam,
Hyunseung Yoo,
James J. Davis
Publication year - 2021
Publication title -
briefings in bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.204
H-Index - 113
eISSN - 1477-4054
pISSN - 1467-5463
DOI - 10.1093/bib/bbab313
Subject(s) - file transfer protocol , resource (disambiguation) , genome , antibiotic resistance , metadata , computer science , data set , computational biology , data collection , data science , workflow , biology , data mining , artificial intelligence , world wide web , the internet , genetics , database , gene , computer network , statistics , mathematics , antibiotics
Antimicrobial resistance (AMR) is a major global health threat that affects millions of people each year. Funding agencies worldwide and the global research community have expended considerable capital and effort tracking the evolution and spread of AMR by isolating and sequencing bacterial strains and performing antimicrobial susceptibility testing (AST). For the last several years, we have been capturing these efforts by curating data from the literature and data resources and building a set of assembled bacterial genome sequences that are paired with laboratory-derived AST data. This collection currently contains AST data for over 67 000 genomes encompassing approximately 40 genera and over 100 species. In this paper, we describe the characteristics of this collection, highlighting areas where sampling is comparatively deep or shallow, and showing areas where attention is needed from the research community to improve sampling and tracking efforts. In addition to using the data to track the evolution and spread of AMR, it also serves as a useful starting point for building machine learning models for predicting AMR phenotypes. We demonstrate this by describing two machine learning models that are built from the entire dataset to show where the predictive power is comparatively high or low. This AMR metadata collection is freely available and maintained on the Bacterial and Viral Bioinformatics Center (BV-BRC) FTP site ftp://ftp.bvbrc.org/RELEASE_NOTES/PATRIC_genomes_AMR.txt.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom