z-logo
open-access-imgOpen Access
A novel method of literature mining to identify candidate COVID-19 drugs
Author(s) -
Tomonari Muramatsu,
Masaru Tanokura
Publication year - 2021
Publication title -
bioinformatics advances
Language(s) - English
Resource type - Journals
ISSN - 2635-0041
DOI - 10.1093/bioadv/vbab013
Subject(s) - kegg , encyclopedia , covid-19 , concatenation (mathematics) , computer science , drug discovery , computational biology , bioinformatics , infectious disease (medical specialty) , medicine , biology , disease , gene , genetics , mathematics , gene ontology , gene expression , pathology , combinatorics , library science
Summary COVID-19 is a serious infectious disease that has recently emerged and continues to spread worldwide. Its spreading rate is too high to expect that new specific drugs will be developed in sufficient time. As an alternative, drugs already developed for other diseases have been tested for use in the treatment of COVID-19 (drug repositioning). However, to select candidate drugs from a large number of compounds, numerous inhibition assays involving viral infection of cultured cells are required. For efficiency, it would be useful to narrow the list of candidates down using logical considerations prior to performing these assays. We have developed a powerful tool to predict candidate drugs for the treatment of COVID-19 and other diseases. This tool is based on the concatenation of events/substances, each of which is linked to a KEGG (Kyoto Encyclopedia of Genes and Genomes) code based on a relationship obtained from text mining of the vast literature in the PubMed database. By analyzing 21 589 326 records with abstracts from PubMed, 98 556 KEGG codes with NAME/DEFINITION fields were connected. Among them, 9799 KEGG drug codes were connected to COVID-19, of which 7492 codes had no direct connection to COVID-19. Although this report focuses on COVID-19, the program developed here can be applied to other infectious diseases and used to quickly identify drug candidates when new infectious diseases appear in the future. Availability and implementation The programs and data underlying this article will be shared on reasonable request to the corresponding authors. Contact atmuramatsu@g.ecc.u-tokyo.ac.jp, amtanok@mail.ecc.u-tokyo.ac.jp Supplementary information Supplementary data are available at Bioinformatics Advances online.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom