Does ‘Big Data’ Exist in Medicinal Chemistry, and If So, How can It Be Harnessed?
Author(s) -
Igor V. Tetko,
Ola Engkvist,
Hongming Chen
Publication year - 2016
Publication title -
future medicinal chemistry
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.708
H-Index - 69
eISSN - 1756-8927
pISSN - 1756-8919
DOI - 10.4155/fmc-2016-0163
Subject(s) - pubchem , chembl , library science , big data , chemistry , computer science , drug discovery , data mining , biochemistry
The term ‘Big Data’ has gained increasing popularity within the chemistry field and across science broadly in recent years. Chemical databases have seen a dramatic growth over the past decade, with, for example, ChEMBL, REAXYS and PubChem providing hundreds of millions of experimental facts for tens of millions of compounds. Moreover, even larger datasets of experimental measurements are held within in-house data collections at pharma companies. Overall, the total number of entries across these databases is in the range of a billion, 109; however, although this number may seem impressive, it pales into comparison relative to other fields, where the amount of data is frequently measured in exabytes, 1018. Thus, does Big Data really exist within the chemistry field? What are such data within medicinal chemistry specifically and where do the challenges lie in analysis of these data? Big Data refer to data out of the scale of traditional applications, which require efforts beyond the traditional analysis. In this article, we will be discussing how it applies to medicinal chemistry, as well as providing an overview of some of the most important trends in the medicinal chemistry–Big Data field
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom