An Infrastructure to Mine Molecular Descriptors for Ligand Selection on Virtual Screening
Author(s) -
Vinicius Seus,
Giovanni Xavier Perazzo,
Ana T. Winck,
Adriano Velasque Werhli,
Karina Machado
Publication year - 2014
Publication title -
biomed research international
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.772
H-Index - 126
eISSN - 2314-6141
pISSN - 2314-6133
DOI - 10.1155/2014/325959
Subject(s) - virtual screening , selection (genetic algorithm) , data mining , computer science , ligand (biochemistry) , set (abstract data type) , c4.5 algorithm , molecular descriptor , decision tree , feature selection , computational biology , machine learning , quantitative structure–activity relationship , drug discovery , artificial intelligence , chemistry , bioinformatics , biology , receptor , support vector machine , biochemistry , programming language , naive bayes classifier
The receptor-ligand interaction evaluation is one important step in rational drug design. The databases that provide the structures of the ligands are growing on a daily basis. This makes it impossible to test all the ligands for a target receptor. Hence, a ligand selection before testing the ligands is needed. One possible approach is to evaluate a set of molecular descriptors. With the aim of describing the characteristics of promising compounds for a specific receptor we introduce a data warehouse-based infrastructure to mine molecular descriptors for virtual screening (VS). We performed experiments that consider as target the receptor HIV-1 protease and different compounds for this protein. A set of 9 molecular descriptors are taken as the predictive attributes and the free energy of binding is taken as a target attribute. By applying the J48 algorithm over the data we obtain decision tree models that achieved up to 84% of accuracy. The models indicate which molecular descriptors and their respective values are relevant to influence good FEB results. Using their rules we performed ligand selection on ZINC database. Our results show important reduction in ligands selection to be applied in VS experiments; for instance, the best selection model picked only 0.21% of the total amount of drug-like ligands.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom