Premium
Computational chemistry development of a unified free energy Markov model for the distribution of 1300 chemicals to 38 different environmental or biological systems
Author(s) -
CruzMonteagudo Maykel,
GonzálezDíaz Humberto,
AgüeroChapín Guillermín,
Santana Lourdes,
Borges Fernanda,
Domínguez Elena Rosa,
Podda Gianni,
Uriarte Eugenio
Publication year - 2007
Publication title -
journal of computational chemistry
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.907
H-Index - 188
eISSN - 1096-987X
pISSN - 0192-8651
DOI - 10.1002/jcc.20730
Subject(s) - markov chain , biochemical engineering , energy (signal processing) , distribution (mathematics) , statistical physics , computer science , markov model , development (topology) , biological system , chemistry , mathematics , engineering , physics , biology , machine learning , statistics , mathematical analysis
Predicting tissue and environmental distribution of chemicals is of major importance for environmental and life sciences. Most of the molecular descriptors used in computational prediction of chemicals partition behavior consider molecular structure but ignore the nature of the partition system. Consequently, computational models derived up‐to‐date are restricted to the specific system under study. Here, a free energy‐based descriptor (Δ G k ) is introduced, which circumvent this problem. Based on Δ G k , we developed for the first time a single linear classification model to predict the partition behavior of a broad number of structurally diverse drugs and other chemicals (1300) for 38 different partition systems of biological and environmental significance. The model presented training/predicting set accuracies of 91.79/88.92%. Parametrical assumptions were checked. Desirability analysis was used to explore the levels of the predictors that produce the most desirable partition properties. Finally, inversion of the partition direction for each one of the 38 partition systems evidences that our models correctly classified 89.08% of compounds with an uncertainty of only ±0.17% independently of the direction of the partition process used to seek the model. Other 10 different classification models (linear, neural networks, and genetic algorithms) were also tested for the same purposes. None of these computational models favorably compare with respect to the linear model indicating that our approach capture the main aspects that govern chemicals partition in different systems. © 2007 Wiley Periodicals, Inc. J Comput Chem, 2007