Premium
Estimation of infinite dilution activity coefficients of organic compounds in water with neural classifiers
Author(s) -
Giralt Francesc,
Espinosa G.,
Arenas A.,
FerreGine J.,
Amat L.,
Gironés X.,
CarbóDorca R.,
Cohen Y.
Publication year - 2004
Publication title -
aiche journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.958
H-Index - 167
eISSN - 1547-5905
pISSN - 0001-1541
DOI - 10.1002/aic.10116
Subject(s) - quantitative structure–activity relationship , molecular descriptor , artificial neural network , polarizability , similarity (geometry) , chemistry , artificial intelligence , biological system , dipole , computational chemistry , mathematics , pattern recognition (psychology) , machine learning , computer science , molecule , organic chemistry , biology , image (mathematics)
A new approach is presented for the development of quantitative structure–property relations (QSPR) based on the extraction of relevant molecular features with self‐organizing maps and the use of a modified fuzzy‐ARTMAP classifier for variable prediction. The present methodology is demonstrated for the development of a QSPR for the aqueous‐phase infinite dilution activity coefficient γ ∞ , based on a data set of 325 diverse organic compounds. The QSPR was developed using a set of 11 molecular descriptors (four connectivities v χ 1–4 , Coulomb self‐similarity measure, electron–nuclear attraction, dipole moment, sum of atomic numbers, number of filled levels, average polarizability, and nuclear–nuclear repulsion). The final set of molecular descriptors was selected from an initial pool of 23 topological and quantum chemical descriptors, including six molecular quantum similarity measures, by means of a topological analysis of self‐organization of the data set. Additional interpolated information to enhance the training of the neural system was obtained from the self‐organization analysis. The resulting fuzzy‐ARTMAP–based QSPRs performed with errors that were on the average seven times smaller compared to previous published models. The use of only four molecular quantum similarity measures proved to be sufficient for building a lnγ ∞ fuzzy‐ARTMAP–based QSPR with reasonable accuracy. © 2004 American Institute of Chemical Engineers AIChE J, 50:1315–1343, 2004