Premium
Application of PC‐ANN to Acidity Constant Prediction of Various Phenols and Benzoic Acids in Water
Author(s) -
HABIBIYANGJEH Aziz,
ESMAILIAN Mahdi
Publication year - 2008
Publication title -
chinese journal of chemistry
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.28
H-Index - 41
eISSN - 1614-7065
pISSN - 1001-604X
DOI - 10.1002/cjoc.200890162
Subject(s) - principal component analysis , chemistry , benzoic acid , phenols , principal component regression , artificial neural network , set (abstract data type) , linear regression , quantitative structure–activity relationship , molecular descriptor , biological system , ranking (information retrieval) , constant (computer programming) , data set , statistics , artificial intelligence , stereochemistry , mathematics , organic chemistry , computer science , biology , programming language
Principal component regression (PCR) and principal component‐artificial neural network (PC‐ANN) models were applied to prediction of the acidity constant for various benzoic acids and phenols (242 compounds) in water at 25 °C. A large number of theoretical descriptors were calculated for each molecule. The first fifty principal components (PC) were found to explain more than 95% of variances in the original data matrix. From the pool of these PC's, the eigenvalue ranking method was employed to select the best set of PC for PCR and PC‐ANN models. The PC‐ANN model with architecture 47‐20‐1 was generated using 47 principal components as inputs and its output is p K a . For evaluation of the predictive power of the PCR and PC‐ANN models, p K a values of 37 compounds in the prediction set were calculated. Mean percentage deviation (MPD) for PCR and PC‐ANN models are 18.45 and 0.6448, respectively. These improvements are due to the fact that the p K a of the compounds demonstrate non‐linear correlations with the principal components. Comparison of the results obtained by the models reveals superiority of the PC‐ANN model relative to the PCR model.