Premium
Exploration of Linear and Nonlinear Modeling Techniques to Predict of Retention Index of Essential Oils
Author(s) -
Noorizadeh Hadi,
Farmany Abbas
Publication year - 2010
Publication title -
journal of the chinese chemical society
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.329
H-Index - 45
eISSN - 2192-6549
pISSN - 0009-4536
DOI - 10.1002/jccs.201000188
Subject(s) - chemistry , kovats retention index , correlation coefficient , essential oil , quantitative structure–activity relationship , linear regression , artificial neural network , biological system , partial least squares regression , test set , kernel (algebra) , molecular descriptor , coefficient of determination , artificial intelligence , training set , chromatography , pattern recognition (psychology) , statistics , gas chromatography , mathematics , stereochemistry , computer science , combinatorics , biology
The essential oils are widely used in pharmaceutical, cosmetic and perfume industry, and for flavouring and preservation of several food products. GC and GC‐MS is one of the most powerful tools in analytical volatile compound (such as essential oils). Genetic algorithm and multiple linear regression (GA‐MLR), partial least square (GA‐PLS), kernel PLS (GA‐KPLS) and Levenberg‐Marquardt artificial neural network (L‐M ANN) techniques were used to investigate the correlation between retention index (RI) and descriptors for 113 diverse compounds in essential oils of four Teucrium species which obtained by GC and GC‐MS. Five simple one‐ and two‐dimensional descriptors were selected by GA‐KPLS and considered as input for developing L‐M ANN. The applied internal (leave‐group‐out cross validation (LGO‐CV)) and external (test set) validation methods were used for the predictive power of four models. The correlation coefficient LGO‐CV (Q 2 ) between experimental and predicted RI for training and test sets by GA‐MLR, GA‐PLS, GA‐KPLS and L‐M ANN was 0.91, 0.92, 0.96 and 0.99 (for 88 compounds), 0.88, 0.91, 0.94 and 0.97 (for 25 compounds), respectively. This indicates that L‐M ANN can be used as an alternative modeling tool for quantitative structure‐property/retention relationship (QSPR/QSRR) studies. This is the first research on the QSRR of the essential oil compounds against the RI using the GA‐KPLS and L‐M ANN.