Open Access
Applicability Domain of Polyparameter Linear Free Energy Relationship Models Evaluated by Leverage and Prediction Interval Calculation
Author(s) - Satoshi Endo
Publication year - 2022
Publication title - Environmental Science and Technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.851
H-Index - 397
eISSN - 1520-5851
pISSN - 0013-936X
DOI - 10.1021/acs.est.2c00865
Subject(s) - extrapolation, leverage (statistics), applicability domain, calibration, mathematics, mean squared error, interpolation (computer graphics), taft equation, range (aeronautics), statistics, algorithm, computer science, chemistry, quantitative structure–activity relationship, artificial intelligence, machine learning, materials science, composite material, motion (physics), stereochemistry, substituent
Polyparameter linear free energy relationships (PP-LFERs) are accurate and robust models employed to predict equilibrium partition coefficients (K) of organic chemicals. The accuracy of predictions by a PP-LFER depends on the composition of the respective calibration data set. Generally, extrapolation outside the domain defined by the calibration data is likely to be less accurate than interpolation. In this study, the applicability domain (AD) of PP-LFERs was systematically evaluated by calculating the leverage (h) and prediction interval (PI). Repeated simulations with experimental data showed that the root mean squared error of predictions increased with h. However, the analysis also showed that PP-LFERs calibrated with a large number (e.g., 100) of training data were highly robust against extrapolation error. For such PP-LFERs, the common definition of extrapolation (h > 3h_mean, where h_mean is the mean h of all training compounds) may be excessively strict. Alternatively, the PI is proposed as a metric to define the AD of PP-LFERs, as it provides a concrete estimate of the error range that agrees well with the observed errors, even for extreme extrapolations. Additionally, published PP-LFERs were evaluated in terms of their AD using the new concept of AD probes, which indicated the varying predictive performance of PP-LFERs in the existing literature for environmentally relevant compounds.
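To make the two metrics in the abstract concrete, the following is a minimal sketch of how leverage and a prediction interval can be computed for a multiple linear regression. It uses the standard ordinary least squares expressions, h = x^T (X^T X)^-1 x and PI = y_hat ± t * s * sqrt(1 + h), which may differ in detail from the calculations in the paper. Everything here is an assumption made for illustration: the function names, the synthetic random data (not experimental partition coefficients), the five hypothetical descriptor columns, and the approximate t value of 1.99 for 94 degrees of freedom at the 95% level, which in practice would come from a t table or a statistics library.

```python
import numpy as np


def fit_ols(X, y):
    """Fit y = b0 + X @ b by ordinary least squares.

    Returns the coefficients, the residual standard error s, and
    (X1^T X1)^-1, where X1 is X with a leading column of ones.
    """
    X1 = np.column_stack([np.ones(len(X)), X])
    beta, *_ = np.linalg.lstsq(X1, y, rcond=None)
    resid = y - X1 @ beta
    n, p = X1.shape
    s = np.sqrt(resid @ resid / (n - p))
    xtx_inv = np.linalg.inv(X1.T @ X1)
    return beta, s, xtx_inv


def leverage(x_new, xtx_inv):
    """Leverage h of one descriptor vector relative to the training set."""
    x1 = np.concatenate([[1.0], np.asarray(x_new, dtype=float)])
    return float(x1 @ xtx_inv @ x1)


def prediction_interval(x_new, beta, s, xtx_inv, t_crit):
    """Prediction interval for a new observation: y_hat +/- t*s*sqrt(1+h)."""
    x1 = np.concatenate([[1.0], np.asarray(x_new, dtype=float)])
    y_hat = float(x1 @ beta)
    half_width = t_crit * s * np.sqrt(1.0 + leverage(x_new, xtx_inv))
    return y_hat - half_width, y_hat + half_width


# Synthetic illustration only: 100 "training compounds", 5 descriptors.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(100, 5))
true_coefs = np.array([0.5, -1.0, 2.0, 0.3, 1.5])  # arbitrary values
y_train = 0.2 + X_train @ true_coefs + rng.normal(scale=0.2, size=100)

beta, s, xtx_inv = fit_ols(X_train, y_train)

# Mean training leverage equals p/n (p counts the intercept as well).
h_train = np.array([leverage(x, xtx_inv) for x in X_train])
h_mean = h_train.mean()

# One query at the training centroid, one far outside the training range.
x_center = X_train.mean(axis=0)
x_far = np.full(5, 10.0)
h_center = leverage(x_center, xtx_inv)
h_far = leverage(x_far, xtx_inv)

T_CRIT = 1.99  # assumed approximate two-sided 95% t value, 94 d.o.f.
pi_center = prediction_interval(x_center, beta, s, xtx_inv, T_CRIT)
pi_far = prediction_interval(x_far, beta, s, xtx_inv, T_CRIT)

print(f"h_mean = {h_mean:.3f}, cutoff 3*h_mean = {3 * h_mean:.3f}")
print(f"centroid: h = {h_center:.3f}, PI width = {pi_center[1] - pi_center[0]:.3f}")
print(f"far point: h = {h_far:.3f}, PI width = {pi_far[1] - pi_far[0]:.3f}")
```

In this sketch the leverage cutoff gives only a yes/no flag for extrapolation, whereas the PI width grows continuously with h, which is the property the abstract highlights when it proposes the PI as a metric for the AD.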
