On further application of  r        m    2     as a metric for validation of QSAR models | Zendy

Mitra Indrani | Zendy; Roy Partha Pratim | Zendy; Kar Supratik | Zendy; Ojha Probir Kumar | Zendy; Roy Kunal | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

On further application of r m 2 as a metric for validation of QSAR models

Author(s) -

Mitra Indrani,

Roy Partha Pratim,

Kar Supratik,

Ojha Probir Kumar,

Roy Kunal

Publication year - 2010

Publication title -

journal of chemometrics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.47

H-Index - 92

eISSN - 1099-128X

pISSN - 0886-9383

DOI - 10.1002/cem.1268

Subject(s) - quantitative structure–activity relationship , model validation , set (abstract data type) , computer science , metric (unit) , test set , data mining , predictability , loo , model selection , data set , cross validation , artificial intelligence , machine learning , mathematics , statistics , engineering , operations management , data science , programming language

Validation is a crucial aspect for quantitative structure–activity relationship (QSAR) model development. External validation is considered, in general, as the most conclusive proof of predictive capacity of a QSAR model. In the absence of truly external data set, external validation is usually performed on test set compounds, which are members of the original data set but not used in model development exercise. In the case of small data sets, QSAR researchers experience problem in model development due to the fact that the developed models may be less reliable on account of the small number of training set compounds and such models may also show poor external predictability because the models may not have captured all necessary features required for the particular structure–activity relationships. The present paper attempts to show that ‘true r m 2(LOO) ’ statistic calculated based on the model derived from the undivided data set with application of variable selection strategy at each cycle of leave‐one‐out (LOO) validation may reflect external validation characteristics of the developed model thus obviating the requirement of splitting of the data set into training and test sets. This approach may be helpful in the case of small data sets as it uses all available data for model development and validation thus making the resulting model more reliable. Copyright © 2009 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research