z-logo
Premium
Target‐Driven Subspace Mapping Methods and Their Applicability Domain Estimation
Author(s) -
Soto Axel J.,
Vazquez Gustavo E.,
Strickert Marc,
Ponzoni Ignacio
Publication year - 2011
Publication title -
molecular informatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.481
H-Index - 68
eISSN - 1868-1751
pISSN - 1868-1743
DOI - 10.1002/minf.201100053
Subject(s) - computer science , applicability domain , data mining , subspace topology , cheminformatics , domain (mathematical analysis) , interpolation (computer graphics) , machine learning , probabilistic logic , set (abstract data type) , artificial intelligence , mathematics , quantitative structure–activity relationship , bioinformatics , motion (physics) , mathematical analysis , biology , programming language
Abstract This work describes a methodology for assisting virtual screening of drugs during the early stages of the drug development process. This methodology is proposed to improve the reliability of in silico property prediction and it is structured in two steps. Firstly, a transformation is sought for mapping a high‐dimensional space defined by potentially redundant or irrelevant molecular descriptors into a low‐dimensional application‐related space. For this task we evaluate three different target‐driven subspace mapping methods, out of which we highlight the recent Correlative Matrix Mapping (CMM) as the most stable. Secondly, we apply an applicability domain model on the low‐dimensional space for assessing confidentiality of compound classification. By a probabilistic framework the applicability domain approach identifies poorly represented compounds in the training set (extrapolation problems) and regions in the space where the uncertainty about the correct class is higher than normal (interpolation problems). This two‐step approach represents an important contribution to the development of confident prediction tools in the chemoinformatics area, where the field is in need of both interpretable models and methods that estimate the confidence of predictions.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here