z-logo
Premium
Transductive Support Vector Machines: Promising Approach to Model Small and Unbalanced Datasets
Author(s) -
Kondratovich Evgeny,
Baskin Igor I.,
Varnek Alexandre
Publication year - 2013
Publication title -
molecular informatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.481
H-Index - 68
eISSN - 1868-1751
pISSN - 1868-1743
DOI - 10.1002/minf.201200135
Subject(s) - support vector machine , machine learning , artificial intelligence , cheminformatics , margin (machine learning) , computer science , weighting , quantitative structure–activity relationship , function (biology) , pattern recognition (psychology) , data mining , bioinformatics , biology , medicine , radiology , evolutionary biology
Semi‐supervised methods dealing with a combination of labeled and unlabeled data become more and more popular in machine‐learning area, but not still used in chemoinformatics. Here, we demonstrate that Transductive Support Vector Machines (TSVM) – a semi‐supervised large‐margin classification method – can be particularly useful to build the models on small and unbalanced datasets which often represent a difficult problem in QSAR. Both TSVM and ordinary SVM have been applied to build classification models on 10 DUD datasets. The “transductive effect” (the difference in predictive performance between transductive and ordinary support vector machines) was investigated as a function of: (a) active/inactive ratio, (b) descriptor weighting, and (c) the training and test sets size and composition.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here