z-logo
open-access-imgOpen Access
Purely Structural Protein Scoring Functions Using Support Vector Machine and Ensemble Learning
Author(s) -
Shokoufeh Mirzaei,
Tomer Sidi,
Chen Keasar,
Silvia Crivelli
Publication year - 2016
Publication title -
ieee/acm transactions on computational biology and bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.745
H-Index - 71
eISSN - 1557-9964
pISSN - 1545-5963
DOI - 10.1109/tcbb.2016.2602269
Subject(s) - decoy , computer science , machine learning , support vector machine , artificial intelligence , selection (genetic algorithm) , function (biology) , quality (philosophy) , similarity (geometry) , protein structure prediction , data mining , casp , protein structure , image (mathematics) , biochemistry , chemistry , philosophy , receptor , physics , epistemology , nuclear magnetic resonance , evolutionary biology , biology
The function of a protein is determined by its structure, which creates a need for efficient methods of protein structure determination to advance scientific and medical research. Because current experimental structure determination methods carry a high price tag, computational predictions are highly desirable. Given a protein sequence, computational methods produce numerous 3D structures known as decoys. Selection of the best quality decoys is both challenging and essential as the end users can handle only a few ones. Therefore, scoring functions are central to decoy selection. They combine measurable features into a single number indicator of decoy quality. Unfortunately, current scoring functions do not consistently select the best decoys. Machine learning techniques offer great potential to improve decoy scoring. This paper presents two machine-learning based scoring functions to predict the quality of proteins structures, i.e., the similarity between the predicted structure and the experimental one without knowing the latter. We use different metrics to compare these scoring functions against three state-of-the-art scores. This is a first attempt at comparing different scoring functions using the same non-redundant dataset for training and testing and the same features. The results show that adding informative features may be more significant than the method used.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom