Using physical potentials and learned models to distinguish native binding interfaces from de novo designed interfaces that do not bind

Author(s) - Demerdash Omar N. A., Mitchell Julie C.

Publication year - 2013

Publication title - Proteins: Structure, Function, and Bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.699

H-Index - 191

eISSN - 1097-0134

pISSN - 0887-3585

DOI - 10.1002/prot.24337

Subject(s) - electrostatics, support vector machine, computer science, kernel (algebra), hydrogen bond, set (abstract data type), rational design, machine learning, test set, artificial intelligence, biological system, chemistry, nanotechnology, materials science, biology, molecule, mathematics, organic chemistry, combinatorics, programming language

Protein–protein interactions are a fundamental aspect of many biological processes. The advent of recombinant protein and computational techniques has allowed for the rational design of proteins with novel binding capabilities. It is therefore desirable to predict which designed proteins are capable of binding in vitro. To this end, we have developed a learned classification model that combines energetic and non‐energetic features. Our feature set is adapted from specialized potentials for aromatic interactions, hydrogen bonds, electrostatics, shape, and desolvation. A binding model built on these features was initially developed for CAPRI Round 21, achieving top results in the independent assessment. Here, we present a more thoroughly trained and validated model, and compare various support‐vector machine kernels. The Gaussian kernel model classified both high‐resolution complexes and designed nonbinders with 79–86% accuracy on independent test data. We also observe that multiple physical potentials for dielectric‐dependent electrostatics and hydrogen bonding contribute to the enhanced predictive accuracy, suggesting that their combined information is much greater than that of any single energetics model. We also study the change in predictive performance as the model features or training data are varied, observing unusual patterns of prediction in designed interfaces as compared with other data types. Proteins 2013; 81:1919–1930. © 2013 Wiley Periodicals, Inc.
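
The abstract compares support-vector machine kernels and reports results for a Gaussian kernel. As a rough illustration only, the sketch below shows how linear, polynomial, and Gaussian kernels differ and how a trained SVM scores a query point. Everything specific here is an assumption and not taken from the paper: the two-dimensional feature vectors, the `gamma`, `degree`, and `coef0` values, the support vectors, and the dual coefficients are all hypothetical placeholders.

```python
import math


def linear_kernel(x, y):
    """Plain dot product between two feature vectors."""
    return sum(a * b for a, b in zip(x, y))


def polynomial_kernel(x, y, degree=3, coef0=1.0):
    """Polynomial kernel: (x . y + coef0) ** degree."""
    return (linear_kernel(x, y) + coef0) ** degree


def gaussian_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel: exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def decision_value(x, support_vectors, dual_coefs, bias, kernel):
    """SVM decision function: sum_i (alpha_i * y_i) * K(sv_i, x) + bias."""
    total = sum(c * kernel(sv, x) for sv, c in zip(support_vectors, dual_coefs))
    return total + bias


# Hypothetical, hand-picked values standing in for a trained model.
# Each vector is an assumed pair of scaled interface descriptors.
support_vectors = [(-1.0, -0.5), (1.0, 0.5)]
dual_coefs = [1.0, -1.0]  # alpha_i * y_i: +1 side = binder, -1 side = nonbinder
bias = 0.0

query = (-0.8, -0.4)  # hypothetical interface to classify
score = decision_value(query, support_vectors, dual_coefs, bias, gaussian_kernel)
label = "binder" if score > 0 else "nonbinder"
print(label, round(score, 3))
```

With a Gaussian kernel the score depends only on the distance from the query to each support vector, so a nearby support vector dominates the decision. The linear and polynomial kernels instead depend on the dot product, which is one reason kernel choice can change classification behavior on the same feature set.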
