Structure‐based identification of catalytic residues | Zendy

Yahalom Ran | Zendy; Reshef Dan | Zendy; Wiener Ayana | Zendy; Frankel Sagiv | Zendy; Kalisman Nir | Zendy; Lerner Boaz | Zendy; Keasar Chen | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Structure‐based identification of catalytic residues

Author(s) -

Yahalom Ran,

Reshef Dan,

Wiener Ayana,

Frankel Sagiv,

Kalisman Nir,

Lerner Boaz,

Keasar Chen

Publication year - 2011

Publication title -

proteins: structure, function, and bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.699

H-Index - 191

eISSN - 1097-0134

pISSN - 0887-3585

DOI - 10.1002/prot.23020

Subject(s) - structural genomics , support vector machine , classifier (uml) , artificial intelligence , computer science , machine learning , pattern recognition (psychology) , protein structure , computational biology , chemistry , biology , biochemistry

The identification of catalytic residues is an essential step in functional characterization of enzymes. We present a purely structural approach to this problem, which is motivated by the difficulty of evolution-based methods to annotate structural genomics targets that have few or no homologs in the databases. Our approach combines a state-of-the-art support vector machine (SVM) classifier with novel structural features that augment structural clues by spatial averaging and Z scoring. Special attention is paid to the class imbalance problem that stems from the overwhelming number of non-catalytic residues in enzymes compared to catalytic residues. This problem is tackled by: (1) optimizing the classifier to maximize a performance criterion that considers both Type I and Type II errors in the classification of catalytic and non-catalytic residues; (2) under-sampling non-catalytic residues before SVM training; and (3) during SVM training, penalizing errors in learning catalytic residues more than errors in learning non-catalytic residues. Tested on four enzyme datasets, one specifically designed by us to mimic the structural genomics scenario and three previously evaluated datasets, our structure-based classifier is never inferior to similar structure-based classifiers and comparable to classifiers that use both structural and evolutionary features. In addition to the evaluation of the performance of catalytic residue identification, we also present detailed case studies on three proteins. This analysis suggests that many false positive predictions may correspond to binding sites and other functional residues. A web server that implements the method, our own-designed database, and the source code of the programs are publicly available at http://www.cs.bgu.ac.il/∼meshi/functionPrediction.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research