z-logo
Premium
Predicting protein p K a by environment similarity
Author(s) -
Milletti Francesca,
Storchi Loriano,
Cruciani Gabriele
Publication year - 2009
Publication title -
proteins: structure, function, and bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.699
H-Index - 191
eISSN - 1097-0134
pISSN - 0887-3585
DOI - 10.1002/prot.22363
Subject(s) - similarity (geometry) , mean squared error , chemistry , metric (unit) , residue (chemistry) , root mean square , biological system , protein structure prediction , training set , fingerprint (computing) , data mining , protein structure , computer science , mathematics , artificial intelligence , statistics , biochemistry , engineering , biology , operations management , image (mathematics) , electrical engineering
A statistical method to predict protein p K a has been developed by using the 3D structure of a protein and a database of 434 experimental protein p K a values. Each p K a in the database is associated with a fingerprint that describes the chemical environment around an ionizable residue. A computational tool, MoKaBio, has been developed to identify automatically ionizable residues in a protein, generate fingerprints that describe the chemical environment around such residues, and predict p K a from the experimental p K a values in the database by using a similarity metric. The method, which retrieved the p K a of 429 of the 434 ionizable sites in the database correctly, was crossvalidated by leave‐one‐out and yielded root mean square error (RMSE) = 0.95, a result that is superior to that obtained by using the Null Model (RMSE 1.07) and other well‐established protein p K a prediction tools. This novel approach is suitable to rationalize protein p K a by comparing the region around the ionizable site with similar regions whose ionizable site p K a is known. The p K a of residues that have a unique environment not represented in the training set cannot be predicted accurately, however, the method offers the advantage of being trainable to increase its predictive power. Proteins 2009. © 2009 Wiley‐Liss, Inc.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom