Data mining the protein data bank: Residue interactions | Zendy

Oldfield T.J. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Data mining the protein data bank: Residue interactions

Author(s) -

Oldfield T.J.

Publication year - 2002

Publication title -

proteins: structure, function, and bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.699

H-Index - 191

eISSN - 1097-0134

pISSN - 0887-3585

DOI - 10.1002/prot.10221

Subject(s) - protein data bank , computer science , data mining , protein data bank (rcsb pdb) , set (abstract data type) , context (archaeology) , protein structure , information retrieval , computational biology , chemistry , biology , paleontology , biochemistry , stereochemistry , programming language

The protein databank contains a vast wealth of structural and functional information. The analysis of this macromolecular information has been the subject of considerable work in order to advance knowledge beyond the collection of molecular coordinates. This article presents a method that determines local structural information within proteins using mathematical data mining techniques. The mine program described returns many known configurations of residues such as the catalytic triad, metal binding sites and the N‐linked glycosylation site; as well as many other multiple residue interactions not previously categorized. Because mathematical constructs are used as targets, this method can identify new information not previously known, and also provide unbiased results of typical structure and their expected deviations. Because the results are defined mathematically, they cannot indicate the biological implications of the results. Therefore two support programs are described that provide insight into the biological context for the mine results. The first allows a weighted RMSD search between a template set of coordinates and a list of PDB files, and the second allows the labeling of a protein with the template results from mining to aid in the classification of this protein. Proteins 2002;49:510–528. © 2002 Wiley‐Liss, Inc.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research