Premium
Database and search techniques for two‐dimensional gel protein data: A comparison of paradigms for exploratory data analysis and prospects for biological modeling
Author(s) -
Lemkin Peter F.,
Lester Eric P.
Publication year - 1989
Publication title -
electrophoresis
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.666
H-Index - 158
eISSN - 1522-2683
pISSN - 0173-0835
DOI - 10.1002/elps.1150100207
Subject(s) - computer science , database , data mining , exploratory data analysis , context (archaeology) , object (grammar) , artificial intelligence , biology , paleontology
Two‐dimensional (2‐D) polyacrylamide gel electrophoresis can detect thousands of polypeptides, separating them by apparent molecular weight ( M r ) and isoelectric point (p I ). Thus it provides a more realistic and global view of cellular genetic expression than any other technique. This technique has been useful for finding sets of key proteins of biological significance. However, a typical experiment with more than a few gels often results in an unwieldy data management problem. In this paper, the GELLAB‐II system is discussed with respect to how data reduction and exploratory data analysis can be aided by computer data management and statistical search techniques. By encoding the gel patterns in a “three‐dimensional” (3‐D) database, an exploratory data analysis can be carried out in an environment that might be called a “spread sheet for 2‐D gel protein data”. From such databases, complex parametric network models of protein expression during events such as differentiation might be constructed. For this, 2‐D gel databases must be able to include data from other domains external to the gel itself. Because of the increasing complexity of such databases, new tools are required to help manage this complexity. Two such tools, object‐oriented databases and expert‐system rule‐based analysis, are discussed in this context. Comparisons are made between GELLAB and other 2‐D gel databases analysis systems to illustrate some of the analysis paradigms common to these systems and where this technology may be heading.