
Protein Secondary Structure Prediction Using RT-RICO: A Rule-Based Approach
Author(s) -
Leong Lee,
Jennifer L. Leopold,
Cyriac Kandoth,
Ronald Frank
Publication year - 2010
Publication title -
the open bioinformatics journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.259
H-Index - 8
ISSN - 1875-0362
DOI - 10.2174/1875036201004010017
Subject(s) - protein structure prediction , computer science , rule induction , protein secondary structure , sequence (biology) , data mining , protein structure , protein sequencing , artificial intelligence , algorithm , machine learning , peptide sequence , chemistry , biochemistry , gene
Protein structure prediction has always been an important research area in biochemistry. In particular, the prediction of protein secondary structure has been a well-studied research topic. The experimental methods currently used to determine protein structure are accurate, yet costly both in terms of equipment and time. Despite the recent breakthrough of combining multiple sequence alignment information and artificial intelligence algorithms to predict protein secondary structure, the Q 3 accuracy of various computational prediction methods rarely has exceeded 75%. In this paper, a newly developed rule-based data-mining approach called RT-RICO (Relaxed Threshold Rule Induction from Coverings) is presented. This method identifies dependencies between amino acids in a protein sequence and generates rules that can be used to predict secondary structure. RT-RICO achieved a Q 3 score of 81.75% on the standard test dataset RS126 and a Q 3 score of 79.19% on the standard test dataset CB396, an improvement over comparable computational methods.