A machine discovery from amino acid sequences by decision trees over regular patterns
Author(s) -
Setsuo Arikawa,
Satoru Miyano,
Ayumi Shinohara,
Satoru Kuhara,
Yasuhito Mukouchi,
Takeshi Shinohara
Publication year - 1993
Publication title -
new generation computing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.277
H-Index - 27
eISSN - 1882-7055
pISSN - 0288-3635
DOI - 10.1007/bf03037183
Subject(s) - computer science , decision tree , artificial intelligence , machine learning , class (philosophy) , identification (biology) , time complexity , domain (mathematical analysis) , tree (set theory) , theoretical computer science , algorithm , combinatorics , mathematics , biology , mathematical analysis , botany
This paper describes a machine learning system that discovered a “negative motif”, in transmembrane domain identification from amino acid sequences, and reports its experiments on protein data using PIR database. We introduce a decision tree whose nodes are labeled with regular patterns. As a hypothesis, the system produces such a decision tree for a small number of randomly chosen positive and negative examples from PIR. Experiments show that our system finds reasonable hypotheses very successfully. As a theoretical foundation, we show that the class of languages defined by decesion trees of depth at mostd overk-variable regular patterns is polynomial-time learnable in the sense of probably approximately correct (PAC) learning for any fixedd, k≥0.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom