Human Papillomavirus Risk Type Classification from Protein Sequences Using Support Vector Machines
Author(s) -
Sun Kim,
ByoungTak Zhang
Publication year - 2006
Publication title -
lecture notes in computer science
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.249
H-Index - 400
eISSN - 1611-3349
pISSN - 0302-9743
ISBN - 3-540-33237-5
DOI - 10.1007/11732242_6
Subject(s) - human papillomavirus , support vector machine , kernel (algebra) , computer science , cervical cancer , artificial intelligence , string (physics) , type (biology) , pattern recognition (psychology) , machine learning , cancer , computational biology , medicine , biology , mathematics , discrete mathematics , mathematical physics , ecology
Infection by the human papillomavirus (HPV) is associated with the development of cervical cancer. HPV can be classified to high- and low-risk type according to its malignant potential, and detection of the risk type is important to understand the mechanisms and diagnose potential patients. In this paper, we classify the HPV protein sequences by support vector machines. A string kernel is introduced to discriminate HPV protein sequences. The kernel emphasizes amino acids pairs with a distance. In the experiments, our approach is compared with previous methods in accuracy and F1-score, and it has showed better performance. Also, the prediction results for unknown HPV types are presented.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom