Application of latent semantic analysis to protein remote homology detection
Author(s) -
Qiwen Dong,
Xiaolong Wang,
Lei Lin
Publication year - 2005
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/bti801
Subject(s) - computer science , latent semantic analysis , support vector machine , discriminative model , pattern recognition (psychology) , artificial intelligence , feature extraction , feature vector , protein sequencing , kernel (algebra) , peptide sequence , mathematics , biology , genetics , combinatorics , gene
Remote homology detection between protein sequences is a central problem in computational biology. The discriminative method such as the support vector machine (SVM) is one of the most effective methods. Many of the SVM-based methods focus on finding useful representations of protein sequence, using either explicit feature vector representations or kernel functions. Such representations may suffer from the peaking phenomenon in many machine-learning methods because the features are usually very large and noise data may be introduced. Based on these observations, this research focuses on feature extraction and efficient representation of protein vectors for SVM protein classification.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom