DP-Bind: a web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins
Author(s) -
Seungwoo Hwang,
Zhenkun Gou,
Igor B. Kuznetsov
Publication year - 2007
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btl672
Subject(s) - web server , logistic regression , sequence (biology) , computer science , support vector machine , artificial intelligence , machine learning , kernel (algebra) , dna , regression , web site , consensus sequence , computational biology , bioinformatics , biology , genetics , mathematics , base sequence , the internet , world wide web , statistics , combinatorics
This article describes DP-Bind, a web server for predicting DNA-binding sites in a DNA-binding protein from its amino acid sequence. The web server implements three machine learning methods: support vector machine, kernel logistic regression and penalized logistic regression. Prediction can be performed using either the input sequence alone or an automatically generated profile of evolutionary conservation of the input sequence in the form of PSI-BLAST position-specific scoring matrix (PSSM). PSSM-based kernel logistic regression achieves the accuracy of 77.2%, sensitivity of 76.4% and specificity of 76.6%. The outputs of all three individual methods are combined into a consensus prediction to help identify positions predicted with high level of confidence.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom