z-logo
open-access-imgOpen Access
Positive-unlabeled learning for disease gene identification
Author(s) -
Peng Yang,
Xiaoli Li,
Jian-Ping Mei,
Chee-Keong Kwoh,
See-Kiong Ng
Publication year - 2012
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/bts504
Subject(s) - classifier (uml) , support vector machine , artificial intelligence , machine learning , computer science , set (abstract data type) , gene , disease , training set , identification (biology) , test set , pattern recognition (psychology) , computational biology , biology , genetics , medicine , pathology , programming language , botany
Identifying disease genes from human genome is an important but challenging task in biomedical research. Machine learning methods can be applied to discover new disease genes based on the known ones. Existing machine learning methods typically use the known disease genes as the positive training set P and the unknown genes as the negative training set N (non-disease gene set does not exist) to build classifiers to identify new disease genes from the unknown genes. However, such kind of classifiers is actually built from a noisy negative set N as there can be unknown disease genes in N itself. As a result, the classifiers do not perform as well as they could be.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom