Revisiting the negative example sampling problem for predicting protein–protein interactions
Author(s) -
Yungki Park,
Edward M. Marcotte
Publication year - 2011
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btr514
Subject(s) - sampling (signal processing) , computer science , confusion , sequence (biology) , population , data mining , machine learning , protein sequencing , artificial intelligence , biology , peptide sequence , medicine , genetics , psychology , environmental health , filter (signal processing) , gene , psychoanalysis , computer vision
A number of computational methods have been proposed that predict protein-protein interactions (PPIs) based on protein sequence features. Since the number of potential non-interacting protein pairs (negative PPIs) is very high both in absolute terms and in comparison to that of interacting protein pairs (positive PPIs), computational prediction methods rely upon subsets of negative PPIs for training and validation. Hence, the need arises for subset sampling for negative PPIs.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom