Automatic selection of reference taxa for protein–protein interaction prediction with phylogenetic profiling
Author(s) - Martin Simonsen, Stefan Maetschke, Mark A. Ragan
Publication year - 2012
Publication title - Bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btr720
Subject(s) - phylogenetic tree, computer science, selection (genetic algorithm), profiling (computer programming), taxon, machine learning, data mining, tree (set theory), artificial intelligence, computational biology, biology, genetics, ecology, mathematics, gene, mathematical analysis, operating system
Phylogenetic profiling methods can achieve good accuracy in predicting protein-protein interactions, especially in prokaryotes. Recent studies have shown that the choice of reference taxa (RT) is critical for accurate prediction, but with more than 2500 fully sequenced taxa publicly available, identifying the most informative RT is becoming increasingly difficult. Previous studies on the selection of RT have provided guidelines for manual taxon selection and for eliminating closely related taxa. However, no general strategy for automatic selection of RT is currently available.
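To make the role of the reference taxa concrete, the following is a minimal sketch of phylogenetic profiling, not the authors' method. It assumes binary presence/absence profiles and Jaccard similarity as the comparison measure; the abstract does not specify either. The protein names, the toy profiles, and the helper names `jaccard` and `restrict` are all hypothetical.

```python
from typing import Dict, List, Sequence


def jaccard(a: Sequence[int], b: Sequence[int]) -> float:
    """Jaccard similarity of two binary presence/absence profiles."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0


def restrict(profile: Sequence[int], taxa: Sequence[int]) -> List[int]:
    """Keep only the profile entries for the chosen reference taxa."""
    return [profile[i] for i in taxa]


# Hypothetical toy data: one profile per protein, one column per taxon.
# 1 means a homolog is present in that taxon, 0 means it is absent.
profiles: Dict[str, List[int]] = {
    "protA": [1, 1, 0, 0, 1, 1],
    "protB": [1, 1, 0, 0, 1, 0],
    "protC": [1, 0, 1, 1, 1, 1],
}

all_taxa = [0, 1, 2, 3, 4, 5]
# Assumed poor choice of RT: every protein is present in both taxa,
# so these columns cannot distinguish any pair of proteins.
uninformative_taxa = [0, 4]

# Similarity over all six taxa separates the A-B pair from the A-C pair.
sim_ab_all = jaccard(profiles["protA"], profiles["protB"])
sim_ac_all = jaccard(profiles["protA"], profiles["protC"])

# Over the uninformative subset, both pairs look identical.
sim_ab_sub = jaccard(
    restrict(profiles["protA"], uninformative_taxa),
    restrict(profiles["protB"], uninformative_taxa),
)
sim_ac_sub = jaccard(
    restrict(profiles["protA"], uninformative_taxa),
    restrict(profiles["protC"], uninformative_taxa),
)
```

In this toy setting, the full set of taxa ranks the protA/protB pair above the protA/protC pair, while the two-taxon subset gives both pairs the same score. That is the sense in which the choice of RT can change which interactions a profiling method predicts.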