Target–Decoy MineR for determining the biological relevance of variables in noisy datasets
Author(s) -
Cesaré OvandoVázquez,
Daniel Cázarez-García,
Robert Winkler
Publication year - 2021
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btab369
Subject(s) - decoy , relevance (law) , computer science , data mining , biology , biochemistry , receptor , political science , law
Machine learning algorithms excavate important variables from big data. However, deciding on the relevance of identified variables is challenging. The addition of artificial noise, 'decoy' variables, to raw data, 'target' variables, enables calculating a false-positive rate and a biological relevance probability for each variable rank. These scores allow the setting of a cut-off for informative variables, depending on the required sensitivity/specificity of a scientific question.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom