Gentle Introduction to the Statistical Foundations of False Discovery Rate in Quantitative Proteomics
Author(s) -
Thomas Bürger
Publication year - 2017
Publication title -
journal of proteome research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.644
H-Index - 161
eISSN - 1535-3907
pISSN - 1535-3893
DOI - 10.1021/acs.jproteome.7b00170
Subject(s) - false discovery rate , intuition , vocabulary , data science , identification (biology) , computer science , context (archaeology) , proteomics , epistemology , linguistics , biology , philosophy , biochemistry , gene , paleontology , botany
The vocabulary of theoretical statistics can be difficult to embrace from the viewpoint of computational proteomics research, even though the notions it conveys are essential to publication guidelines. For example, "adjusted p-values", "q-values", and "false discovery rates" are essentially similar concepts, whereas "false discovery rate" and "false discovery proportion" must not be confused, even though "rate" and "proportion" are related in everyday language. In the interdisciplinary context of proteomics, such subtleties may cause misunderstandings. This article aims to provide an easy-to-understand explanation of these four notions (and a few other related ones). Their statistical foundations are dealt with from a perspective that largely relies on intuition, addressing mainly protein quantification but also, to some extent, peptide identification. In addition, a clear distinction is made between concepts that define an individual property (i.e., related to a peptide or a protein) and those that define a set property (i.e., related to a list of peptides or proteins).
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom