
Pathway crosstalk effects: shrinkage and disentanglement using a Bayesian hierarchical model
Author(s) -
Alin Tomoiaga,
Peter H. Westfall,
Michèle L. Donato,
Sorin Drăghici,
Sonia S. Hassan,
Roberto Romero,
Paola Tellaroli
Publication year - 2016
Publication title -
statistics in biosciences
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.57
H-Index - 14
eISSN - 1867-1772
pISSN - 1867-1764
DOI - 10.1007/s12561-016-9160-1
Subject(s) - crosstalk , computer science , bayesian probability , pathway analysis , type i and type ii errors , computational biology , data mining , artificial intelligence , statistics , biology , mathematics , genetics , gene expression , physics , optics , gene
Identifying the biological pathways that are related to various clinical phenotypes is an important concern in biomedical research. Based on estimated expression levels and/or p-values, over-representation analysis (ORA) methods provide rankings of pathways, but they are tainted because pathways overlap. This crosstalk phenomenon has not been rigorously studied and classical ORA does not take into consideration: (i) that crosstalk effects in cases of overlapping pathways can cause incorrect rankings of pathways, (ii) that crosstalk effects can cause both excess type I errors and type II errors, (iii) that rankings of small pathways are unreliable and (iv) that type I error rates can be inflated due to multiple comparisons of pathways. We develop a Bayesian hierarchical model that addresses these problems, providing sensible estimates and rankings, and reducing error rates. We show, on both real and simulated data, that the results of our method are more accurate than the results produced by the classical over-representation analysis, providing a better understanding of the underlying biological phenomena involved in the phenotypes under study. The R code and the binary datasets for implementing the analyses described in this article are available online at: http://www.eng.wayne.edu/page.php?id=6402.