A Bayesian extension of the hypergeometric test for functional enrichment analysis
Author(s) - Cao Jing, Zhang Song
Publication year - 2014
Publication title - Biometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.298
H-Index - 130
eISSN - 1541-0420
pISSN - 0006-341X
DOI - 10.1111/biom.12122
Subject(s) - hypergeometric distribution , constraint (computer aided design) , computer science , inference , extension (predicate logic) , bayesian probability , gene ontology , bayesian network , function (biology) , property (philosophy) , mathematics , artificial intelligence , statistics , biology , gene , genetics , philosophy , gene expression , geometry , epistemology , programming language
Summary - Functional enrichment analysis is conducted on high‐throughput data to provide functional interpretation for a list of genes or proteins that share a common property, such as being differentially expressed (DE). The hypergeometric P‐value has been widely used to investigate whether genes from pre‐defined functional terms, for example, Gene Ontology (GO) terms, are enriched in the DE genes. It has three limitations: (1) it is computed independently for each term, thus neglecting biological dependence; (2) it is subject to a size constraint that tends to favor less‐specific terms; (3) it uses the same information repeatedly, because annotations overlap under the true‐path rule. We propose a Bayesian approach based on the non‐central hypergeometric model. The GO dependence structure is incorporated through a prior on the non‐centrality parameters. The likelihood function does not include overlapping information. Inference about enrichment is based on posterior probabilities, which are not subject to a size constraint. This method can detect moderate but consistent enrichment signals and identify sets of closely related and biologically meaningful functional terms rather than isolated terms. We also describe the basic assumptions and implementation ideas behind different methods to provide some theoretical insight, which is demonstrated via a simulation study. An application to real data is presented.
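To make the quantities in the summary concrete, the following is a minimal Python sketch of the classical hypergeometric enrichment P‐value for a single term, together with the probability mass function of a non‐central hypergeometric distribution. It is an illustration only, not the authors' implementation. The function and argument names are hypothetical, and the use of Fisher's form of the non‐central distribution (with an odds‐ratio parameter) is an assumption, since the summary does not specify which form is used. The prior on the non‐centrality parameters and the posterior computation are not shown.

```python
from math import comb


def hypergeom_enrichment_pvalue(total_genes, term_genes, de_genes, de_in_term):
    """Upper-tail probability P(X >= de_in_term) under the central
    hypergeometric model: de_genes are drawn without replacement from
    total_genes, of which term_genes are annotated to the term."""
    upper = min(term_genes, de_genes)
    denom = comb(total_genes, de_genes)
    tail = sum(
        comb(term_genes, x) * comb(total_genes - term_genes, de_genes - x)
        for x in range(de_in_term, upper + 1)
    )
    return tail / denom


def fisher_noncentral_pmf(x, total_genes, term_genes, de_genes, odds_ratio):
    """P(X = x) under Fisher's non-central hypergeometric distribution
    (assumed form). odds_ratio = 1 recovers the central model; values
    above 1 shift mass toward larger overlaps. Intended for small
    examples: very large counts can overflow the float arithmetic."""
    lo = max(0, de_genes - (total_genes - term_genes))
    hi = min(term_genes, de_genes)
    if x < lo or x > hi:
        return 0.0

    def weight(y):
        return (
            comb(term_genes, y)
            * comb(total_genes - term_genes, de_genes - y)
            * odds_ratio ** y
        )

    norm = sum(weight(y) for y in range(lo, hi + 1))
    return weight(x) / norm


if __name__ == "__main__":
    # Toy numbers (made up for illustration): 20 genes in total,
    # 5 annotated to the term, 5 DE genes, 3 of them in the term.
    print(hypergeom_enrichment_pvalue(20, 5, 5, 3))
    print(fisher_noncentral_pmf(3, 20, 5, 5, odds_ratio=3.0))
```

With `odds_ratio=1.0` the non‐central mass function reduces to the central hypergeometric one, so summing it over the upper tail reproduces the classical P‐value. A non‐centrality parameter above 1 corresponds to enrichment of the term among the DE genes.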
