Premium
Some Bayesian biclustering methods: Modeling and inference
Author(s) -
Chakraborty Abhishek,
Vardeman Stephen B.
Publication year - 2022
Publication title -
statistical analysis and data mining: the asa data science journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.381
H-Index - 33
eISSN - 1932-1872
pISSN - 1932-1864
DOI - 10.1002/sam.11584
Subject(s) - biclustering , cluster analysis , computer science , row , data mining , row and column spaces , set (abstract data type) , pattern recognition (psychology) , homogeneous , bayesian probability , inference , bayesian inference , cluster (spacecraft) , bayes' theorem , artificial intelligence , mathematics , correlation clustering , cure data clustering algorithm , database , combinatorics , programming language
Standard one‐way clustering methods form homogeneous groups in a set of objects. Biclustering (or, two‐way clustering) methods simultaneously cluster rows and columns of a rectangular data array in such a way that responses are homogeneous for all row‐cluster by column‐cluster cells. We propose a Bayes methodology for biclustering and corresponding MCMC algorithms. Our method not only identifies homogeneous biclusters, but also provides posterior probabilities that particular instances or features are clustered together. We further extend our proposal to address the biclustering problem under the commonly occurring situation of incomplete datasets. In addition to identifying homogeneous sets of rows and sets of columns, as in the complete data scenario, our approach also generates plausible predictions for missing/unobserved entries in the rectangular data array. Performances of our methodology are illustrated through simulation studies and applications to real datasets.