Open Access
THUNDER: A reference-free deconvolution method to infer cell type proportions from bulk Hi-C data
Author(s) -
Bryce Rowland,
Ruth Huh,
Zoey Hou,
Cheynna Crowley,
Jia Wen,
Yin Shen,
Ming Hu,
Paola GiustiRodríguez,
Patrick F. Sullivan,
Yun Li
Publication year - 2022
Publication title -
plos genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.587
H-Index - 233
eISSN - 1553-7404
pISSN - 1553-7390
DOI - 10.1371/journal.pgen.1010102
Subject(s) - thunder , deconvolution , population , cell type , biology , cell , computer science , algorithm , physics , genetics , demography , sociology , meteorology
Hi-C data provide population averaged estimates of three-dimensional chromatin contacts across cell types and states in bulk samples. Effective analysis of Hi-C data entails controlling for the potential confounding factor of differential cell type proportions across heterogeneous bulk samples. We propose a novel unsupervised deconvolution method for inferring cell type composition from bulk Hi-C data, the Two-step Hi-c UNsupervised DEconvolution appRoach (THUNDER). We conducted extensive simulations to test THUNDER based on combining two published single-cell Hi-C (scHi-C) datasets. THUNDER more accurately estimates the underlying cell type proportions compared to reference-free methods (e.g., TOAST, and NMF) and is more robust than reference-dependent methods (e.g. MuSiC). We further demonstrate the practical utility of THUNDER to estimate cell type proportions and identify cell-type-specific interactions in Hi-C data from adult human cortex tissue samples. THUNDER will be a useful tool in adjusting for varying cell type composition in population samples, facilitating valid and more powerful downstream analysis such as differential chromatin organization studies. Additionally, THUNDER estimated contact profiles provide a useful exploratory framework to investigate cell-type-specificity of the chromatin interactome while experimental data is still rare.