z-logo
open-access-imgOpen Access
Addressing the mean-correlation relationship in co-expression analysis
Author(s) -
ShingTung Yau,
Stephanie C. Hicks,
Kasper D. Hansen
Publication year - 2022
Publication title -
plos computational biology/plos computational biology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.628
H-Index - 182
eISSN - 1553-7358
pISSN - 1553-734X
DOI - 10.1371/journal.pcbi.1009954
Subject(s) - normalization (sociology) , correlation , computational biology , quantile , biology , expression (computer science) , gene expression , gene , gene expression profiling , gene regulatory network , genetics , statistics , mathematics , computer science , geometry , sociology , anthropology , programming language
Estimates of correlation between pairs of genes in co-expression analysis are commonly used to construct networks among genes using gene expression data. As previously noted, the distribution of such correlations depends on the observed expression level of the involved genes, which we refer to this as a mean-correlation relationship in RNA-seq data, both bulk and single-cell. This dependence introduces an unwanted technical bias in co-expression analysis whereby highly expressed genes are more likely to be highly correlated. Such a relationship is not observed in protein-protein interaction data, suggesting that it is not reflecting biology. Ignoring this bias can lead to missing potentially biologically relevant pairs of genes that are lowly expressed, such as transcription factors. To address this problem, we introduce spatial quantile normalization (SpQN), a method for normalizing local distributions in a correlation matrix. We show that spatial quantile normalization removes the mean-correlation relationship and corrects the expression bias in network reconstruction.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here