z-logo
open-access-imgOpen Access
Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors
Author(s) -
Łukasz Huminiecki
Publication year - 2018
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0198961
Subject(s) - housekeeping gene , computational biology , biology , transcription factor , repressor , promoter , gene expression , genetics , linear model , gene , expression (computer science) , regulation of gene expression , bayesian probability , computer science , mathematics , statistics , programming language
Understanding how regulatory elements control mammalian gene expression is a challenge of post-genomic era. We previously reported that size of proximal promoter architecture predicted the breadth of expression (fraction of tissues in which a gene is expressed). Herein, the contributions of individual transcription factors (TFs) were quantified. Several technologies of statistical modelling were utilized and compared: tree models, generalized linear models (GLMs, without and with regularization), Bayesian GLMs and random forest. Both linear and non-linear modelling strategies were explored. Encouragingly, different models led to similar statistical conclusions and biological interpretations. The majority of ENCODE TFs correlated positively with housekeeping expression, a minority correlated negatively. Thus, housekeeping expression can be understood as a cumulative effect of many types of TF binding sites. This is accompanied by the exclusion of fewer types of binding sites for TFs which are repressors, or support cell lineage commitment or temporarily inducible or spatially-restricted expression.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here