A Discriminative Model for Identifying Spatial cis-Regulatory Modules
Author(s) -
Eran Segal,
Roded Sharan
Publication year - 2005
Publication title -
journal of computational biology
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.585
H-Index - 95
eISSN - 1557-8666
pISSN - 1066-5277
ISBN - 1-58113-755-9
DOI - 10.1089/cmb.2005.12.822
Subject(s) - cis regulatory module , computational biology , transcription factor , gene , regulatory sequence , biology , dna binding site , sequence (biology) , transcription (linguistics) , discriminative model , genetics , computer science , promoter , enhancer , gene expression , artificial intelligence , linguistics , philosophy
Transcriptional regulation is mediated by the coordinated binding of transcription factors to the upstream regions of genes. In higher eukaryotes, the binding sites of cooperating transcription factors are organized into short sequence units, called cis-regulatory modules. In this paper, we propose a method for identifying modules of transcription factor binding sites in a set of co-regulated genes, using only the raw sequence data as input. Our method is based on a novel probabilistic model that describes the mechanism of cis-regulation, including the binding sites of cooperating transcription factors, the organization of these binding sites into short sequence modules, and the regulation of a gene by its modules. We show that our method is successful in discovering planted modules in simulated data and known modules in yeast. More importantly, we applied our method to a large collection of human gene sets and found 83 significant cis-regulatory modules, which included 36 known motifs and many novel ones. Thus, our results provide one of the first comprehensive compendiums of putative cis-regulatory modules in human.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom