z-logo
open-access-imgOpen Access
Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function
Author(s) -
Yili Chen,
Thomas W. Blackwell,
Ji Chen,
Jing Gao,
Angel W. Lee,
David J. States
Publication year - 2007
Publication title -
plos computational biology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.628
H-Index - 182
eISSN - 1553-7358
pISSN - 1553-734X
DOI - 10.1371/journal.pcbi.0030063
Subject(s) - biology , chromatin immunoprecipitation , gene , chromatin , genetics , computational biology , dna microarray , chip sequencing , dna binding site , chip on chip , genome , human genome , dna methylation , gene expression , promoter , chromatin remodeling
The MYC genes encode nuclear sequence specific–binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c- MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c- MYC recognition sites can be identified in c- MYC responsive genes, other motif matches—even experimentally confirmed sites—are associated with genes showing no c- MYC response. We have developed a computational model that integrates multiple sources of evidence to predict which genes will bind and be regulated by MYC in vivo. First, a Bayesian network classifier is used to predict those c- MYC recognition sites that are most likely to exhibit high-occupancy binding in chromatin immunoprecipitation studies. This classifier incorporates genomic sequence, experimentally determined genomic chromatin acetylation islands, and predicted methylation status from a computational model estimating the likelihood of genomic DNA methylation. We find that the predictions from this classifier are also applicable to other transcription factors, such as cAMP-response element-binding protein, whose binding sites are sensitive to DNA methylation. Second, the MYC binding probability is combined with the gene expression profile data from nine independent microarray datasets in multiple tissues. Finally, we may consider gene function annotations in Gene Ontology to predict the c- MYC targets. We assess the performance of our prediction results by comparing them with the c- myc targets identified in the biomedical literature. In total, we predict 460 likely c- MYC target genes in the human genome, of which 67 have been reported to be both bound and regulated by MYC, 68 are bound by MYC, and another 80 are MYC-regulated. The approach thus successfully identifies many known c- MYC targets and suggests many novel sites. Our findings suggest that to identify c- MYC genomic targets, integration of different data sources helps to improve the accuracy.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom