Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function | Zendy

Yili Chen | Zendy; Thomas W. Blackwell | Zendy; Ji Chen | Zendy; Jing Gao | Zendy; Angel W. Lee | Zendy; David J. States | Zendy

Open Access

Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function

Author(s) -

Yili Chen,

Thomas W. Blackwell,

Ji Chen,

Jing Gao,

Angel W. Lee,

David J. States

Publication year - 2007

Publication title -

plos computational biology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 2.628

H-Index - 182

eISSN - 1553-7358

pISSN - 1553-734X

DOI - 10.1371/journal.pcbi.0030063

Subject(s) - biology , chromatin immunoprecipitation , gene , chromatin , genetics , computational biology , dna microarray , chip sequencing , dna binding site , chip on chip , genome , human genome , dna methylation , gene expression , promoter , chromatin remodeling

The MYC genes encode nuclear sequence specific–binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c- MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c- MYC recognition sites can be identified in c- MYC responsive genes, other motif matches—even experimentally confirmed sites—are associated with genes showing no c- MYC response. We have developed a computational model that integrates multiple sources of evidence to predict which genes will bind and be regulated by MYC in vivo. First, a Bayesian network classifier is used to predict those c- MYC recognition sites that are most likely to exhibit high-occupancy binding in chromatin immunoprecipitation studies. This classifier incorporates genomic sequence, experimentally determined genomic chromatin acetylation islands, and predicted methylation status from a computational model estimating the likelihood of genomic DNA methylation. We find that the predictions from this classifier are also applicable to other transcription factors, such as cAMP-response element-binding protein, whose binding sites are sensitive to DNA methylation. Second, the MYC binding probability is combined with the gene expression profile data from nine independent microarray datasets in multiple tissues. Finally, we may consider gene function annotations in Gene Ontology to predict the c- MYC targets. We assess the performance of our prediction results by comparing them with the c- myc targets identified in the biomedical literature. In total, we predict 460 likely c- MYC target genes in the human genome, of which 67 have been reported to be both bound and regulated by MYC, 68 are bound by MYC, and another 80 are MYC-regulated. The approach thus successfully identifies many known c- MYC targets and suggests many novel sites. Our findings suggest that to identify c- MYC genomic targets, integration of different data sources helps to improve the accuracy.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research