PICS: Probabilistic Inference for ChIP‐seq | Zendy

Zhang Xuekui | Zendy; Robertson Gordon | Zendy; Krzywinski Martin | Zendy; Ning Kaida | Zendy; Droit Arnaud | Zendy; Jones Steven | Zendy; Gottardo Raphael | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

PICS: Probabilistic Inference for ChIP‐seq

Author(s) -

Zhang Xuekui,

Robertson Gordon,

Krzywinski Martin,

Ning Kaida,

Droit Arnaud,

Jones Steven,

Gottardo Raphael

Publication year - 2011

Publication title -

biometrics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 2.298

H-Index - 130

eISSN - 1541-0420

pISSN - 0006-341X

DOI - 10.1111/j.1541-0420.2010.01441.x

Subject(s) - false discovery rate , chromatin immunoprecipitation , computer science , inference , dna binding site , probabilistic logic , computational biology , statistical model , bayesian probability , bayesian inference , synthetic data , event (particle physics) , data mining , algorithm , biology , artificial intelligence , genetics , promoter , gene , gene expression , physics , quantum mechanics

Summary ChIP‐seq combines chromatin immunoprecipitation with massively parallel short‐read sequencing. While it can profile genome‐wide in vivo transcription factor‐DNA association with higher sensitivity, specificity, and spatial resolution than ChIP‐chip, it poses new challenges for statistical analysis that derive from the complexity of the biological systems characterized and from variability and biases in its sequence data. We propose a method called PICS (Probabilistic Inference for ChIP‐seq) for identifying regions bound by transcription factors from aligned reads. PICS identifies binding event locations by modeling local concentrations of directional reads, and uses DNA fragment length prior information to discriminate closely adjacent binding events via a Bayesian hierarchical t ‐mixture model. It uses precalculated, whole‐genome read mappability profiles and a truncated t ‐distribution to adjust binding event models for reads that are missing due to local genome repetitiveness. It estimates uncertainties in model parameters that can be used to define confidence regions on binding event locations and to filter estimates. Finally, PICS calculates a per‐event enrichment score relative to a control sample, and can use a control sample to estimate a false discovery rate. Using published GABP and FOXA1 data from human cell lines, we show that PICS' predicted binding sites were more consistent with computationally predicted binding motifs than the alternative methods MACS, QuEST, CisGenome, and USeq. We then use a simulation study to confirm that PICS compares favorably to these methods and is robust to model misspecification.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore