z-logo
open-access-imgOpen Access
Optimizing ChIP-seq peak detectors using visual labels and supervised machine learning
Author(s) -
Toby Dylan Hocking,
Patricia Goerner-Potvin,
Andréanne Morin,
Xiaojian Shao,
Tomi Pastinen,
Guillaume Bourque
Publication year - 2016
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btw672
Subject(s) - computer science , encode , artificial intelligence , set (abstract data type) , genome browser , pattern recognition (psychology) , supervised learning , labeled data , machine learning , genome , genomics , artificial neural network , gene , biology , programming language , biochemistry
Many peak detection algorithms have been proposed for ChIP-seq data analysis, but it is not obvious which algorithm and what parameters are optimal for any given dataset. In contrast, regions with and without obvious peaks can be easily labeled by visual inspection of aligned read counts in a genome browser. We propose a supervised machine learning approach for ChIP-seq data analysis, using labels that encode qualitative judgments about which genomic regions contain or do not contain peaks. The main idea is to manually label a small subset of the genome, and then learn a model that makes consistent peak predictions on the rest of the genome.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom