Discovering Gene Regulatory Elements Using Coverage-Based Heuristics
Author(s) -
Rami Al-Ouran,
Robert Schmidt,
Ashwini Naik,
Jeffrey Jones,
Frank Drews,
David Juedes,
Laura Elnitski,
Lonnie Welch
Publication year - 2015
Publication title -
ieee/acm transactions on computational biology and bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.745
H-Index - 71
eISSN - 1557-9964
pISSN - 1545-5963
DOI - 10.1109/tcbb.2015.2496261
Subject(s) - bioengineering , computing and processing
Data mining algorithms and sequencing methods (such as RNA-seq and ChIP-seq) are being combined to discover genomic regulatory motifs that relate to a variety of phenotypes. However, motif discovery algorithms often produce very long lists of putative transcription factor binding sites, hindering the discovery of phenotype-related regulatory elements by making it difficult to select a manageable set of candidate motifs for experimental validation. To address this issue, the authors introduce the motif selection problem and provide coverage-based search heuristics for its solution. Analysis of 203 ChIP-seq experiments from the ENCyclopedia of DNA Elements project shows that our algorithms produce motifs that have high sensitivity and specificity and reveals new insights about the regulatory code of the human genome. The greedy algorithm performs the best, selecting a median of two motifs per ChIP-seq transcription factor group while achieving a median sensitivity of 77 percent.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom