z-logo
open-access-imgOpen Access
NPEST: a nonparametric method and a database for transcription start site prediction
Author(s) -
Tatarinova Tatiana,
Kryshchenko Alona,
Triska Martin,
Hassan Mehedi,
Murphy Denis,
Neely Michael,
Schumitzky Alan
Publication year - 2013
Publication title -
quantitative biology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.707
H-Index - 15
eISSN - 2095-4697
pISSN - 2095-4689
DOI - 10.1007/s40484-013-0022-2
Subject(s) - probabilistic logic , statistical model , computer science , nonparametric statistics , rna splicing , locus (genetics) , computational biology , data mining , gene , biology , artificial intelligence , genetics , mathematics , statistics , rna
In this paper we present NPEST, a novel tool for the analysis of expressed sequence tags (EST) distributions and transcription start site (TSS) prediction. This method estimates an unknown probability distribution of ESTs using a maximum likelihood (ML) approach, which is then used to predict positions of TSS. Accurate identification of TSS is an important genomics task, since the position of regulatory elements with respect to the TSS can have large effects on gene regulation, and performance of promoter motif‐finding methods depends on correct identification of TSSs. Our probabilistic approach expands recognition capabilities to multiple TSS per locus that may be a useful tool to enhance the understanding of alternative splicing mechanisms. This paper presents analysis of simulated data as well as statistical analysis of promoter regions of a model dicot plant Arabidopsis thaliana . Using our statistical tool we analyzed 16520 loci and developed a database of TSS, which is now publicly available at www.glacombio.net/NPEST .

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here