Universal Count Correction for High-Throughput Sequencing
Author(s) -
Tatsunori Hashimoto,
Matthew D. Edwards,
David K. Gifford
Publication year - 2014
Publication title -
plos computational biology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.628
H-Index - 182
eISSN - 1553-7358
pISSN - 1553-734X
DOI - 10.1371/journal.pcbi.1003494
Subject(s) - count data , overdispersion , computer science , rna seq , base (topology) , computational biology , statistics , biology , mathematics , transcriptome , gene , poisson distribution , genetics , mathematical analysis , gene expression
We show that existing RNA-seq, DNase-seq, and ChIP-seq data exhibit overdispersed per-base read count distributions that are not matched to existing computational method assumptions. To compensate for this overdispersion we introduce a nonparametric and universal method for processing per-base sequencing read count data called F ixseq . We demonstrate that F ixseq substantially improves the performance of existing RNA-seq, DNase-seq, and ChIP-seq analysis tools when compared with existing alternatives.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom