z-logo
open-access-imgOpen Access
Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes
Author(s) -
Chien-Chih Chen,
WenDar Lin,
Yu-Jung Chang,
Chuen-Liang Chen,
Jan-Ming Ho
Publication year - 2012
Publication title -
isrn bioinformatics
Language(s) - English
Resource type - Journals
eISSN - 2090-7346
pISSN - 2090-7338
DOI - 10.5402/2012/816402
Subject(s) - transcriptome , de novo transcriptome assembly , sequence assembly , computational biology , biology , computer science , genetics , gene , gene expression
Background . The emergence of next-generation sequencing platform gives rise to a new generation of assembly algorithms. Compared with the Sanger sequencing data, the next-generation sequence data present shorter reads, higher coverage depth, and different error profiles. These features bring new challenging issues for de novo transcriptome assembly. Methodology . To explore the influence of these features on assembly algorithms, we studied the relationship between read overlap size, coverage depth, and error rate using simulated data. According to the relationship, we propose a de novo transcriptome assembly procedure, called Euler-mix, and demonstrate its performance on a real transcriptome dataset of mice. The simulation tool and evaluation tool are freely available as open source. Significance . Euler-mix is a straightforward pipeline; it focuses on dealing with the variation of coverage depth of short reads dataset. The experiment result showed that Euler-mix improves the performance of de novo transcriptome assembly.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom