z-logo
open-access-imgOpen Access
Tiling Assembly: a new tool for reference annotation-independent transcript assembly and novel gene identification by RNA-sequencing
Author(s) -
Kenneth A. Watanabe,
Arielle L. Homayouni,
Tara Tufano,
J. M. Andrade López,
Patricia Ringler,
Paul J. Rushton,
Qingxi J. Shen
Publication year - 2015
Publication title -
dna research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.647
H-Index - 98
eISSN - 1756-1663
pISSN - 1340-2838
DOI - 10.1093/dnares/dsv015
Subject(s) - sequence assembly , biology , annotation , genome , gene annotation , gene , computational biology , genetics , genome project , transcriptome , gene expression
Annotation of the rice (Oryza sativa) genome has evolved significantly since release of its draft sequence, but it is far from complete. Several published transcript assembly programmes were tested on RNA-sequencing (RNA-seq) data to determine their effectiveness in identifying novel genes to improve the rice genome annotation. Cufflinks, a popular assembly software, did not identify all transcripts suggested by the RNA-seq data. Other assembly software was CPU intensive, lacked documentation, or lacked software updates. To overcome these shortcomings, a heuristic ab initio transcript assembly algorithm, Tiling Assembly, was developed to identify genes based on short read and junction alignment. Tiling Assembly was compared with Cufflinks to evaluate its gene-finding capabilities. Additionally, a pipeline was developed to eliminate false-positive gene identification due to noise or repetitive regions in the genome. By combining Tiling Assembly and Cufflinks, 767 unannotated genes were identified in the rice genome, demonstrating that combining both programmes proved highly efficient for novel gene identification. We also demonstrated that Tiling Assembly can accurately determine transcription start sites by comparing the Tiling Assembly genes with their corresponding full-length cDNA. We applied our pipeline to additional organisms and identified numerous unannotated genes, demonstrating that Tiling Assembly is an organism-independent tool for genome annotation.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom