Detecting alternatively spliced transcript isoforms from single‐molecule long‐read sequences without a reference genome
Author(s) - Liu Xiaoxian, Mei Wenbin, Soltis Pamela S., Soltis Douglas E., Barbazuk W. Brad
Publication year - 2017
Publication title - Molecular Ecology Resources
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.96
H-Index - 136
eISSN - 1755-0998
pISSN - 1755-098X
DOI - 10.1111/1755-0998.12670
Subject(s) - biology, computational biology, reference genome, rna seq, alternative splicing, gene, genome, genetics, proteome, locus (genetics), pipeline (software), transcriptome, rna splicing, gene isoform, rna, gene expression, computer science, programming language
Alternative splicing (AS) is a major source of transcript and proteome diversity, but examining AS in species without well‐annotated reference genomes remains difficult. Research on both human and mouse has demonstrated the advantages of using Iso‐Seq™ data for isoform‐level transcriptome analysis, including the study of AS and gene fusion. We applied Iso‐Seq™ to investigate AS in Amborella trichopoda, a phylogenetically pivotal species that is sister to all other living angiosperms. Our data show that, compared with RNA‐Seq data, the Iso‐Seq™ platform provides better recovery of large transcripts, identification of new gene loci and correction of gene models. Reference‐based AS detection with Iso‐Seq™ data identifies AS within a higher fraction of multi‐exonic genes than was observed in published RNA‐Seq analysis (45.8% vs. 37.5%). These data demonstrate that the Iso‐Seq™ approach is useful for detecting AS events. Using the Iso‐Seq‐defined transcript collection in Amborella as a reference, we further describe a pipeline for de novo detection of AS isoforms from PacBio Iso‐Seq™ data, that is, without using a reference sequence. Results using this pipeline show a 66%–76% overall success rate in identifying AS events. This de novo AS detection pipeline provides a method to accurately identify and characterize bona fide alternatively spliced transcripts in any nonmodel system that lacks a reference genome sequence. Hence, our pipeline has broad potential applications and benefits for the wider biology community.
