z-logo
open-access-imgOpen Access
De Novo Assembly of a New Solanum pennellii Accession Using Nanopore Sequencing
Author(s) -
Maximilian Schmidt,
Alexander Vogel,
Alisandra K. Denton,
Benjamin Istace,
Alexandra Wormit,
Henri van de Geest,
Marie Bolger,
Saleh Alseekh,
Janina Maß,
Christian Pfaff,
Ulrich Schurr,
Roger T. Chetelat,
Florian Maumus,
JeanMarc Aury,
Sergey Koren,
Alisdair R. Fernie,
Dani Zamir,
Anthony Bolger,
Björn Usadel
Publication year - 2017
Publication title -
the plant cell
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 5.324
H-Index - 341
eISSN - 1532-298X
pISSN - 1040-4651
DOI - 10.1105/tpc.17.00521
Subject(s) - contig , nanopore sequencing , sequence assembly , biology , nanopore , hybrid genome assembly , genome , pipeline (software) , accession number (library science) , whole genome sequencing , computational biology , minion , illumina dye sequencing , reference genome , genetics , gene , computer science , nanotechnology , materials science , transcriptome , gene expression , genbank , programming language
Updates in nanopore technology have made it possible to obtain gigabases of sequence data. Prior to this, nanopore sequencing technology was mainly used to analyze microbial samples. Here, we describe the generation of a comprehensive nanopore sequencing data set with a median read length of 11,979 bp for a self-compatible accession of the wild tomato species Solanum pennellii We describe the assembly of its genome to a contig N50 of 2.5 MB. The assembly pipeline comprised initial read correction with Canu and assembly with SMARTdenovo. The resulting raw nanopore-based de novo genome is structurally highly similar to that of the reference S. pennellii LA716 accession but has a high error rate and was rich in homopolymer deletions. After polishing the assembly with Illumina reads, we obtained an error rate of <0.02% when assessed versus the same Illumina data. We obtained a gene completeness of 96.53%, slightly surpassing that of the reference S. pennellii Taken together, our data indicate that such long read sequencing data can be used to affordably sequence and assemble gigabase-sized plant genomes.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom