z-logo
open-access-imgOpen Access
An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations
Author(s) -
Bernardo Clavijo,
Luca Venturini,
Christian Schudoma,
Gonzalo Garcia Accinelli,
Gemy Kaithakottil,
Jonathan Wright,
Philippa Borrill,
George Kettleborough,
Darren Heavens,
Helen Chapman,
James Lipscombe,
Tom Barker,
Fu-Hao Lu,
Neil McKenzie,
Dina Raats,
Ricardo H. Ramírez-González,
Aurore Coince,
Ned Peel,
Lawrence PercivalAlwyn,
Owen Duncan,
Josua Trösch,
Guotai Yu,
Dan Bolser,
Guy Namaati,
Arnaud Kerhornou,
M. Spannagl,
Heidrun Gundlach,
Georg Haberer,
Robert Davey,
Nigel Fosker,
Federica Di Palma,
Andrew L. Phillips,
A. Harvey Millar,
Paul Kersey,
Cristóbal Uauy,
Ksenia V. Krasileva,
David Swarbreck,
Michael W. Bevan,
Matthew D. Clark
Publication year - 2017
Publication title -
genome research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 9.556
H-Index - 297
eISSN - 1549-5469
pISSN - 1088-9051
DOI - 10.1101/gr.217117.116
Subject(s) - biology , genome , sequence assembly , genetics , computational biology , genome project , gene annotation , shotgun sequencing , gene , gene prediction , polyploid , hybrid genome assembly , reference genome , whole genome sequencing , transcriptome , gene expression
Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom