Open Access
Genotyping‐by‐sequencing through transcriptomics: implementation in a range of crop species with varying reproductive habits and ploidy levels
Author(s) -
Malmberg M. Michelle,
Pembleton Luke W.,
Baillie Rebecca C.,
Drayton Michelle C.,
Sudheesh Shimna,
Kaur Sukhjiwan,
Shinozuka Hiroshi,
Verma Preeti,
Spangenberg German C.,
Daetwyler Hans D.,
Forster John W.,
Cogan Noel O.I.
Publication year - 2018
Publication title -
plant biotechnology journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.525
H-Index - 115
eISSN - 1467-7652
pISSN - 1467-7644
DOI - 10.1111/pbi.12835
Subject(s) - biology , outbreeding depression , inbreeding , outcrossing , ploidy , genetics , genome , whole genome sequencing , genomics , microbiology and biotechnology , population , ecology , gene , pollen , demography , sociology
Summary The application of genomics in crops has the ability to significantly improve genetic gain for agriculture. Many marker‐dense tools have been developed, but few have seen broad adoption in plant genomics due to issues of significant variations of genome size, levels of ploidy, single nucleotide polymorphism ( SNP ) frequency and reproductive habit. When combined with limited breeding activities, small research communities and scant sequence resources, the suitability of popular systems is often suboptimal and routinely fails to effectively balance cost‐effectiveness and sample throughput. Genotyping‐by‐sequencing ( GBS ) encompasses a range of protocols including resequencing of the transcriptome. This study describes a skim GBS ‐transcriptomics ( GBS ‐t) approach developed to be broadly applicable, cost‐effective and high‐throughput while still assaying a significant number of SNP loci. A range of crop species with differing levels of ploidy and degree of inbreeding/outbreeding were chosen, including perennial ryegrass, a diploid outbreeding forage grass; phalaris, a putative segmental allotetraploid outbreeding forage grass; lentil, a diploid inbreeding grain legume; and canola, an allotetraploid partially outbreeding oilseed. GBS ‐t was validated as a simple and largely automated, cost‐effective method which generates sufficient SNP s (from 89 738 to 231 977) with acceptable levels of missing data and even genome coverage from c. 3 million sequence reads per sample. GBS ‐t is therefore a broadly applicable system suitable for many crops, offering advantages over other systems. The correct choice of subsequent sequence analysis software is important, and the bioinformatics process should be iterative and tailored to the specific challenges posed by ploidy variation and extent of heterozygosity.