Premium
Relatedness and Genotype × Environment Interaction Affect Prediction Accuracies in Genomic Selection: A Study in Cassava
Author(s) -
Ly Delphine,
Hamblin Martha,
Rabbi Ismail,
Melaku Gedil,
Bakare Moshood,
Gauch Hugh G.,
Okechukwu Richardson,
Dixon Alfred G.O.,
Kulakow Peter,
Jannink JeanLuc
Publication year - 2013
Publication title -
crop science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.76
H-Index - 147
eISSN - 1435-0653
pISSN - 0011-183X
DOI - 10.2135/cropsci2012.11.0653
Subject(s) - biology , selection (genetic algorithm) , cross validation , genomic selection , population , statistics , genotype , gene–environment interaction , heritability , genotyping , microbiology and biotechnology , genetics , machine learning , mathematics , computer science , gene , demography , single nucleotide polymorphism , sociology
Before implementation of genomic selection, evaluation of the potential accuracy of prediction can be obtained by cross‐validation. In this procedure, a population with both phenotypes and genotypes is split into training and validation sets. The prediction model is fitted using the training set, and its accuracy is calculated on the validation set. The degree of genetic relatedness between the training and validation sets may influence the expected accuracy as may the genotype × environment (G×E) interaction in those sets. We developed a method to assess these effects and tested it in cassava ( Manihot esculenta Crantz). We used historical phenotypic data available from the International Institute of Tropical Agriculture Genetic Gain trial and performed genotyping by sequencing for these clones. We tested cross‐validation sampling schemes preventing the training and validation sets from sharing (i) genetically close clones or (ii) similar evaluation locations. For 19 traits, plot‐basis heritabilities ranged from 0.04 to 0.66. The correlation between predicted and observed phenotypes ranged from 0.15 to 0.47. Across traits, predicting for less related clones decreased accuracy from 0 to 0.07, a small but consistent effect. For 17 traits, predicting for different locations decreased accuracy between 0.01 and 0.18. Genomic selection has potential to accelerate gains in cassava and the existing training population should give a reasonable estimate of future prediction accuracies.