z-logo
Premium
Accuracy of genomic prediction using imputed whole‐genome sequence data in white layers
Author(s) -
Heidaritabar M.,
Calus M.P.L.,
Megens HJ.,
Vereijken A.,
Groenen M.A.M.,
Bastiaansen J.W.M.
Publication year - 2016
Publication title -
journal of animal breeding and genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.689
H-Index - 51
eISSN - 1439-0388
pISSN - 0931-2668
DOI - 10.1111/jbg.12199
Subject(s) - best linear unbiased prediction , imputation (statistics) , single nucleotide polymorphism , biology , genetics , whole genome sequencing , selection (genetic algorithm) , snp , genome , statistics , computational biology , computer science , mathematics , genotype , artificial intelligence , gene , missing data
Summary There is an increasing interest in using whole‐genome sequence data in genomic selection breeding programmes. Prediction of breeding values is expected to be more accurate when whole‐genome sequence is used, because the causal mutations are assumed to be in the data. We performed genomic prediction for the number of eggs in white layers using imputed whole‐genome resequence data including ~4.6 million SNP s. The prediction accuracies based on sequence data were compared with the accuracies from the 60 K SNP panel. Predictions were based on genomic best linear unbiased prediction ( GBLUP ) as well as a Bayesian variable selection model (BayesC). Moreover, the prediction accuracy from using different types of variants (synonymous, non‐synonymous and non‐coding SNP s) was evaluated. Genomic prediction using the 60 K SNP panel resulted in a prediction accuracy of 0.74 when GBLUP was applied. With sequence data, there was a small increase (~1%) in prediction accuracy over the 60 K genotypes. With both 60 K SNP panel and sequence data, GBLUP slightly outperformed BayesC in predicting the breeding values. Selection of SNP s more likely to affect the phenotype (i.e. non‐synonymous SNP s) did not improve the accuracy of genomic prediction. The fact that sequence data were based on imputation from a small number of sequenced animals may have limited the potential to improve the prediction accuracy. A small reference population (n = 1004) and possible exclusion of many causal SNP s during quality control can be other possible reasons for limited benefit of sequence data. We expect, however, that the limited improvement is because the 60 K SNP panel was already sufficiently dense to accurately determine the relationships between animals in our data.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here