Strategies for Obtaining and Pruning Imputed Whole-Genome Sequence Data for Genomic Prediction

2019 
Genomic prediction with imputed whole genome sequencing (WGS) data is an attractive approach to improve predictive ability with low cost. However, high accuracy has not been realized using this method in livestock. In this study, we imputed 435 individuals from 600 K SNP chip data to WGS data using different reference panels. We also investigated the prediction accuracy of genomic best linear unbiased predication (GBLUP) using imputed WGS data from different reference panels, linkage disequilibrium (LD)-based marker pruning, and pre-selected variants based on GWAS results. Results showed that the imputation accuracy from 600 K to WGS data was 0.873±0.038, 0.906±0.036, and 0.979±0.010 for the internal, external, and combined reference panels, respectively. In most traits of chickens, the prediction accuracies of imputed WGS data obtained from the internal reference panel were greater than or equal to the combined reference panel; the external reference panel resulted in the lowest prediction accuracy. Compared with 600 K chip data, GBLUP with imputed WGS data had only a small increase (1%-3%) in prediction accuracy. Using only variants selected from imputed WGS data based on GWAS results resulted in almost no increase for most traits and even increased the bias of the regression coefficient. The impact of the degree of LD of selected and remaining variants on prediction accuracy were different. For average daily gain (ADG), residual feed intake (RFI), intestine length (IL), and body weight in 91 days (BW91), the accuracy of GBLUP increased as the degree of LD of selected variants decreased, but the opposite relationship occurred for remaining variants. But for breast muscle weight (BMW) and average daily feed intake (ADFI), the accuracy of GBLUP increased as the degree of LD of selected variants increased, and the degree of LD of remaining variants had a small effect on prediction accuracy. Overall, the optimal imputation strategy to obtain WGS data for genomic prediction should consider the relationship between selected individuals and target population individuals to avoid heterogeneity of imputation. LD-based marker pruning can be used to improve the accuracy of genomic prediction using imputed WAS data.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    39
    References
    6
    Citations
    NaN
    KQI
    []