| Literature DB >> 24531728 |
Yvonne M Badke1, Ronald O Bates, Catherine W Ernst, Justin Fix, Juan P Steibel.
Abstract
Genomic selection has the potential to increase genetic progress. Genotype imputation of high-density single-nucleotide polymorphism (SNP) genotypes can improve the cost efficiency of genomic breeding value (GEBV) prediction for pig breeding. Consequently, the objectives of this work were to: (1) estimate accuracy of genomic evaluation and GEBV for three traits in a Yorkshire population and (2) quantify the loss of accuracy of genomic evaluation and GEBV when genotypes were imputed under two scenarios: a high-cost, high-accuracy scenario in which only selection candidates were imputed from a low-density platform and a low-cost, low-accuracy scenario in which all animals were imputed using a small reference panel of haplotypes. Phenotypes and genotypes obtained with the PorcineSNP60 BeadChip were available for 983 Yorkshire boars. Genotypes of selection candidates were masked and imputed using tagSNP in the GeneSeek Genomic Profiler (10K). Imputation was performed with BEAGLE using 128 or 1800 haplotypes as reference panels. GEBV were obtained through an animal-centric ridge regression model using de-regressed breeding values as response variables. Accuracy of genomic evaluation was estimated as the correlation between estimated breeding values and GEBV in a 10-fold cross validation design. Accuracy of genomic evaluation using observed genotypes was high for all traits (0.65-0.68). Using genotypes imputed from a large reference panel (accuracy: R(2) = 0.95) for genomic evaluation did not significantly decrease accuracy, whereas a scenario with genotypes imputed from a small reference panel (R(2) = 0.88) did show a significant decrease in accuracy. Genomic evaluation based on imputed genotypes in selection candidates can be implemented at a fraction of the cost of a genomic evaluation using observed genotypes and still yield virtually the same accuracy. On the other side, using a very small reference panel of haplotypes to impute training animals and candidates for selection results in lower accuracy of genomic evaluation.Entities:
Keywords: GenPred; genomic selection; genotype imputation; shared data resources; swine
Mesh:
Year: 2014 PMID: 24531728 PMCID: PMC4059235 DOI: 10.1534/g3.114.010504
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Descriptive statistics of EBVs
| BF | D250 | LEA | |
|---|---|---|---|
| −0.03 | 4.57 | 0.61 | |
| 0.74 | 0.67 | 0.75 | |
| N | 965 | 936 | 938 |
| 0.45 | 0.26 | 0.47 |
EBV, estimated breeding values; BF, backfat thickness; D250, number of days to 250 lb; LEA, loin muscle area.
Average reliability of EBV.
Number of animals with usable EBV.
Estimates of accuracy for genomic evaluation and individual GEBV across imputation scenarios
| Trait | Scenario | Imputation Accuracy | |||||
|---|---|---|---|---|---|---|---|
| BF | 1 | (1, 1) | 0.68101 | 0.8510 | 0.7998 | 0.6852 | [0.5395, 0.8211] |
| 2 | (1, 0.95) | 0.67951 | 0.7981 | 0.6861 | [0.5467, 0.8164] | ||
| 3 | (0.88, 0.88) | 0.65982 | 0.7749 | 0.7014 | [0.5727, 0.8267] | ||
| 4 | (1,1) | 0.7210 | 0.8405 | 0.8560 | [0.8174, 0.8768] | ||
| D250 | 1 | (1, 1) | 0.66031 | 0.8020 | 0.8229 | 0.6575 | [0.5073, 0.7948] |
| 2 | (1, 0.95) | 0.65551,2 | 0.8170 | 0.6585 | [0.5187, 0.7962] | ||
| 3 | (0.88, 0.88) | 0.64632 | 0.8054 | 0.6750 | [0.5345, 0.7985] | ||
| 4 | (1,1) | 0.5354 | 0.6550 | 0.8438 | [0.8048, 0.8704] | ||
| LEA | 1 | (1, 1) | 0.65161 | 0.8529 | 0.7639 | 0.6859 | [0.5386, 0.8325] |
| 2 | (1, 0.95) | 0.64911 | 0.7610 | 0.6868 | [0.5377, 0.8214] | ||
| 3 | (0.88, 0.88) | 0.63642 | 0.7461 | 0.7040 | [0.5667, 0.8330] | ||
| 4 | (1,1) | 0.7165 | 0.8201 | 0.8549 | [0.8223, 0.8787] |
GEBV, genomic breeding value; EBV, estimated breeding values; HPD, highest posterior density; BF, backfat thickness; D250, number of days to 250 lb; LEA, loin muscle area.
Scenarios 1: all observed genotypes, 2: genotypes in prediction animals imputed with large reference haplotype panel (~1800), 3: genotypes in prediction animals imputed with small haplotype reference panel (128), and 4: validation animals with at least one close relative in the reference panel.
Accuracy of genotype imputation R2 for training and validation animals: .
Tukey honest significant difference post-hoc comparison of accuracy of genomic evaluation across imputation scenarios.
Average accuracy of EBV in the validation panel.
95% HPD interval of GEBV accuracy across validation animals.
Scenario with young animals in the validation panel that almost all have at least one close relative in the training panel.
1,2Means with different superscript differ significantly according to Tukey post-hoc tests with α = 0.05.
Significance of variables affecting accuracy of genomic evaluation
| dataset | ||||||
|---|---|---|---|---|---|---|
| trait | ||||||
| BF | 258 | < 0.001 | 2.83 | 0.1013 | 11.73 | 0.0016** |
| D250 | 229 | < 0.001 | 5.18 | 0.0291 | 7.238 | 0.0109 |
| LEA | 311 | < 0.001 | 2.06 | 0.1605 | 3.430 | 0.0725 |
EBV, estimated breeding values; BF, backfat thickness; D250, number of days to 250 lb; LEA, loin muscle area.
Accuracy of genomic evaluation was estimated for a total of 10 randomly assigned datasets of the cross-validation, such that we could assess whether accuracy of genomic evaluation was significantly different across these 10 datasets.
Accuracy of genomic evaluation by average of the top 10 genomic relationship estimates of animals in the validation set.
Accuracy of genomic evaluation by average accuracy of EBV of validation animals by cross-validation dataset.
df = c(9, 27).
df = c(1, 35).
P < 0.05, **P < 0.01.
Figure 1Accuracy of GEBV by observed accuracy of EBV for (A) BF, (B) D250, and (C) LEA r in relation to the animals r, with the 1-1 line of the regression (green line) and a loess smoother (red line), which is a local weighted mean of the r. GEBV, genomic breeding value; EBV, estimated breeding value; BF, backfat thickness; D250, number of days to 250 lb; LEA, loin muscle area.
Figure 2Accuracy of GEBV by average top 10 relatedness between the individual and training panel for (A) BF, (B) D250, and (C) LEA r in relation to the animals rel10, a loess smoother (red line), which is a local weighted mean of the r. GEBV, genomic breeding value; BF, backfat thickness; D250, number of days to 250 lb; LEA, loin muscle area.