| Literature DB >> 34849779 |
Shaohua Zhu1,2, Tingting Guo1,2, Chao Yuan1,2, Jianbin Liu1,2, Jianye Li1,2, Mei Han1,2, Hongchang Zhao1,2, Yi Wu1,2, Weibo Sun1,2, Xijun Wang3, Tianxiang Wang3, Jigang Liu3, Christian Keambou Tiambo4, Yaojing Yue2, Bohui Yang1.
Abstract
The marker density, the heritability level of trait and the statistical models adopted are critical to the accuracy of genomic prediction (GP) or selection (GS). If the potential of GP is to be fully utilized to optimize the effect of breeding and selection, in addition to incorporating the above factors into simulated data for analysis, it is essential to incorporate these factors into real data for understanding their impact on GP accuracy, more clearly and intuitively. Herein, we studied the GP of six wool traits of sheep by two different models, including Bayesian Alphabet (BayesA, BayesB, BayesCπ, and Bayesian LASSO) and genomic best linear unbiased prediction (GBLUP). We adopted fivefold cross-validation to perform the accuracy evaluation based on the genotyping data of Alpine Merino sheep (n = 821). The main aim was to study the influence and interaction of different models and marker densities on GP accuracy. The GP accuracy of the six traits was found to be between 0.28 and 0.60, as demonstrated by the cross-validation results. We showed that the accuracy of GP could be improved by increasing the marker density, which is closely related to the model adopted and the heritability level of the trait. Moreover, based on two different marker densities, it was derived that the prediction effect of GBLUP model for traits with low heritability was better; while with the increase of heritability level, the advantage of Bayesian Alphabet would be more obvious, therefore, different models of GP are appropriate in different traits. These findings indicated the significance of applying appropriate models for GP which would assist in further exploring the optimization of GP.Entities:
Keywords: Alpine Merino sheep; Bayesian alphabet; GBLUP; GenPred; genomic prediction; marker density; shared data resource; wool traits
Mesh:
Year: 2021 PMID: 34849779 PMCID: PMC8527494 DOI: 10.1093/g3journal/jkab206
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Descriptive statistics of phenotypic values of traits
| Trait | Abbreviation | SE | Mean ± SD | Numbers |
|---|---|---|---|---|
| Clean fleece weight rate (%) | CFWR | 0.25 | 63.58 ± 7.10 | 817 |
| Staple strength (N/ktex) | SS | 0.28 | 33.81 ± 7.98 | 813 |
| Fleece extension rate (%) | FER | 0.18 | 19.67 ± 5.07 | 811 |
| Mean fiber diameter (mm) | FD | 0.07 | 20.81 ± 2.11 | 811 |
| Coefficient of variation of FD | FD_CV | 0.11 | 20.22 ± 3.26 | 816 |
| Staple length (mm) | SL | 0.46 | 90.63 ± 13.16 | 812 |
SE, standard error; SD, standard deviation.
Different GS methods and effects distribution in this study
| Method | Assumed distribution of effect | Formula of effect distribution |
|---|---|---|
| GBLUP | Normal |
|
| BayesA | t |
|
| BayesB | Point-t |
|
| BayesC | t mixture |
|
| Bayesian LASSO | Double exponential or Laplace |
|
Estimates of additive and residual components of variance obtained under GBLUP methodology using BGLR package for different datasets
| Traits | Dataset type |
|
|
|
|---|---|---|---|---|
| CFWR | L-Datasets | 26.47 (0.33) | 0.56 (0.01) | 20.77 (0.24) |
| H-Datasets | 23.04 (0.26) | 0.46 (0.01) | 27.06 (0.25) | |
| SS | L-Datasets | 28.64 (0.42) | 0.46 (0.01) | 33.46 (0.35) |
| H-Datasets | 23.20 (0.43) | 0.35 (0.01) | 42.53 (0.38) | |
| FER | L-Datasets | 9.04 (0.17) | 0.37 (0.02) | 16.75 (0.16) |
| H-Datasets | 7.57 (0.18) | 0.29 (0.01) | 18.77 (0.18) | |
| FD | L-Datasets | 1.91 (0.03) | 0.45 (0.02) | 2.26 (0.02) |
| H-Datasets | 2.04 (0.03) | 0.44 (0.01) | 2.46 (0.02) | |
| FD_CV | L-Datasets | 5.46 (0.06) | 0.56 (0.01) | 4.13 (0.05) |
| H-Datasets | 5.75 (0.08) | 0.55 (0.01) | 4.65 (0.06) | |
| SL | L-Datasets | 89.63 (0.72) | 0.70 (0.01) | 37.73 (0.55) |
| H-Datasets | 106.99 (0.86) | 0.68 (0.01) | 50.64 (0.76) |
CFWR, clean fleece weight rate; SS, staple strength; FER, fleece extension rate; FD, mean FD; FD_CV, coefficient of variation of FD; SL, staple length
Polygenic heritability, the proportion of the additive effect variance to the total phenotypic variance.
Comparison of prediction accuracies of six traits based on two datasets via five models
| Trait | Prediction Accuracy | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | BA | BB | BC | BL | GB | ||||||
| Dataset | L | H | L | H | L | H | L | H | L | H | |
| CFWR | 0.47 (0.01) | 0.47 (0.03) | 0.48 (0.02) | 0.49 (0.02) | 0.52 (0.01) | 0.50 (0.03) | 0.51 (0.02) | 0.51 (0.03) | 0.52 (0.01) | 0.53 (0.02) | |
| SS | 0.33 (0.01) | 0.34 (0.01) | 0.31 (0.02) | 0.32 (0.03) | 0.32 (0.02) | 0.33 (0.02) | 0.29 (0.02) | 0.33 (0.04) | 0.35 (0.03) | 0.35 (0.02) | |
| FER | 0.28 (0.01) | 0.31 (0.03) | 0.30 (0.03) | 0.32 (0.02) | 0.32 (0.01) | 0.33 (0.03) | 0.32 (0.01) | 0.34 (0.02) | 0.34 (0.01) | 0.36 (0.01) | |
| FD | 0.49 (0.01) | 0.48 (0.04) | 0.44 (0.02) | 0.45 (0.06) | 0.44 (0.02) | 0.44 (0.04) | 0.56 (0.01) | 0.53 (0.01) | 0.52 (0.02) | 0.53 (0.01) | |
| FD_CV | 0.45 (0.02) | 0.45 (0.03) | 0.52 (0.01) | 0.53 (0.00) | 0.47 (0.02) | 0.48 (0.02) | 0.50 (0.02) | 0.52 (0.01) | 0.51 (0.01) | 0.55 (0.02) | |
| SL | 0.59 (0.02) | 0.58 (0.01) | 0.60 (0.01) | 0.58 (0.01) | 0.59 (0.01) | 0.53 (0.02) | 0.59 (0.02) | 0.58 (0.02) | 0.60 (0.03) | 0.57 (0.02) | |
Abbreviations of traits explained in Table 3
SE are in parenthesis
BA, BayesA; BB, BayesB; BC, BayesCπ; BL, Bayesian LASSO; GB, genomic best linear unbiased prediction, GBLUP.
Figure 1Comparison of GP accuracy based on different density genotype datasets. The six traits were CFWR, SS, FER, mean FD, FD_CV, and SL.
Figure 2GP accuracy of five models in different heritability level. On the left is the result for the H-datasets, and on the right is the result for the L-datasets. The six traits were CFWR, SS, FER, mean FD, FD_CV, and SL. The five models were: BayesA (BA); BayesB (BB); BayesCπ (BC); Bayesian LASSO (BL); and GBLUP (GB).