| Literature DB >> 35752161 |
Tesfaye K Belay1, Leiv S Eikje2, Arne B Gjuvsland2, Øyvind Nordbø2, Thierry Tribout3, Theo Meuwissen1.
Abstract
Bias and inflation in genomic evaluation with the single-step methods have been reported in several studies. Incompatibility between the base-populations of the pedigree-based and the genomic relationship matrix (G) could be a reason for these biases. Inappropriate ways of accounting for missing parents could be another reason for biases in genetic evaluations with or without genomic information. To handle these problems, we fitted and evaluated a fixed covariate (J) that contains ones for genotyped animals and zeros for unrelated non-genotyped animals, or pedigree-based regression coefficients for related non-genotyped animals. We also evaluated alternative ways of fitting the J covariate together with genetic groups on biases and stability of breeding value estimates, and of including it into G as a random effect. In a whole vs. partial data set comparison, four scenarios were investigated for the partial data: genotypes missing, phenotypes missing, both genotypes and phenotypes missing, and pedigree missing. Fitting J either as fixed or random reduced level-bias and inflation and increased stability of genomic predictions as compared to the basic model where neither J nor genetic groups were fitted. In most models, genomic predictions were largely biased for scenarios with missing genotype and phenotype information. The biases were reduced for models which combined group and J effects. Models with these corrected group covariates performed better than the recently published model where genetic groups were encapsulated and fitted as random via the Quaas and Pollak transformation. In our Norwegian Red cattle data, a model which combined group and J regression coefficients was preferred because it showed least bias and highest stability of genomic predictions across the scenarios.Entities:
Keywords: J factor; Norwegian Red cattle; genetic groups; inflation; level-bias; single-step genomic BLUP
Mesh:
Year: 2022 PMID: 35752161 PMCID: PMC9467032 DOI: 10.1093/jas/skac227
Source DB: PubMed Journal: J Anim Sci ISSN: 0021-8812 Impact factor: 3.338
Number of records (N) and descriptive statistics of J covariate and lactation milk yield in tons (T)
| Item |
| Mean | SD | Min | Max |
|---|---|---|---|---|---|
| Milk yield, T | 3,390,184 | 6.58 | 1.39 | 0.56 | 19.70 |
| J1 | 4,593,369 | 0.79 | 0.29 | −0.01 | 1.68 |
| J2 | 30,729 | 1 | 0 | 1 | 1 |
J and J are part of the J covariate vector that pertain to non-genotyped and genotyped animals in pedigree, respectively.
Figure 1.Distribution of animals with or without phenotypes per genetic group.
Summary of model options (t), numbers (#), acronyms, and descriptions
| # | Option(t) | Acronym | Brief description |
|---|---|---|---|
| 1 | — | SSGBLUP_N | A base model that fitted neither |
| 2 |
| SSGBLUP_J |
|
| 3 | u | SSGBLUP_Jr |
|
| 4 |
| SSGBLUP_Q |
|
| 5 | Qg + | SSGBLUP_QJ |
|
| 6 |
| SSGBLUP_QJr | Fixed |
| 7 |
| SSGBLUP_Q* | The |
| 8 |
| SSGBLUP_ | Q* was obtained using |
| 9 |
| SSGBLUP_Q-Q+ | The |
| 10 |
| SSGBLUP_Q- | Minimum value of |
| 11 |
| SSGBLUP_EUPG |
|
u is a vector of random animal effects, which implicitly account for J effects; is effects of J covariate; g, , , , or is a vector of genetic group Q, Q*, Q-Q, , or estimates, respectively.
Information included (x) or excluded (−) in the analysis for the 674 animals constituting the validation population in the scenarios considered
| Scenario | Phenotypes | Pedigree | Genotypes |
|---|---|---|---|
| GPPed | x | x | x |
| GPed-only | − | x | x |
| PPed-only | x | x | − |
| Ped-only | − | x | − |
| NoPed | x | − | x |
Regression coefficient estimates of J covariate effects from the different genomic prediction models under various scenarios
| Model | Scenario | ||||
|---|---|---|---|---|---|
| GPPed | GPed-only | PPed-only | Ped-only | NoPed | |
| SSGBLUP_J | 0.834 | 0.833 | 0.830 | 0.829 | 0.833 |
| SSGBLUP_QJ | 2.619 | 2.620 | 2.614 | 2.614 | 2.543 |
Scenarios are as described in Table 3.
Figure 2.Trends for the original Q-Q+ (SSGBLUP_Q-Q+) (a) and restricted Q-Q+ (where minimum value of the Q-Q+ set to zero: ) (b) effects estimated using the partial datasets (where phenotypes (GPed-only), or genotypes (PPed-only), or both phenotypes and genotypes (Ped-only), or pedigree information (NoPed) of the 674 cows masked) and whole dataset (GPPed).
Figure 3.Trends for Q* (SSGBLUP_Q*) effects estimated using partial datasets (where phenotypes (GPed-only), or genotypes (PPed-only), or both phenotypes and genotypes (Ped-only), or pedigree information (NoPed) of the 674 cows masked) and whole dataset (GPPedd).
Figure 4.Trends for the QP transformed genetic groups (SSGBLUP_EUPG) effects estimated using partial datasets (where phenotypes (GPed-only), or genotypes (PPed-only), or both phenotypes and genotypes (Ped-only), or pedigree information (NoPed) of the 674 cows masked) and whole dataset (GPPedd). Note: Trends for genetic group effects shown in Figures 2–4 were adjusted by the first group prediction in each model and prediction scenario.
Regression coefficients (as measure of inflation) of GEBV from the whole dataset on GEBV from the partial datasets (scenarios) for validation animals and their standard errors (in parenthesis) using the alternative models
| Model | Scenario | |||
|---|---|---|---|---|
| GPed-only | PPed-only | Ped-only | NoPed | |
| SSGBLUP_N | 0.997(0.008) | 0.921(0.028) | 0.932(0.037) | 1.003(0.003) |
| SSGBLUP_J | 1.007(0.008) | 0.946(0.025) | 0.973(0.034) | 1.005(0.003) |
| SSGBLUP_Jr | 1.007(0.008) | 0.942(0.026) | 0.960(0.034) | 1.005(0.003) |
| SSGBLUP_Q | 1.009(0.008) | 0.928(0.025) | 0.946(0.034) | 0.991(0.004) |
| SSGBLUP_QJ | 1.001(0.007) | 0.970(0.025) | 0.990(0.033) | 1.022(0.009) |
| SSGBLUP_QJr | 1.001(0.007) | 0.955(0.028) | 0.966(0.037) | 1.022(0.009) |
| SSGBLUP_Q* | 1.007(0.008) | 0.947(0.025) | 0.971(0.034) | 1.004(0.003) |
|
| 1.008(0.008) | 0.945(0.025) | 0.970(0.034) | 1.004(0.003) |
| SSGBLUP_Q-Q+ | 1.006(0.008) | 0.947(0.026) | 0.968(0.034) | 1.003(0.003) |
|
| 1.008(0.008) | 0.943(0.025) | 0.965(0.034) | 1.001(0.029) |
| SSGBLUP_EUPG | 1.006(0.008) | 0.947(0.026) | 0.962(0.035) | 1.005(0.003) |
| Average | 1.005(0.008) | 0.945(0.026) | 0.964(0.034) | 1.006(0.006) |
Scenarios are as described in Table 3.
Level-biases and standard errors (in parenthesis) of GEBV estimated as mean differences (in genetic standard deviations) between GEBV from whole and partial (scenarios) datasets using the alternative models
| Model | Scenario | |||
|---|---|---|---|---|
| GPed-only | PPed-only | Ped-only | NoPed | |
| SSGBLUP_N | −0.041(0.008) | 0.197(0.026) | 0.139(0.029) | −0.021(0.003) |
| SSGBLUP_J | −0.024(0.008) | −0.125(0.024) | −0.178(0.028) | −0.016(0.003) |
| SSGBLUP_Jr | −0.024(0.008) | −0.145(0.024) | −0.201(0.028) | −0.017(0.003) |
| SSGBLUP_Q | 0.031(0.008) | 0.033(0.023) | −0.022(0.027) | 0.181(0.004) |
| SSGBLUP_QJ | 0.040(0.008) | −0.081(0.024) | −0.082(0.028) | 1.119(0.010) |
| SSGBLUP_QJr | 0.039(0.008) | −0.137(0.026) | −0.152(0.030) | 1.100(0.010) |
| SSGBLUP_Q* | −0.022(0.008) | −0.099(0.023) | −0.149(0.027) | −0.014(0.009) |
|
| −0.023(0.008) | −0.086(0.023) | −0.137(0.027) | −0.014(0.009) |
| SSGBLUP_Q-Q+ | −0.023(0.008) | −0.089(0.024) | −0.141(0.028) | −0.012(0.003) |
|
| −0.023(0.008) | −0.070(0.023) | −0.121(0.028) | −0.009(0.003) |
| SSGBLUP_EUPG | −0.033(0.008) | −0.131(0.023) | −0.187(0.028) | −0.082(0.003) |
| Average | −0.009 (0.008) | −0.067(0.024) | −0.112(0.028) | 0.201(0.005) |
Scenarios are as described in Table 3.
Stabilities or correlations between GEBV from whole and partial datasets (scenarios) for validation animals using the alternative models
| Model | Scenario | |||
|---|---|---|---|---|
| GPed-only | PPed-only | Ped-only | NoPed | |
| SSGBLUP_N | 0.979 | 0.784 | 0.701 | 0.997 |
| SSGBLUP_J | 0.980 | 0.821 | 0.741 | 0.997 |
| SSGBLUP_Jr | 0.980 | 0.817 | 0.733 | 0.997 |
| SSGBLUP_Q | 0.979 | 0.815 | 0.728 | 0.995 |
| SSGBLUP_QJ | 0.982 | 0.829 | 0.757 | 0.974 |
| SSGBLUP_QJr | 0.982 | 0.798 | 0.713 | 0.975 |
| SSGBLUP_Q* | 0.981 | 0.821 | 0.740 | 0.997 |
|
| 0.980 | 0.822 | 0.740 | 0.997 |
| SSGBLUP_Q-Q+ | 0.981 | 0.818 | 0.736 | 0.997 |
|
| 0.981 | 0.819 | 0.756 | 0.997 |
| SSGBLUP_EUPG | 0.981 | 0.816 | 0.731 | 0.997 |
| Average | 0.980 | 0.815 | 0.734 | 0.993 |
Scenarios are as described in Table 3.