| Literature DB >> 30726939 |
Ivan Pocrnic1, Daniela A L Lourenco1, Ching-Yi Chen2, William O Herring2, Ignacy Misztal1.
Abstract
Genomic selection (GS) is routinely applied to many purebreds and lines of farm species. However, this method can be extended to predictions across purebreds as well as for crossbreds. This is useful for swine and poultry, for which selection in nucleus herds is typically performed on purebred animals, whereas the commercial product is the crossbred animal. Single-step genomic BLUP (ssGBLUP) is a widely applied method that can explore the recently developed algorithm for proven and young (APY). The APY allows for greater efficiency in computing BLUP solutions by exploiting the theory of limited dimensionality of genomic information and chromosome segments (Me). This study investigates the predictivity as a proxy for accuracy across and within 2 purebred pig lines and their crosses, under the application of APY in ssGBLUP setup, and different levels of Me overlapping across populations. The data consisted of approximately 210k phenotypic records for 2 traits (T1 and T2) with moderate heritability. Genotypes for 43k SNP markers were available for approximately 46k animals, from which 26k and 16k belong to 2 pure lines (L1 and L2), and approximately 4k are crossbreds. The complete pedigree had more than 720k animals. Different multivariate ssGBLUP models were applied, either with the regular or APY inverse of the genomic relationship matrix (G). The models included a standard bivariate animal model with 3 lines evaluated as 1 joint line, and for each trait individually, a 3-trait animal model with each line treated as a separate trait. Both models provided the same predictivity across and within the lines. Using either of the pure lines data as a training set resulted in a similar predictivity for the crossbreed animals (0.18 to 0.21). Across-line predictive ability was limited to less than half of the maximum predictivity for each line (L1T1 0.33, L1T2 0.25, L2T1 0.35, L2T2 0.36). For crossbred predictions, APY performed equivalently to regular G inverse when the number of core animals was equal to the number of eigenvalues explaining between 98% and 99% of the variance of G (4k to 8k) including all lines. Predictivity across the lines is achievable because of the shared Me between them. The number of those shared segments can be obtained via eigenvalue decomposition of genomic information available for each line.Entities:
Keywords: across-breed prediction; chromosome segments; genomic selection; multibreed evaluation
Mesh:
Year: 2019 PMID: 30726939 PMCID: PMC6447278 DOI: 10.1093/jas/skz042
Source DB: PubMed Journal: J Anim Sci ISSN: 0021-8812 Impact factor: 3.159
Correlations between genomic EBV and phenotypes adjusted for fixed effects, for different groups of validation animals (purebred animals L1 and L2, and their crosses C) with a different source of phenotypes available, shown for traits 1 (T1) and 2 (T2), under the first model (2-trait animal model without the distinction between the lines)
| Phenotypes1 | T1 | T2 | ||||
|---|---|---|---|---|---|---|
| L1 | L2 | C | L1 | L2 | C | |
| L1 + L2 + C | 0.33 | 0.34 | 0.26 | 0.24 | 0.36 | 0.25 |
| L1 + L2 | 0.33 | 0.34 | 0.26 | 0.24 | 0.36 | 0.25 |
| L1 | 0.33 | 0.15 | 0.19 | 0.25 | 0.14 | 0.19 |
| L2 | 0.18 | 0.35 | 0.21 | 0.11 | 0.36 | 0.18 |
1Phenotypes coming from L1, L2, and C jointly, L1 and L2 jointly, L1 solely, or L2 solely.
Correlations between genomic EBV and phenotypes adjusted for fixed effects, for different groups of validation animals (purebred animals L1 and L2, and their crosses C) with a different source of phenotypes available, shown for trait 1 (T1) when that trait was separated into 3 traits based on the line of the animals (second model)
| Phenotypes1 | L1 | L2 | C |
|---|---|---|---|
| L1 + L2 + C | 0.33 | 0.35 | 0.24 |
| L1 | 0.33 | 0.15 | 0.19 |
| L2 | 0.18 | 0.35 | 0.20 |
1Phenotypes coming from L1, L2, and C jointly, L1 solely, or L2 solely.
Correlations between genomic EBV and phenotypes adjusted for fixed effects, for different groups of validation animals (purebred animals L1 and L2, and their crosses C) with a different source of phenotypes available, shown for trait 2 (T2) when that trait was separated into 3 traits based on the line of the animals (second model)
| Phenotypes1 | L1 | L2 | C |
|---|---|---|---|
| L1 + L2 + C | 0.25 | 0.38 | 0.22 |
| L1 | 0.25 | 0.15 | 0.20 |
| L2 | 0.12 | 0.38 | 0.19 |
1Phenotypes coming from L1, L2, and C jointly, L1 solely, or L2 solely.
Regression coefficients (b1) of adjusted phenotypes on genomic EBV, for different groups of validation animals (purebred animals L1 and L2, and their crosses C) with a different source of phenotypes available, shown for traits 1 (T1) and 2 (T2), under the first model (M1; 2-trait animal model without the distinction between the lines) and second model (M2; when that trait was separated into 3 traits based on the line of the animals)
| Phenotypes1 | M1–T1 | M1–T2 | ||||
|---|---|---|---|---|---|---|
| L1 | L2 | C | L1 | L2 | C | |
| L1 + L2 + C | 0.82 | 0.99 | 0.78 | 0.48 | 0.97 | 0.56 |
| L1 + L2 | 0.82 | 0.99 | 0.78 | 0.48 | 0.97 | 0.56 |
| L1 | 0.81 | 0.59 | 0.60 | 0.50 | 0.59 | 0.52 |
| L2 | 0.64 | 1.03 | 0.76 | 0.32 | 0.93 | 0.48 |
| M2–T1 | M2–T2 | |||||
| L1 | L2 | C | L1 | L2 | C | |
| L1 + L2 + C | 0.79 | 1.06 | 0.77 | 0.49 | 0.94 | 0.79 |
| L1 | 0.79 | 2.08 | 1.09 | 0.51 | 0.98 | 0.80 |
| L2 | 2.11 | 1.07 | 0.78 | 0.31 | 0.92 | 0.90 |
1Phenotypes coming from L1, L2 and C jointly, L1 and L2 jointly, L1 solely, or L2 solely
Numbers of largest eigenvalues (Eig) explaining 50%, 80%, 90%, 95%, 98%, and 99% of the variance in the genomic relationship matrix with a different source of genotypes available
| Genotypes1 | Number genotyped | Eig50 | Eig80 | Eig90 | Eig95 | Eig98 | Eig99 |
|---|---|---|---|---|---|---|---|
| L1 | 26,543 | 119 | 531 | 1,068 | 1,944 | 3,888 | 5,957 |
| L2 | 15,976 | 125 | 602 | 1,209 | 2,112 | 3,884 | 5,601 |
| L1 + L2 | 42,519 | 126 | 728 | 1,528 | 2,763 | 5,381 | 8,137 |
| C | 3,969 | 105 | 479 | 864 | 1,315 | 1,968 | 2,459 |
| L1 + L2 + C | 46,488 | 130 | 735 | 1,533 | 2,759 | 5,368 | 8,141 |
1Purebred animals L1 and L2, and their crosses C.
Figure 1.Projection of genomic relationships into first 2 principal components (PC), showing purebred animals (L1 and L2), and their crosses (C). The percentage of variance explained by each PC is shown in parentheses.
Correlations between genomic EBV and phenotypes adjusted for fixed effects, for different groups of validation animals (purebred animals L1 and L2, and their crosses C) shown for traits 1 and 2 (T1 and T2) under the first model, obtained by the algorithm for proven and young (APY) inverse of genomic relationship matrix (G) where the number of core animals was based on the largest eigenvalues (Eig) explaining 90%, 98%, and 99% variance of G and was randomly sampled from different groups of genotyped animals (L1, L2, or L1 + L2 + C)
| Core | T1 | T2 | ||||
|---|---|---|---|---|---|---|
| L1 | L2 | C | L1 | L2 | C | |
| L1_Eig90 | 0.33 | 0.22 | 0.22 | 0.23 | 0.27 | 0.21 |
| L1_Eig98 | 0.33 | 0.29 | 0.24 | 0.23 | 0.31 | 0.23 |
| L1_Eig99 | 0.33 | 0.30 | 0.25 | 0.24 | 0.32 | 0.23 |
| L2_Eig90 | 0.31 | 0.32 | 0.24 | 0.20 | 0.35 | 0.20 |
| L2_Eig98 | 0.32 | 0.34 | 0.25 | 0.23 | 0.36 | 0.24 |
| L2_Eig99 | 0.33 | 0.34 | 0.26 | 0.23 | 0.36 | 0.24 |
| L1L2C_Eig90 | 0.32 | 0.30 | 0.24 | 0.23 | 0.33 | 0.22 |
| L1L2C_Eig98 | 0.33 | 0.33 | 0.25 | 0.24 | 0.36 | 0.24 |
| L1L2C_Eig99 | 0.33 | 0.34 | 0.26 | 0.24 | 0.36 | 0.24 |
Correlations between genomic EBV obtained by the algorithm for proven and young (APY) inverse and regular (direct) inverse of genomic relationship matrix (G), for different groups of validation animals (purebred animals L1 and L2, and their crosses C) shown for traits 1 and 2 (T1 and T2) under the first model
| T1 | T2 | |||||
|---|---|---|---|---|---|---|
| Core | L1 | L2 | C | L1 | L2 | C |
| L1_Eig90 | 0.97 | 0.77 | 0.87 | 0.96 | 0.81 | 0.88 |
| L1_Eig98 | 0.99 | 0.91 | 0.95 | 0.99 | 0.92 | 0.95 |
| L1_Eig99 | 0.99 | 0.93 | 0.97 | 0.99 | 0.94 | 0.97 |
| L2_Eig90 | 0.89 | 0.93 | 0.89 | 0.89 | 0.94 | 0.90 |
| L2_Eig98 | 0.97 | 0.99 | 0.97 | 0.96 | 0.99 | 0.97 |
| L2_Eig99 | 0.98 | 0.99 | 0.98 | 0.97 | 0.99 | 0.98 |
| L1L2C_Eig90 | 0.96 | 0.92 | 0.94 | 0.95 | 0.93 | 0.94 |
| L1L2C_Eig98 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 |
| L1L2C_Eig99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 |
For the APY inverse the number of core animals was based on the largest eigenvalues (Eig) explaining 90%, 98%, and 99% variance of G and was randomly sampled from different groups of genotyped animals (L1, L2, or L1 + L2 + C).