| Literature DB >> 31533617 |
Claire Oget1, Marc Teissier2, Jean-Michel Astruc3, Gwenola Tosser-Klopp2, Rachel Rupp2.
Abstract
BACKGROUND: Genomic evaluation is usually based on a set of markers assumed to be linked with causal mutations. Selection and precise management of major genes and the remaining polygenic component might be improved by including causal polymorphisms in the evaluation models. In this study, various methods involving a known mutation were used to estimate prediction accuracy. The SOCS2 gene, which influences body growth, milk production and somatic cell scores, a proxy for mastitis, was studied as an example in dairy sheep.Entities:
Keywords: Causal mutation; Dairy sheep; Genome-wide association study; Genomic evaluation; Genomics
Mesh:
Year: 2019 PMID: 31533617 PMCID: PMC6751880 DOI: 10.1186/s12864-019-6068-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Evolution of the frequency of the SOCS2 T allele, associated with increased susceptibility to mastitis in 4699 rams used for artificial insemination
Genetic correlations between the SOCS2 gene content trait and the traits of interest (rg) and genetic variances (σg2) explained by the SOCS2 gene obtained using the pedigree-based Gene Content method
| Trait | rg with | σg2 explained by |
|---|---|---|
| MY | 0.25 | 6.18% |
| FY | 0.18 | 3.22% |
| PY | 0.29 | 8.55% |
| FC | –0.14 | 1.88% |
| PC | –0.06 | 0.41% |
| LSCS | 0.34 | 11.24% |
| TA | –0.02 | 0.05% |
| UC | –0.07 | 0.56% |
| UD | –0.19 | 3.71% |
Abbreviations: MY Milk Yield, FY Fat Yield, PY Protein Yield, FC Fat Content, PC Protein Content, LSCS Somatic Cell Score, TA Teat Angle, UC Udder Cleft, UD Udder Depth
Prediction accuracies of different genetic evaluation methods for each trait using information about the SOCS2 gene or not
| Trait | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| MY | FY | PY | FC | PC | LSCS | TA | UC | UD | ||
| Prediction accuracy using | Pedigree-based BLUP | 0.507 | 0.389 | 0.330 | 0.693 | 0.684 | 0.421 | 0.451 | 0.477 | 0.336 |
| ssGBLUP | 0.549 | 0.450 | 0.463 | 0.724 | 0.745 | 0.454 | 0.523 | 0.473 | 0.423 | |
| ssGBLUP | 0.550 | 0.450 | 0.465 | 0.724 | 0.745 | 0.456 | 0.523 | 0.473 | 0.424 | |
| WssGBLUP(classical, 1) | 0.498 | 0.422 | 0.437 | 0.723 | 0.730 | 0.421 | 0.538 | 0.473 | 0.452 | |
| The best WssGBLUP(m, n) method | 0.561 | 0.461 | 0.486 | 0.739 | 0.762 | 0.471 | 0.538 | 0.504 | 0.460 | |
| Pedigree-based Gene Content | 0.557 | 0.430 | 0.405 | 0.698 | 0.688 | 0.438 | 0.448 | 0.512 | 0.366 | |
| Gain in prediction accuracy between | Pedigree-based BLUP & ssGBLUP | 8.25% | 15.54% | 40.34% | 4.41% | 8.95% | 7.80% | 15.84% | -0.96% | 26.07% |
| ssGBLUP & the best WssGBLUP method | 2.16% | 2.46% | 5.04% | 2.06% | 2.32% | 3.77% | 2.80% | 6.59% | 8.75% | |
| Without & with the | 0.22% | 0.13% | 0.51% | 0.02% | 0.10% | 1.06% | -0.02% | 0.02% | 0.26% | |
| Pedigree-based BLUP & pedigree-based Gene Content | 8.88% | 9.54% | 18.64% | 0.68% | 0.57% | 3.80% | -0.89% | 6.72% | 8.33% | |
| Parameters of the best WssGBLUP(m, n) method | Maximum 100 | Maximum 200 | Maximum 200 | Maximum 40 | Maximum 45 | Mean 200 | Classical | Sum 30 | Maximum 5 | |
Abbreviations: MY Milk Yield, FY Fat Yield, PY Protein Yield, FC Fat Content, PC Protein Content, LSCS Somatic Cell Score, TA Teat Angle, UC Udder Cleft, UD Udder Depth
Fig. 2Prediction accuracies of genomic selection from the second iterations of the different WssGBLUP methods for the LSCS trait with (orange curve: WssGBLUP) and without (blue curve: WssGBLUP(m, n)) the SOCS2 genotype. Four approaches to the WssGBLUP were computed (m = classical, mean, maximum or sum). The classical WssGBLUP approach (m = classical) gives a different weight for each marker of the chip. In alternative approaches, the chip is decomposed into non-overlapping windows of n markers (we tested n = 2, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100, and 200) and within these windows, all markers are assigned the same weight: the mean weight of the n SNPs (m = mean), the maximum weight of the n SNPs (m = maximum), and the sum of the n SNPs weights (m = sum)
Fig. 3Genetic trends over the years in the reference population (5343 rams with reference performances, i.e., daughter yield deviations, born between 1996 and 2015), of the EBVs for LSCS using the Gene Content method which enables the polygenic component (EBV), excluding the effect of the SOCS2 gene, and the breeding value due to the gene effect (EBV), to be distinguished
QTL (Quantitative Trait Loci) regions (positions on ovine genome assembly v4.0) found using the best alternative WssGBLUP method for each trait (SOCS2 SNP included among the markers), and based on a threshold of 1% of genetic variance explained
| OAR | QTL region (Mb) | Trait associated with the QTL | Genetic variance explained in the trait-specific QTL regiona (%) | Trait-specific QTL regiona (Mb) |
|---|---|---|---|---|
| 3 | 128.3 - 130.5 | LSCS | 12.00 | 129.1 - 130.5 |
| PY | 4.91 | 129.0–130.3 | ||
| UD | 4.02 | 129.1–130.5 | ||
| MY | 3.94 | 128.3–129.6 | ||
| FC | 2.57 | 129.1–130.5 | ||
| UC | 1.84 | 129.1–130.5 | ||
| 136.3 - 137.6 | PC | 4.61 | 136.3–137.3 | |
| FC | 1.20 | 136.4–137.6 | ||
| 140.1 - 141.5 | PC | 1.24 | 140.1–141.5 | |
| 6 | 84.7 - 85.8 | PC | 5.95 | 84.7–85.8 |
| 11 | 33.1 - 34.9 | FC | 6.65 | 33.4–34.9 |
| PY | 2.25 | 33.3–34.5 | ||
| LSCS | 1.98 | 33.3–34.5 | ||
| MY | 1.31 | 33.1–34.3 | ||
| 13 | 63.4 - 64.5 | FC | 1.17 | 63.4–64.5 |
| 17 | 8.5 - 10.5 | MY | 1.14 | 8.5–9.7 |
| FC | 1.09 | 8.6–9.9 | ||
| PC | 1.02 | 9.0–10.5 | ||
| 19 | 44.5 - 45.6 | UC | 2.33 | 44.5–45.6 |
| 20 | 48.8 - 49.8 | LSCS | 3.09 | 48.8–49.8 |
| 23 | 32.4 - 33.9 | UC | 2.01 | 32.4–33.9 |
Abbreviations: OAR Ovis ARies, QTL Quantitative Trait Loci, Mb Megabase, MY Milk Yield, FY Fat Yield, PY Protein Yield, FC Fat Content, PC Protein Content, LSCS Somatic Cell Score, TA Teat Angle, UC Udder Cleft, UD Udder Depth
aTrait-specific QTL regions are regions where 20 adjacents SNPs explain the highest value of genetic variance of the trait
Description of the Lacaune dairy sheep dataset used for genetic evaluation
| Trait | Mean ± SD | Number of | |||
|---|---|---|---|---|---|
| Ewes | Lactationsa | Individuals in the pedigree file | Rams in the validation populationb | ||
| MY (L) | 292.91 ± 85.46 | 1,503,148 | 3,575,614 | 1,651,901 | 264 |
| FY (g) | 213.14 ± 51.56 | 1,124,636 | 1,841,351 | 1,336,060 | 263 |
| PY (g) | 169.72 ± 39.33 | ||||
| FC (g/L) | 66.54 ± 8.51 | ||||
| PC (g/L) | 52.89 ± 4.63 | ||||
| LSCS | 3.12 ± 1.56 | 769,929 | 1,321,411 | 1,031,375 | 263 |
| TA | 7.15 ± 1.06 | 349,134 | 349,134 | 349,134 | 250 |
| UC | 5.04 ± 1.26 | 349,132 | 349,132 | 653,908 | 249 |
| UD | 6.42 ± 0.70 | 349,132 | 349,132 | 653,907 | 253 |
Abbreviations MY Milk Yield, FY Fat Yield, PY Protein Yield, FC Fat Content, PC Protein Content, LSCS Somatic Cell Score, TA Teat Angle, UC Udder Cleft, UD Udder Depth
aMY, FY, PY, FC, PC and LSCS were measured during the first three lactations (lactation average); TA, UC, and UD were measured once during the first lactation
bRams born in 2015 with Estimated Breeding Values (EBVs) and reference performances, i.e. Daughter Yield Deviation (DYD)
Description of the different genetic evaluation models based on a single-step approach and using information about the SOCS2 gene or not
| Approach | Model | Use of | Information used in the relationship matrix | ||
|---|---|---|---|---|---|
| Pedigree | 54K SNPs | ||||
| Single-trait | Pedigree-based BLUP | No | Yes | No | No |
| ssGBLUP | No | Yes | Yes | No | |
| ssGBLUP | Yes | Yes | Yes | Yes | |
| WssGBLUP(m, n)b | No | Yes | Yes | No | |
| WssGBLUP | Yes | Yes | Yes | Yes | |
| Multiple-trait | Pedigree-based Gene Content | Yes (as a trait) | Yes | No | No |
Abbreviations: GBLUP Genomic Best Linear Unbiaised Prediction, ss single-step, W Weighted
aThe term SOCS2 here means that the SOCS2 SNP has been added to the 54K SNPs of the chip
bFour approaches to the WssGBLUP were computed (m = classical, mean, maximum or sum). The classical WssGBLUP approach (m = classical) gives a different weight for each marker of the chip. In alternative approaches, the chip is decomposed into non-overlapping windows of n markers (we tested n = 2, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100, and 200) and within these windows, all markers are assigned the same weight: the mean weight of the n SNPs (m = mean), the maximum weight of the n SNPs (m = maximum), and the sum of the n SNPs weights (m = sum)
Description of fixed effects for the evaluation models of each phenotype
| Trait | Fixed effects | Number of levels |
|---|---|---|
| MY | • herd within year and within parity • age at delivery within year and within parity • month at delivery within year and within parity • time between delivery and first OMR within year and within parity | 42259 514 702 585 |
| FY | • herd within year and within parity • age at delivery within year and within parity • quality control within year and within parity | 20783 269 348 |
| PY | ||
| FC | ||
| PC | ||
| LSCS | • herd within year and within parity • age at delivery within year and within parity • month at delivery within year and within parity | 14306 206 312 |
| TA | • herd within year • interaction between examiner and the time difference between milking and scoring, within herd • interaction between age at delivery and lactation stage, within year • number of lambs within year | 3672 2346 160 30 |
| UC | ||
| UD |
Abbreviations: MY Milk Yield, FY Fat Yield, PY Protein Yield, FC Fat Content, PC Protein Content, LSCS Somatic Cell Score, TA Teat Angle, UC Udder Cleft, UD Udder Depth