| Literature DB >> 28923017 |
Marie-Pierre Sanchez1, Armelle Govignon-Gion2,3, Pascal Croiseau2, Sébastien Fritz2,4, Chris Hozé2,4, Guy Miranda2, Patrice Martin2, Anne Barbat-Leterrier2, Rabia Letaïef2, Dominique Rocha2, Mickaël Brochard3, Mekki Boussaha2, Didier Boichard2.
Abstract
BACKGROUND: Genome-wide association studies (GWAS) were performed at the sequence level to identify candidate mutations that affect the expression of six major milk proteins in Montbéliarde (MON), Normande (NOR), and Holstein (HOL) dairy cattle. Whey protein (α-lactalbumin and β-lactoglobulin) and casein (αs1, αs2, β, and κ) contents were estimated by mid-infrared (MIR) spectrometry, with medium to high accuracy (0.59 ≤ R2 ≤ 0.92), for 848,068 test-day milk samples from 156,660 cows in the first three lactations. Milk composition was evaluated as average test-day measurements adjusted for environmental effects. Next, we genotyped a subset of 8080 cows (2967 MON, 2737 NOR, and 2306 HOL) with the BovineSNP50 Beadchip. For each breed, genotypes were first imputed to high-density (HD) using HD single nucleotide polymorphisms (SNPs) genotypes of 522 MON, 546 NOR, and 776 HOL bulls. The resulting HD SNP genotypes were subsequently imputed to the sequence level using 27 million high-quality sequence variants selected from Run4 of the 1000 Bull Genomes consortium (1147 bulls). Within-breed, multi-breed, and conditional GWAS were performed.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28923017 PMCID: PMC5604355 DOI: 10.1186/s12711-017-0344-z
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Features of the Montbéliarde (MON), Normande (NOR), Holstein (HOL), and multi-breed populations
| Number of | MON | NOR | HOL | Multi-breed |
|---|---|---|---|---|
| Phenotyped cows | 44,959 | 12,428 | 14,530 | 71,917 |
| Total test-day records | 293,780 | 58,594 | 72,973 | 425,347 |
| Test-day records per cow | 6.5 | 4.7 | 5 | 5.9 |
| Genotyped cows | 3032 | 2659 | 2216 | 7907 |
| Polymorphic 50 K SNPs | 37,332 | 37,690 | 39,158 | 41,028 |
| Polymorphic HD SNPs | 548,185 | 549,359 | 553,712 | 586,749 |
| Polymorphic sequence variants | 15,957,336 | 14,809,860 | 15,116,501 | 18,366,748 |
| Sequence variants (MAF ≥ 2%) | 11,755,172 | 11,445,432 | 11,592,432 | 13,534,013 |
MIR predictions for milk protein composition in Montbéliarde (MON), Normande (NOR), and Holstein (HOL) cows
| Trait | Accuracya | Means ± standard deviationsb | ||||
|---|---|---|---|---|---|---|
| R2 | RE | MON | NOR | HOL | ||
| PC | Protein content | 1.00 | 0.73 | 3.4 ± 0.4 | 3.6 ± 0.4 | 3.3 ± 0.4 |
| α-LA | α-lactalbumin | 0.59 | 14.4 | 4.07 ± 0.28 | 4.16 ± 0.36 | 4.27 ± 0.42 |
| β-LG | β-lactoglobulin | 0.74 | 11.7 | 8.25 ± 1.12 | 7.94 ± 1.03 | 8.46 ± 1.17 |
| αs1-CN | αs1-casein | 0.88 | 4.7 | 27.8 ± 0.55 | 27.8 ± 0.68 | 27.9 ± 0.69 |
| αs2-CN | αs2-casein | 0.82 | 7.5 | 9.53 ± 0.30 | 9.89 ± 0.33 | 9.69 ± 0.39 |
| β-CN | β-casein | 0.92 | 3.7 | 36.6 ± 0.88 | 36.2 ± 1.2 | 36.2 ± 1.2 |
| κ-CN | κ-casein | 0.80 | 8.4 | 9.75 ± 0.60 | 9.87 ± 0.48 | 9.43 ± 0.58 |
|
| Sum of caseins | 0.97 | 2.7 | 83.7 ± 0.94 | 83.7 ± 1.5 | 83.1 ± 1.4 |
|
| Sum of whey proteins | 0.73 | 8.9 | 12.6 ± 1.1 | 11.9 ± 1.2 | 12.6 ± 1.3 |
aAccuracy of MIR predictions (R2 = coefficient of determination and RE = relative error) estimated by Ferrand et al. [7] for protein composition expressed as g/100 g milk
bg/100 g milk for protein content (PC) and g/100 g protein for other traits
Accuracies of imputations on whole-genome sequences in Holstein (HOL) and Montbéliarde (MON) breeds
| Breed | HOL | MON |
|---|---|---|
| Number of cows | 168 | 2142 |
| EuroG10k chip version | V1 | V4 |
| Number of markers in the custom part | 721 | 3082 |
| Number of markers after quality control and MAF ≥ 0.02 | 221 | 1108 |
| R2 (%) | 83.7 | 76.1 |
| Genotypic concordance rate (%) | 93.7 | 89.7 |
| Allelic concordance rate (%) | 96.5 | 94.0 |
Fig. 1Precision of imputations at the whole-genome sequence level in a Holstein and b Montbéliarde breeds, according to MAF. The 2% limit corresponds to the MAF threshold for markers used in GWAS
Number of variants with genome-wide significant effects (−log10 (P) > 8.4) for milk composition traits in within-and multi-breed analyses
| Trait | Within-breed analyses | Multi-breed analyses | |||
|---|---|---|---|---|---|
| MONa | NORa | HOLa | Shared among three breeds | ||
| PC | 1905 | 1201 | 2394 | 0 | 2350 |
| α-LA | 4590 | 6490 | 8248 | 213 | 7224 |
| β-LG | 19,952 | 16,048 | 15,517 | 2266 | 18,612 |
| αs1-CN | 2232 | 708 | 629 | 182 | 2280 |
| αs2-CN | 866 | 193 | 636 | 1 | 1947 |
| β-CN | 665 | 734 | 524 | 96 | 1652 |
| κ-CN | 4110 | 5878 | 6532 | 553 | 7012 |
|
| 13,920 | 8716 | 11,833 | 961 | 12,698 |
|
| 16,583 | 13,126 | 15,327 | 1916 | 16,546 |
| Total number of distinct variants | 29,722 | 27,787 | 30,988 | 3080 | 34,248 |
aMontbéliarde (MON), Normande (NOR), and Holstein (HOL) cows
Fig. 2Number of overlapping a variants with genome-wide significant effects (−log10(P) ≥ 8.4), b QTL regions, and c genes containing variants with genome-wide significant effects among Montbéliarde, Normande, and Holstein breeds
Fig. 3−log10(P) plotted against the position of variants on BTA2, 6, and 20 using three SNP densities: 50 K, HD, and whole-genome sequence SNPs
Functional annotations of variants included within confidence intervals (±100 kb) of the 34 QTL in the three within-and multi-breed analyses
| Functional annotation | Within-breed analyses | Multi-breed analyses | ||
|---|---|---|---|---|
| MONa | NORa | HOLa | ||
| Intergenic | 1514 | 1465 | 2676 | 1971 |
| Intronic | 1079 | 1804 | 1937 | 1737 |
| 3′ UTR | 11 | 14 | 69 | 35 |
| 5′ UTR | 14 | 27 | 16 | 18 |
| Downstream | 710 | 988 | 1276 | 1159 |
| Inframe insertion | 0 | 0 | 1 | 0 |
| Missense | 25 | 82 | 72 | 56 |
| Splice acceptor | 0 | 0 | 3 | 0 |
| Synonymous | 30 | 114 | 118 | 91 |
| Upstream | 509 | 1009 | 612 | 685 |
| % genic | 61.1 | 73.4 | 60.5 | 65.8 |
| % genic non intronic | 33.4 | 40.6 | 32.0 | 35.5 |
aMontbéliarde (MON), Normande (NOR), and Holstein (HOL) cows
Fig. 4Protein network of the 22 most probable candidate genes detected, according to STRING v10.0 action view
Gene Ontology (GO) functional enrichment with false discovery rate (FDR) < 0.05
| Pathway ID | Pathway description | Gene count | FDR | Genes | |
|---|---|---|---|---|---|
| Biological process | GO.1903494 | Response to dehydroepiandrosterone | 4 | 1.73e–08 |
|
| GO.1903496 | Response to 11-deoxycorticosterone | 4 | 1.73e–08 |
| |
| GO.0032570 | Response to progesterone | 4 | 1.81e–07 |
| |
| GO.0097305 | Response to alcohol | 5 | 3.69e–07 |
| |
| GO.0032355 | Response to estradiol | 4 | 2.34e–06 |
| |
| GO.1901700 | Response to oxygen-containing compound | 6 | 9.04e–05 |
| |
| GO.0014070 | Response to organic cyclic compound | 5 | 0.000176 |
| |
| GO.0033993 | Response to lipid | 5 | 0.000181 |
| |
| GO.0009719 | Response to endogenous stimulus | 5 | 0.00205 |
| |
| GO.0048732 | Gland development | 3 | 0.0281 |
| |
| GO.0060416 | Response to growth hormone | 2 | 0.0281 |
| |
| GO.0007595 | Lactation | 2 | 0.0298 |
| |
| Cellular component | GO.0005796 | Golgi lumen | 4 | 1.97e–08 |
|
| GO.0012505 | Endomembrane system | 8 | 0.00253 |
| |
| GO.0005576 | Extracellular region | 7 | 0.0372 |
| |
| GO.0005789 | Endoplasmic reticulum membrane | 4 | 0.0372 |
| |
| GO.0042175 | Nuclear outer membrane-endoplasmic reticulum membrane network | 4 | 0.0372 |
| |
| GO.0044444 | Cytoplasmic part | 9 | 0.0372 |
| |
| GO.0044446 | Intracellular organelle part | 9 | 0.0372 |
| |
| Molecular function | GO.0035375 | Zymogen binding | 2 | 0.0177 |
|