| Literature DB >> 33806889 |
Hewa Bahithige Pavithra Chathurangi Ariyarathne1, Martin Correa-Luna2, Hugh Thomas Blair1, Dorian John Garrick1, Nicolas Lopez-Villalobos1.
Abstract
The objective of this study was to identify genomic regions associated with milk fat percentage (FP), crude protein percentage (CPP), urea concentration (MU) and efficiency of crude protein utilization (ECPU: ratio between crude protein yield in milk and dietary crude protein intake) using grazing, mixed-breed, dairy cows in New Zealand. Phenotypes from 634 Holstein Friesian, Jersey or crossbred cows were obtained from two herds at Massey University. A subset of 490 of these cows was genotyped using Bovine Illumina 50K SNP-chips. Two genome-wise association approaches were used, a single-locus model fitted to data from 490 cows and a single-step Bayes C model fitted to data from all 634 cows. The single-locus analysis was performed with the Efficient Mixed-Model Association eXpedited model as implemented in the SVS package. Single nucleotide polymorphisms (SNPs) with genome-wide association p-values ≤ 1.11 × 10-6 were considered as putative quantitative trait loci (QTL). The Bayes C analysis was performed with the JWAS package and 1-Mb genomic windows containing SNPs that explained > 0.37% of the genetic variance were considered as putative QTL. Candidate genes within 100 kb from the identified SNPs in single-locus GWAS or the 1-Mb windows were identified using gene ontology, as implemented in the Ensembl Genome Browser. The genes detected in association with FP (MGST1, DGAT1, CEBPD, SLC52A2, GPAT4, and ACOX3) and CPP (DGAT1, CSN1S1, GOSR2, HERC6, and IGF1R) were identified as candidates. Gene ontology revealed six novel candidate genes (GMDS, E2F7, SIAH1, SLC24A4, LGMN, and ASS1) significantly associated with MU whose functions were in protein catabolism, urea cycle, ion transportation and N excretion. One novel candidate gene was identified in association with ECPU (MAP3K1) that is involved in post-transcriptional modification of proteins. The findings should be validated using a larger population of New Zealand grazing dairy cows.Entities:
Keywords: candidate gene; dairy cows; efficiency of crude protein utilization; genome-wide association studies; milk urea concentration; single nucleotide polymorphism
Year: 2021 PMID: 33806889 PMCID: PMC8004844 DOI: 10.3390/genes12030456
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Cows from each farm, breed group and production season used for the genome-wise association study.
| Production Season | |||||
|---|---|---|---|---|---|
| Farm | Breed | Only | Only | Both | Total |
| Dairy 1 | Friesian | 17 | 13 | 49 | 79 |
| Jersey | 15 | 18 | 40 | 73 | |
| F × J crossbred | 31 | 26 | 105 | 162 | |
| Dairy 4 | Friesian | 28 | 76 | 23 | 127 |
| Jersey | 1 | 2 | 2 | 5 | |
| F × J crossbred | 91 | 32 | 65 | 188 | |
Only 2016–2017 = number of cows milking only in season 2016–2017; only 2017–2018 = number of cows milking only in season 2017–2018; both seasons = number of cows milking in both 2016–2017 and 2017–2018 seasons.
Number of observations (N), mean, standard deviation (SD), coefficient of variation (CV), minimum (Min) and maximum (Max) values of milk percentage traits, milk urea concentration and efficiency of crude protein utilization in grazing dairy cows at Massey University Dairy 1 and Dairy 4.
| Trait | N | Mean | SD | CV (%) | Min | Max |
|---|---|---|---|---|---|---|
| FP, % | 6618 | 5.07 | 0.92 | 18 | 1.77 | 9.77 |
| CPP, % | 6618 | 4.05 | 0.55 | 14 | 2.72 | 7.66 |
| MU, mg/dL | 1866 | 25.6 | 8.31 | 32 | 6.08 | 61.7 |
| ECPU, % | 1866 | 24.4 | 6.65 | 27 | 4.00 | 45.5 |
FP = fat percentage, CPP = crude protein percentage, MU = milk urea concentration, ECPU = efficiency of crude protein utilization.
Estimates of variance components, heritabilities and repeatabilities of milk percentage traits, milk urea and efficiency of crude protein utilization using a univariate repeatability animal model in grazing dairy cows at Massey University Dairy 1 and Dairy 4.
| Trait | σ2a | σ2pw | σ2pa | σ2e | σ2p | h2 | t |
|---|---|---|---|---|---|---|---|
| FP | 0.13 | 0.00 | 0.06 | 0.18 | 0.37 | 0.35 | 0.51 |
| CPP | 0.05 | 0.00 | 0.00 | 0.03 | 0.08 | 0.62 | 0.63 |
| MU | 6.77 | 0.68 | 1.73 | 15.29 | 24.47 | 0.28 | 0.38 |
| ECPU | 0.57 | 0.00 | 0.49 | 26.18 | 27.25 | 0.02 | 0.04 |
FP = fat percentage, CPP = crude protein percentage, MU = milk urea concentration, ECPU = efficiency of crude protein utilization, σ2a = additive genetic variance, σ2pw = within lactation permanent environment variance, σ2pa = across lactation permanent environment variance, σ2e = residual variance, σ2p = phenotypic variance (σ2a + σ2pw + σ2pa + σ2e), h2 = trait heritability (σ2a/σ2p), t = repeatability ([σ2a + σ2pw + σ2pa]/σ2p).
Estimates of variance components of milk yield percentage traits, milk urea concentration and efficiency of crude protein utilization using a univariate single-step Bayesian (Bayes C, π = 0.997) linear mixed model in grazing dairy cows.
| Trait | σ2g | σ2m | σ2pw | σ2pa | σ2e | σ2p | S 2 | h2 | t |
|---|---|---|---|---|---|---|---|---|---|
| FP | 0.07 | 0.13 | 0.00 | 0.04 | 0.18 | 0.42 | 0.31 | 0.48 | 0.57 |
| CPP | 0.02 | 0.03 | 0.00 | 0.00 | 0.03 | 0.08 | 0.38 | 0.63 | 0.63 |
| MU | 6.64 | 3.37 | 0.41 | 0.53 | 15.58 | 26.53 | 0.13 | 0.38 | 0.41 |
| ECPU | 0.17 | 0.02 | 0.00 | 0.58 | 25.16 | 25.93 | 0.00 | 0.01 | 0.03 |
FP = fat percentage, CPP = crude protein percentage, MU = milk urea concentration, ECPU = efficiency of crude protein utilization, σ2g = additional polygenic variance, σ2m = additive genetic variance explained by the markers, σ2pw = within lactation permanent environment variance, σ2pa = across lactation permanent environment variance, σ2e = residual variance, σ2p = phenotypic variance (σ2g + σ2m + σ2pw + σ2pa + σ2e), S2 = genomic heritability (σ2m/σ2p), h2 = trait heritability ([σ2g + σ2m]/σ2p), t = repeatability ([σ2g + σ2m + σ2pw + σ2pa]/σ2p).
Figure 1Manhattan plot showing genome-wide associations for fat percentage (a), crude protein percentage (b), milk urea concentration (c) and efficiency of crude protein utilization (d). The red line shows the genome-wide significant level at −log10(p-value) = 5.95 and the blue line shows the suggestive association significant level at −log10(p-value) = 4.65.
The single nucleotide polymorphisms (SNPs) which reached significance in single-locus association for fat percentages at genome-wide threshold (p < 1 × 10−6).
| Trait | Locus | Chr | Position | Effect | Effect SE | Ref | MA | MAF | Gene | |
|---|---|---|---|---|---|---|---|---|---|---|
| FP | rs109421300 | 14 | 1,801,116 | 4.31 × 10−8 | −0.06 | 0.01 | C | T | 0.43 |
|
| rs137071126 | 14 | 1,765,835 | 5.45 × 10−8 | −0.06 | 0.01 | G | C | 0.45 |
|
FP = fat percentage, Chr = chromosome, Ref = reference allele, MA = minor allele, MAF = minor allele frequency, DGAT1 = diacylglycerol O-acyltransferase 1, SLC52A2 = solute carrier family 52 member 2.
Figure 2Manhattan plot showing genome-wide association window variance for fat percentage (a), crude protein percentage (b), milk urea concentration (c), and efficiency of crude protein utilization (d). The red line shows the genome-wide significant level at PVE = 0.37% and the blue line shows the suggestive association significant level at PVE = 0.19%.
The 1 Mb SNP windows surpass genome-wide significance level that is proportion of genetic variance (PVE) at 0.37% with their and window posterior probability of association (WPPA) for milk fat (FP) and crude protein (CPP) percentages, milk urea (MU) and efficiency of crude protein utilization (ECPU).
| Trait | Window | Chr | Start–End Window (Mb) | Start SNP | End SNP | No. of SNP | PVE (%) | WPPA | Gene |
|---|---|---|---|---|---|---|---|---|---|
| FP | 1506 | 14 | 1–2 | 1,118,964 | 1,971,143 | 22 | 38.12 | 1.00 |
|
| 633 | 5 | 9.3–9.4 | 93,005,014 | 93,995,487 | 34 | 3.29 | 0.86 |
| |
| 1602 | 15 | 1.2–1.3 | 12,020,185 | 12,928,912 | 18 | 0.53 | 0.30 | - | |
| 2418 | 27 | 3.6–3.7 | 36,075,350 | 36,959,262 | 16 | 0.53 | 0.18 |
| |
| 776 | 6 | 11.4–11.5 | 114,134,405 | 114,987,655 | 15 | 0.52 | 0.30 |
| |
| 2015 | 20 | 50–51 | 50,067,711 | 50,999,239 | 22 | 0.44 | 0.22 | - | |
| 2017 | 20 | 52–53 | 52,011,466 | 52,897,607 | 13 | 0.41 | 0.28 | - | |
| 1524 | 14 | 19–20 | 19,001,879 | 19,993,440 | 20 | 0.37 | 0.22 |
| |
| CPP | 1506 | 14 | 1–2 | 1,118,964 | 1,971,143 | 22 | 9.83 | 1.00 |
|
| 749 | 6 | 87–88 | 87,022,091 | 87,996,364 | 43 | 4.66 | 0.90 | ||
| 299 | 3 | 2–3 | 2,120,827 | 2,903,624 | 15 | 3.20 | 0.68 | - | |
| 1080 | 9 | 71–72 | 71,021,904 | 71,970,552 | 19 | 2.08 | 0.56 |
| |
| 2193 | 23 | 22–23 | 22,080,127 | 22,916,565 | 14 | 2.03 | 0.54 | - | |
| 1957 | 19 | 57–58 | 57,128,225 | 57,980,697 | 20 | 0.95 | 0.36 |
| |
| 615 | 5 | 75–76 | 75,013,265 | 75,993,374 | 26 | 0.79 | 0.44 |
| |
| 928 | 8 | 33–34 | 33,060,034 | 33,937,052 | 17 | 0.79 | 0.46 | - | |
| 1945 | 19 | 45–46 | 45,094,650 | 45,989,813 | 18 | 0.50 | 0.20 | ||
| 301 | 3 | 4–5 | 4,151,051 | 4,996,345 | 19 | 0.46 | 0.20 | - | |
| 700 | 6 | 38–39 | 38,019,605 | 38,939,012 | 49 | 0.45 | 0.30 |
| |
| 498 | 4 | 79–80 | 79,008,823 | 79,904,993 | 15 | 0.43 | 0.20 | - | |
| 1499 | 13 | 79–80 | 79,008,708 | 79,991,041 | 20 | 0.41 | 0.22 | - | |
| 1787 | 17 | 29–30 | 29,008,821 | 29,936,157 | 19 | 0.41 | 0.24 | - | |
| 2045 | 21 | 8–9 | 8,031,396 | 8,955,497 | 19 | 0.41 | 0.18 |
| |
| 2347 | 26 | 17–18 | 17,042,328 | 17,986,547 | 21 | 0.41 | 0.24 | - | |
| 309 | 3 | 12–13 | 12,112,945 | 12,941,656 | 20 | 0.40 | 0.24 | - | |
| 750 | 6 | 88–89 | 88,049,208 | 88,958,861 | 29 | 0.40 | 0.18 | - | |
| 2263 | 24 | 39–40 | 39,013,392 | 39,987,594 | 24 | 0.40 | 0.22 | - | |
| 647 | 5 | 107–108 | 107,190,274 | 107,946,258 | 17 | 0.38 | 0.22 | - | |
| MU | 2222 | 23 | 51–52 | 51,058,024 | 51,991,897 | 29 | 3.25 | 0.70 |
|
| 546 | 5 | 6–7 | 6,013,434 | 6,976,839 | 16 | 1.12 | 0.40 |
| |
| 1343 | 12 | 15–16 | 15,017,263 | 15,988,893 | 22 | 1.04 | 0.36 |
| |
| 1689 | 16 | 13–14 | 13,015,545 | 13,976,725 | 19 | 0.95 | 0.28 | - | |
| 2505 | 29 | 30–31 | 30,075,012 | 30,986,595 | 17 | 0.86 | 0.30 | - | |
| 666 | 6 | 4–5 | 4,068,561 | 4,868,243 | 18 | 0.67 | 0.26 | - | |
| 2094 | 21 | 57–58 | 57,094,715 | 57,948,571 | 86 | 0.65 | 0.42 | ||
| 735 | 6 | 73–74 | 73,015,703 | 73,990,002 | 21 | 0.64 | 0.30 | - | |
| 1320 | 11 | 100–101 | 100,018,300 | 100,942,106 | 18 | 0.64 | 0.20 |
| |
| 749 | 6 | 87–88 | 87,022,091 | 87,996,364 | 43 | 0.62 | 0.44 | - | |
| 2314 | 25 | 27–28 | 27,028,950 | 27,987,306 | 23 | 0.6 | 0.30 | - | |
| 2140 | 22 | 31–32 | 31,020,826 | 31,942,134 | 16 | 0.59 | 0.24 | - | |
| 746 | 6 | 84–85 | 84,035,488 | 84,986,240 | 28 | 0.57 | 0.24 | - | |
| 2397 | 27 | 15–16 | 15,006,739 | 15,961,806 | 18 | 0.41 | 0.18 | - | |
| 2175 | 23 | 4–5 | 4,016,329 | 4,980,889 | 17 | 0.40 | 0.18 | - | |
| 364 | 3 | 67–68 | 67,096,213 | 67,987,280 | 14 | 0.39 | 0.22 | - | |
| 1770 | 17 | 12–13 | 12,057,061 | 12,986,781 | 17 | 0.39 | 0.16 | - | |
| 2481 | 29 | 6–7 | 6,141,144 | 6,976,219 | 13 | 0.39 | 0.14 | - | |
| ECPU | 1987 | 20 | 22–23 | 22,006,676 | 22,960,402 | 99 | 0.41 | 0.30 |
|
Chr = chromosome, DGAT1 = diacylglycerol O-acyltransferase 1, GPAT4 = glycerol-3-phosphate acyltransferase 4, MGST1 = microsomal glutathione S-transferase 1, ACOX3 = acyl-CoA oxidase 3 pristanoyl, and CEBPD = CCAAT enhancer binding protein delta, CSN1S1 = α-S1-casein CSN2 = β-casein, CSN1S2 = α-S2-casein, CSN3 = kappa-casein, RPS12 = Bos taurus ribosomal protein S12, GRIN2C = glutamate ionotropic receptor NMDA type subunit 2C, EIF3D = eukaryotic translation initiation factor 3 subunit D, ADAM11 = ADAM metallopeptidase domain 11, GOSR2 = golgi SNAP receptor complex member 2, HERC6 = HECT and RLD domain containing E3 ubiquitin protein ligase family member 6, IGF1R = insulin-like growth factor 1 receptor, GMDS = GDP-mannose 4,6-dehydratase, E2F7 = E2F transcription factor 7, SIAH3 = solute carrier family 52 member 2, SLC24A4 = solute carrier family 24 member 4, LGMN = legumain, ASS1 = argininosuccinate synthase 1, MAP3K1 = mitogen-activated protein kinase kinase 1.