| Literature DB >> 34440337 |
Manuel J Wolf1, Tong Yin1, Guilherme B Neumann2, Paula Korkuć2, Gudrun A Brockmann2, Sven König1, Katharina May1.
Abstract
This genome-wide association study (GWAS) aimed to identify sequence variants (SVs) and candidate genes associated with fertility and health in endangered German Black Pied cattle (DSN) based on whole-genome sequence (WGS) data. We used 304 sequenced DSN cattle for the imputation of 1797 genotyped DSN to WGS. The final dataset included 11,413,456 SVs of 1886 cows. Cow traits were calving-to-first service interval (CTFS), non-return after 56 days (NR56), somatic cell score (SCS), fat-to-protein ratio (FPR), and three pre-corrected endoparasite infection traits. We identified 40 SVs above the genome-wide significance and suggestive threshold associated with CTFS and NR56, and three important potential candidate genes (ARHGAP21, MARCH11, and ZNF462). For SCS, most associations were observed on BTA 25. The GWAS revealed 61 SVs, a cluster of 10 candidate genes on BTA 13, and 7 pathways for FPR, including key mediators involved in milk fat synthesis. The strongest associations for gastrointestinal nematode and Dictyocaulus viviparus infections were detected on BTA 8 and 24, respectively. For Fasciola hepatica infections, the strongest associated SVs were located on BTA 4 and 7. We detected 200 genes for endoparasite infection traits, related to 16 pathways involved in host immune response during infection.Entities:
Keywords: GWAS; candidate genes; complex traits; dairy cows; dual-purpose cattle; endangered breed; endoparasite resistance
Mesh:
Year: 2021 PMID: 34440337 PMCID: PMC8391191 DOI: 10.3390/genes12081163
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Descriptive statistics for fertility traits, health and metabolic stability indicator traits, and endoparasite infection traits in DSN.
| Trait 1 | No. of Records | No. of Cows | No. of Cows with Sequence Level Genotypes | Mean 2 | SD 2 | Min. 2 | Max. 2 |
|---|---|---|---|---|---|---|---|
| CTFS | 1683 | 1683 | 1683 | 78.31 | 27.44 | 26.0 | 241.0 |
| NR56 | 1683 | 1683 | 1683 | 0.64 | 0.48 | 0 | 1.0 |
| SCS | 1638 | 1638 | 1638 | 2.58 | 1.48 | −1.06 | 8.88 |
| FPR | 1638 | 1638 | 1638 | 1.18 | 0.18 | 0.45 | 2.42 |
| FEC-GIN | 1997 | 1166 | 142 | 11.35 | 22.57 | 0 | 225.0 |
| FEC-FH | 2006 | 1166 | 142 | 0.61 | 3.64 | 0 | 89.0 |
| FLC-DV | 1988 | 1163 | 142 | 0.17 | 2.14 | 0 | 46.0 |
1 CTFS = calving-to-first service interval; NR56 = non-return after day 56; SCS = somatic cell score (log-transformed somatic cell count: log2 (SCC/100,000) +3); FPR = fat-to-protein ratio; FEC-GIN = fecal egg count for gastrointestinal nematodes; FEC-FH = fecal egg count for Fasciola hepatica; FLC-DV = fecal larvae count for Dictyocaulus viviparus. 2 Mean, SD, Min. and Max. values are given for the number of records.
Figure 1Manhattan plots for −log10 p-values of marker effects for (A) calving-to-first service interval (CTFS) and (B) non-return 56 (NR56). Markers above the genome-wide significance threshold pBonf (red dashed line) are highlighted in red and markers above the suggestive significance threshold pSug (blue dashed line) are highlighted in blue. Markers within a distance of 125 kb up- and downstream of the significantly or suggestively associated SVs are highlighted in red or blue, respectively.
Potential candidate genes related to the identified sequence variants (SVs) significantly associated with the female fertility traits calving-to first service interval (CTFS) and non-return 56 (NR56) in DSN.
| BTA | Gene Position 1 | No. of SVs within/Close to Gene 2 | Position of Maximum Association ( | rs Number of Maximum Association | Gene Name |
|---|---|---|---|---|---|
| CTFS | |||||
| 12 | 43,998,992–44,525,321 | 3/0 | 44,156,560 (2.93 × 10−6) * | rs136060929 |
|
| 13 | 25,463,655–25,592,062 | 5/10 | 25,577,261 (2.25 × 10−7) * | rs384930569 |
|
| 15 | 58,214,580–58,224,553 | 0/1 | 58,258,453 (1.32 × 10−6) * | rs41777070 |
|
| 78,914,154–78,915,587 | 0/1 | 78,929,182 (2.08 × 10−6) * | rs379801720 |
| |
| 79,013,311–79,014,249 | 0/1 | 78,929,182 (2.08 × 10−6) * | rs379801720 |
| |
| 28 | 43,760,392–43,804,579 | 10/0 | 43,796,638 (1.37 × 10−6) * | rs42155599 |
|
| NR56 | |||||
| 8 | 96,644,114–96,759,752 | 2/0 | 96,724,190 (4.07 × 10−7) * | rs110809463 |
|
| 11 | 74,126,883–74,224,519 | 1/0 | 74,190,369 (8.30 × 10−7) * | rs383197946 |
|
| 20 | 56,953,217–57,071,806 | 0/1 | 56,867,118 (1.34 × 10−6) * | rs207515592 |
|
1 Gene position (start-end) in ENSEMBL build on Bos taurus genome assembly ARS-UCD1.2; 2 number of associations that reached the Bonferroni-corrected genome-wide significance threshold (pBonf) or the suggestive significance threshold (pSug) based on the position of the identified candidate gene ± 100 kb up- and downstream; * above pSug; BTA = Bos taurus chromosome.
Figure 2Manhattan plots for −log10 p-values of marker effects for (A) somatic cell score (SCS) and (B) fat-to-protein ratio (FPR). Markers above the genome-wide significance threshold pBonf (red dashed line) are highlighted in red and markers above the suggestive significance threshold pSug (blue dashed line) are highlighted in blue. Markers within a distance of 125 kb up- and downstream of the significantly or suggestively associated SVs are highlighted in red or blue, respectively.
Potential candidate genes related to the identified sequence variants (SVs) significantly associated with the udder health indicator trait somatic cell score (SCS) and with the metabolic health and stability indicator trait fat-to-protein ratio (FPR) in DSN.
| BTA | Gene Position 1 | No. of SVs within/Close to Gene 2 | Position of Maximum Association ( | rs Number of Maximum Association | Gene Name |
|---|---|---|---|---|---|
| SCS | |||||
| 1 | 5,689,903–5,690,709 | 0/1 | 5,693,321 (2.49 × 10−6) * | - |
|
| 5,770,225–5,772,218 | 0/1 | 5,693,321 (2.49 × 10−6) * | - |
| |
| 9 | 15,242,718–15,373,861 | 1/0 | 15,301,365 (2.76 × 10−6) * | - |
|
| 11 | 101,323,401–101,414,345 | 0/1 | 101,435,611 (2.06 × 10−6) * | rs211669575 |
|
| 101,450,754–101,465,047 | 0/1 | 101,435,611 (2.06 × 10−6) * | rs211669575 |
| |
| 101,696,993–101,834,040 | 2/0 | 101,761,967 (1.30 × 10−7) ** | rs137783421 |
| |
| 13 | 61,874,277–61,955,381 | 0/1 | 61,959,686 (1.04 × 10−6) * | rs211178277 |
|
| 23 | 42,945,450–43,017,711 | 1/0 | 42,9749,92 (4.20 × 10−7) * | rs876215027 |
|
| 25 | 6,224,841–6,638,491 | 2/45 | 6,654,595 (2.58 × 10−7) * | rs136166815 |
|
| FPR | |||||
| 9 | 83,287,300–83,713,833 | 1/0 | 83,557,515 (3.16 × 10−7) * | rs467161057 |
|
| 11 | 6,481,883–6,555,244 | 0/4 | 6,464,004 (3.06 × 10−7) * | rs134892674 |
|
| 12 | 11,675,264–12,071,822 | 15/1 | 11,736,920 (1.39 × 10−7) ** | rs439994366 |
|
| 12,256,326–12,294,096 | 0/1 | 12,195,520 (2.71 × 10−6) * | rs209487147 |
| |
| 12,410,713–12,461,551 | 0/9 | 12,496,948 (1.49 × 10−6) * | rs208793423 |
| |
| 13 | 58,456,060–58,457,612 | 0/2 | 58,382,430 (2.12 × 10−6) * | rs42020993 |
|
| 60,191,141–60,234,885 | 1/0 | 60,228,471 (2.65 × 10−6) * | - |
| |
| 61,107,684–61,147,486 | 2/0 | 61,145,994 (8.51 × 10−7) * | - |
| |
| 62,381,695–62,408,299 | 0/1 | 62,434,039 (1.91 × 10−6) * | - |
| |
| 62,503,682–62,514,220 | 0/1 | 62,434,039 (1.91 × 10−6) * | - |
| |
| 62,645,078–62,655,447 | 0/1 | 62,659,594 (2.82 × 10−6) * | - |
| |
| 62,676,036–62,685,906 | 0/1 | 62,659,594 (2.82 × 10−6) * | - |
| |
| 65,100,388–65,148,653 | 2/0 | 65,123,915 (1.50 × 10−6) * | rs109380861 |
| |
| 72,891,431–72,905,729 | 1/0 | 72,897,116 (1.46 × 10−6) * | rs137243257 |
| |
| 75,255,202–75,339,020 | 2/0 | 75,261,912 (2.42 × 10−6) * | rs137115876 |
| |
| 19 | 30,045,106–30,177,093 | 0/1 | 30,186,388 (2.15 × 10−6) * | - |
|
| 27 | 36,454,819–36,470,324 | 0/12 | 36,520,069 (6.89 × 10−7) * | rs211250281 |
|
| 36,522,605–36,539,773 | 1/12 | 36,520,069 (6.89 × 10−7) * | rs211250281 |
| |
| 41,461,703–41,896,044 | 1/0 | 41,614,645 (2.30 × 10−6) * | rs135231909 |
|
1 Gene position (start-end) in ENSEMBL build on Bos taurus genome assembly ARS-UCD1.2; 2 number of associations that reached the Bonferroni-corrected genome-wide significance threshold (pBonf) or the suggestive significance threshold (pSug) based on the position of the identified candidate gene ± 100 kb up- and downstream; * above pSug; ** above pBonf; BTA = Bos taurus chromosome.
Pathways (sorted alphabetically) related to the identified candidate genes associated with fat-to-protein ratio (FPR), somatic cell score (SCS), and endoparasite infection traits (RES-GIN, RES-FH, RES-DV), selected from the DAVID and KEGG databases and previously described in literature to be associated with the corresponding trait.
| Pathway | KEGG Entry | Trait 1 | Candidate Gene (BTA) | Possible Association of Pathway with Trait according to Literature |
|---|---|---|---|---|
| B cell receptor signaling pathway | bta04662 | RES-GIN | Involvement of B cells in immune response to gastrointestinal nematodes in ruminants [ | |
| Calcium signaling pathway | FPR | Pathway associated with milk fat content in Holstein cattle [ | ||
| Cell adhesion molecules | bta04514 | SCS | Identification of the cell adhesin molecules pathway for | |
| RES-DV | Cell adhesion molecules pathway identified for | |||
| cGMP-PKG signaling pathway | bta04022 | RES-FH | Cyclic GMP (cGMP) is an intracellular messenger that mediates the action of nitric oxide, which is increasingly produced by leukocytes during | |
| Chemokine signaling pathway | bta04062 | RES-GIN | Pathway was associated with resistance to gastrointestinal nematode infections in Angus cattle [ | |
| RES-FH | Pathway was associated with | |||
| C-type lectin receptor signaling pathway | bta04625 | RES-GIN | C-type lectin receptors are involved in innate and adaptive immunity to pathogens [ | |
| Cytokine–cytokine receptor interaction | bta04060 | RES-GIN | Pathway was associated with resistance to GIN infections in Angus cattle [ | |
| RES-DV | ||||
| RES-FH | ||||
| Estrogen signaling pathway | bta04915 | RES-GIN | Increase in reproduction rate of helminths as a result of increasing metabolism of 17-ß-estradiol in the host [ | |
| RES-FH | ||||
| JAK-STAT signaling pathway | bta04630 | RES-FH | ||
| RES-DV | JAK-STAT pathway is the principal signaling mechanism for cytokines involved in | |||
| Leukocyte transendothelial migration | bta04670 | RES-FH | Pathway was associated with | |
| SCS | Identification of the Leukocyte transendothelial migration pathway for | |||
| MAPK signaling pathway | bta04010 | FPR | Pathway associated with milk fat traits in dairy cattle [ | |
| Natural killer cell mediated cytotoxicity | bta04650 | RES-FH | Natural killer cells are lymphocytes of the innate immune response involved in host defense against infections with parasites [ | |
| Neuroactive ligand-receptor interaction | bta04080 | FPR | Pathway associated with milk fat traits in dairy cattle [ | |
| NF-kappa B signaling pathway | bta04064 | RES-GIN | Family of transcription factors regulating genes involved in immunity [ | |
| NOD-like receptor signaling pathway | bta04621 | RES-FH | Family of pattern recognition receptors responsible for various pathogens and generating innate immune response [ | |
| Phospholipase D signaling pathway | bta04072 | FPR | Phospholipase D is an essential enzyme for the production of phosphatidic acid, a key intermediate in milk fat synthesis during lactation [ | |
| PI3K-Akt signaling pathway | bta04151 | RES-GIN | Pathway has important functions in cellular immune response [ | |
| RES-FH | ||||
| RES-DV | ||||
| FPR | Pathway associated with milk fat traits in dairy cattle [ | |||
| Rap 1 signaling pathway | bta04015 | FPR | Pathway associated with milk fat traits in dairy cattle [ | |
| Ras signaling pathway | bta04014 | FPR | Pathway associated with milk fat traits in dairy cattle [ | |
| T cell receptor signaling pathway | bta04660 | RES-GIN | Involvement of B cells in immune response to gastrointestinal nematodes in ruminants [ | |
| TGF-β signaling pathway | bta04350 | RES-GIN | TGF-ß involved in host immune response during | |
| RES-FH | ||||
| RES-DV | ||||
| Th17 cell differentiation | bta04659 | RES-FH |
1 RES-GIN = residuals for fecal egg counts of gastrointestinal nematodes; RES-FH = residuals for fecal egg counts of liver flukes (Fasciola hepatica); RES-DV = residuals for fecal larvae counts of bovine lungworms (Dictyocaulus viviparus).
Figure 3(A) Manhattan plot for −log10 p-values of marker effects for residuals of fecal egg counts of gastrointestinal nematodes (RES-GIN). Markers above the genome-wide significance threshold pBonf (red dashed line) are highlighted in red and markers above the suggestive significance threshold pSug (blue dashed line) are highlighted in blue. Markers within a distance of 125 kb up- and downstream of the significantly or suggestively associated SVs are highlighted in red or blue, respectively. (B) Regional association plot for the α kinase 2 (ALPK2) gene on BTA 24 with corresponding flanking regions (+/−100,000 bp). The SNP rs437851546 (blue square) was the highest associated SNP (p = 6.63 × 10−9) for RES-GIN on BTA 24. Circles show GWAS p-values, with different colors indicating linkage disequilibrium (LD): red: LD 0.8 to 1.0, orange: LD 0.5 to 0.8. yellow: LD 0.2 to 0.5, gray: LD 0.0 to 0.2. Genes with a green arrow pointing to the right are located on the forward strand, genes with an arrow pointing to the left are located on the reverse strand.
Potential candidate genes (sorted alphabetically) related to the identified sequence variants (SVs) associated with residuals for fecal egg counts of gastrointestinal nematodes (RES-GIN), residuals for fecal egg counts of liver flukes (RES-FH), and residuals for fecal larvae counts of bovine lungworms (RES-DV) in DSN. Genes with at least one associated variant in the respective gene are marked in bold. A gene was considered a candidate gene if at least one SV above pSug was positioned in the respective gene and/or within 100 kb up- and downstream of the gene.
| BTA | RES-GIN | RES-FH | RES-DV |
|---|---|---|---|
| 1 |
|
| - |
| 2 | |||
| 3 |
|
| |
| 4 |
|
| |
| 5 |
|
| |
| 6 |
|
| |
| 7 |
|
| |
| 8 |
| ||
| 9 |
| ||
| 10 |
| ||
| 11 |
| ||
| 13 |
|
| |
| 14 |
|
|
|
| 15 |
|
| |
| 16 |
|
| |
| 17 |
|
| |
| 18 |
|
| |
| 19 |
| ||
| 20 |
|
| |
| 21 |
|
| |
| 23 | |||
| 24 | |||
| 26 |
| ||
| 27 |
|
| |
| 28 |
|
| |
| 29 |
|
* Genes including at least one associated variant above the genome-wide significance threshold (pBonf = 1.22 × 10−7).
Figure 4(A) Manhattan plot for −log10 p-values of marker effects for residuals of fecal egg counts of Fasciola hepatica (RES-FH); markers above the genome-wide significance threshold pBonf (red dashed line) are highlighted in red and markers above the suggestive significance threshold pSug (blue dashed line) are highlighted in blue. Markers within a distance of 125 kb up- and downstream of significantly or suggestively associated SVs are highlighted in red or blue, respectively. (B) Regional association plot for the fibroblast growth factor receptor 1 (FGFR1) gene on BTA 27 with corresponding flanking regions (+/−100,000 bp). The SNP rs380619570 (blue square) was the highest associated SNP (p = 4.48 × 10−8) for RES-FH on BTA 27. Circles show GWAS p-values, with different colors indicating linkage disequilibrium (LD): red: LD 0.8 to 1.0, orange: LD 0.5 to 0.8. yellow: LD 0.2 to 0.5, gray: LD 0.0 to 0.2. Genes with a green arrow pointing to the right are located on the forward strand, genes with an arrow pointing to the left are located on the reverse strand.
Figure 5(A) Manhattan plot for −log10 p-values of marker effects for residuals for fecal larvae counts of Dictyocaulus viviparus (RES-DV); markers above the genome-wide significance threshold pBonf (red dashed line) are highlighted in red and markers above the suggestive significance threshold pSug (blue dashed line) are highlighted in blue. Markers within a distance of 125 kb up- and downstream of the significantly or suggestively associated SVs are highlighted in red or blue, respectively. (B) Regional association plot for the docking protein 6 (DOK6) gene on BTA 24 with corresponding flanking regions (+/−100,000 bp). The SNP rs384171524 (blue square) was the highest associated SNP (p = 1.04 × 10−14) for RES-DV. Circles show GWAS p-values, with different colors indicating linkage disequilibrium (LD): red: LD 0.8 to 1.0, orange: LD 0.5 to 0.8. yellow: LD 0.2 to 0.5, gray: LD 0.0 to 0.2.