| Literature DB >> 28358110 |
Lingzhao Fang1,2, Goutam Sahana1, Guosheng Su1, Ying Yu2, Shengli Zhang2, Mogens Sandø Lund1, Peter Sørensen1.
Abstract
Connecting genome-wide association study (GWAS) to biological mechanisms underlying complex traits is a major challenge. Mastitis resistance and milk production are complex traits of economic importance in the dairy sector and are associated with intra-mammary infection (IMI). Here, we integrated IMI-relevant RNA-Seq data from Holstein cattle and sequence-based GWAS data from three dairy cattle breeds (i.e., Holstein, Nordic red cattle, and Jersey) to explore the genetic basis of mastitis resistance and milk production using post-GWAS analyses and a genomic feature linear mixed model. At 24 h post-IMI, genes responsive to IMI in the mammary gland were preferentially enriched for genetic variants associated with mastitis resistance rather than milk production. Response genes in the liver were mainly enriched for variants associated with mastitis resistance at an early time point (3 h) post-IMI, whereas responsive genes at later stages were enriched for associated variants with milk production. The up- and down-regulated genes were enriched for associated variants with mastitis resistance and milk production, respectively. The patterns were consistent across breeds, indicating that different breeds shared similarities in the genetic basis of these traits. Our approaches provide a framework for integrating multiple layers of data to understand the genetic architecture underlying complex traits.Entities:
Mesh:
Year: 2017 PMID: 28358110 PMCID: PMC5372096 DOI: 10.1038/srep45560
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the top SNP on each chromosome with genome-wide significance determined by single-marker genome-wide association analyses (GWAS) of each trait in three cattle breeds.
| Trait | Breed | Chromosome | Position of the top SNP (bp) | ||
|---|---|---|---|---|---|
| Mastitis resistance | HOL | 6 | 88,840,407 | 5.55e-19 | 4.1 |
| 13 | 59,260,175 | 2.09e-11 | 2.0 | ||
| 18 | 43,909,571 | 2.44e-10 | 1.9 | ||
| 23 | 11,477,969 | 1.10e-13 | 1.3 | ||
| 25 | 35,353,527 | 2.89e-10 | 0.4 | ||
| RDC | 6 | 88,723,742 | 2.31e-10 | 2.3 | |
| 24 | 60,959,835 | 3.72e-10 | 4.5 | ||
| Protein yield | HOL | 6 | 88,478,678 | 9.79e-10 | 1.5 |
| 14 | 1,892,784 | 6.93e-22 | 2.9 | ||
| 18 | 57,015,407 | 4.23e-11 | 1.2 | ||
| 23 | 10,504,197 | 4.80e-11 | 5.0 | ||
| 25 | 36,403,719 | 8.36e-14 | 0.9 | ||
| 26 | 41,231,611 | 4.72e-19 | 1.4 | ||
| 28 | 10,749,791 | 1.91e-09 | 1.9 | ||
| 29 | 12,741,604 | 3.38e-10 | 2.6 | ||
| RDC | 5 | 112,450,860 | 1.32e-09 | 1.1 | |
| 14 | 1,802,667 | 3.02e-09 | 1.1 | ||
| 23 | 8,581,891 | 1.22e-09 | 0.8 | ||
| 25 | 3,498,960 | 1.98e-12 | 1.0 | ||
| 26 | 10,268,885 | 4.48e-10 | 4.4 | ||
| Fat yield | HOL | 5 | 93,945,991 | 8.95e-26 | 1.8 |
| 14 | 1,810,124 | 3.64e-132 | 18.3 | ||
| 26 | 20,547,445 | 2.06e-22 | 1.5 | ||
| 29 | 17,696,734 | 1.82e-10 | 0.7 | ||
| RDC | 5 | 93,945,694 | 3.23e-28 | 2.6 | |
| 14 | 1,807,140 | 9.80e-43 | 6.3 | ||
| 23 | 28,567,796 | 4.35e-10 | 0.8 | ||
| 25 | 9,870,005 | 3.98e-15 | 1.3 | ||
| 26 | 24,379,571 | 5.20e-15 | 1.6 | ||
| JER | 14 | 1,802,667 | 9.36e-15 | 3.1 | |
| Milk yield | HOL | 5 | 93,944,849 | 8.79e-16 | 2.2 |
| 14 | 1,825,125 | 6.16e-86 | 13.9 | ||
| 20 | 29,996,727 | 1.79e-12 | 2.3 | ||
| 23 | 17,821,120 | 1.55e-09 | 4.3 | ||
| 26 | 37,869,380 | 3.80e-15 | 1.2 | ||
| RDC | 5 | 112,343,204 | 2.00e-09 | 1.1 | |
| 14 | 1,743,939 | 9.75e-34 | 7.2 | ||
| 16 | 1,322,611 | 2.63e-09 | 0.95 | ||
| 19 | 61,449,096 | 1.19e-09 | 0.55 | ||
| 20 | 31,909,478 | 2.78e-16 | 3.2 | ||
| 25 | 3,498,960 | 4.83e-10 | 0.82 | ||
| JER | 14 | 1,828,456 | 1.21e-21 | 2.8 | |
| 20 | 33,922,713 | 3.38e-09 | 1.1 |
(%) represents the proportion of genomic variance explained by the top SNP. HOL, Nordic Holstein; RDC, Nordic red; JER, Jersey.
The number of genes responsive to intra-mammary infection in each of 48 genomic features defined at six FDR cut-offs (i.e. 0.05, 0.01, 1e-3, 1e-6, 1e-8, 1e-10) in eight experimental comparisons.
| Tissue | Pathogenic factor | Comparison (h) | No.0.05 | No.0.01 | No.1e-3 | No.1e-6 | No.1e-8 | No.1e-10 |
|---|---|---|---|---|---|---|---|---|
| Liver | LPS | 3 | 1163 | 826 | 583 | 289 | 192 | 139 |
| Liver | LPS | 6 | 7888 | 6800 | 5422 | 3047 | 2172 | 1638 |
| Liver | LPS | 9 | 8014 | 6859 | 5559 | 3312 | 2504 | 1923 |
| Liver | LPS | 12 | 6998 | 5809 | 4430 | 2402 | 1768 | 1319 |
| Liver | LPS | 48 | 483 | 240 | 128 | 48 | 32 | 25 |
| Liver | 12 | 4183 | 2835 | 1576 | 390 | 199 | 102 | |
| Liver | 24 | 4650 | 3537 | 2228 | 837 | 525 | 316 | |
| Mammary | 24i | 2308 | 1533 | 996 | 365 | 227 | 146 |
24i vs. 24c: comparison between infected mammary glands and controls at 24 h post IMI with E. coli.
Figure 1Post-GWAS analysis results for the 48 genomic features identified from RNA-Seq based on six FDR cut-offs in Holstein (HOL) and Nordic red cattle (RDC).
–log10P indicates the –log10-transformed P-values from the post-GWAS analysis. r is the Pearson correlation of the –log10P of genomic features between HOL and RDC. P is the significance for the Pearson correlation test. Each point is one of the 48 genomic features. (A) post-GWAS results in HOL, (B) post-GWAS results in RDC, (C) Pearson correlation for mastitis resistance between HOL and RDC, (D) Pearson correlation for milk yield, (E) Pearson correlation for fat yield, (F) Pearson correlation for protein yield.
Figure 2Tissue differences in the enrichment of association signals for mastitis resistance and milk production traits in Holstein (HOL) and Nordic red cattle (RDC).
The genomic features were identified using six FDR cut-offs (i.e., 0.05, 0.01, 1e-3, 1e-6, 1e-8, 1e-10) from the mammary gland data of 24i vs. 24c and the liver data of 24 h vs. −144 h after IMI with E. coli, respectively. (A and C) are the analyses conducted in HOL and RDC, respectively, using genomic features defined from the mammary gland data of 24i vs. 24c. (B and D) are the analysis conducted in HOL and RDC, respectively, using genomic features defined from the liver data of 24 h vs. −144 h. Student’s t-test (paired) was used to test the significance of differences. n.s represents P ≥ 0.1, *represents P < 0.05, **represents P < 0.01.
Figure 3Dynamic impact on the hepatic transcriptome during intra-mammary infection (IMI) with LPS in Holstein (HOL) and Nordic red cattle (RDC).
The genomic features identified in the liver data of 3 h vs. −22 h, 9 h vs. −22 h, and 48 h vs. −22 h comparisons post-IMI (pi) with LPS based on six cut-offs (i.e., 0.05, 0.01, 1e-3, 1e-6, 1e-8, 1e-10) were used for analyses. (A and D) are the analyses conducted in HOL and RDC, respectively, using genomic features defined from the liver data of 3 h vs. −22 h. B and E are the analyses conducted in HOL and RDC, respectively, using genomic features defined from the liver data of 9 h vs. −22 h. (C and F) are the analyses conducted in HOL and RDC, respectively, using genomic features defined from the liver data of 48 h vs. −22 h. Student’s t-test (paired) was used to test the significance of differences. n.s represents P ≥ 0.1, o represents P < 0.1, *represents P < 0.05, **represents P < 0.01.
Figure 4Differences in up- and down-regulated genes in the enrichment of association signals for mastitis resistance and milk production traits in Holstein (HOL) and Nordic red cattle (RDC).
(A and B) are the analyses conducted in HOL and RDC, respectively, using the genomic features identified based on a log2 (fold-change) > 2 and six different FDR cut-offs (i.e., 0.05, 0.01, 1e-3, 1e-6, 1e-8, 1e-10). (C and D) are the analyses conducted in HOL and RDC, respectively, using the genomic features identified based on a log2 (fold-change) < −2 and six different FDR cut-offs. Student’s t-test (paired) was used to test the significance of differences. n.s represents P ≥ 0.1, *represents P < 0.05, **represents P < 0.01.
Figure 5Genome feature linear mixed model (GFLM) analyses for the top genomic features in mastitis resistance and milk production traits.
The point size represents the explained proportion of genomic variance by the feature (H2%); the colour represents the P-values of the feature from post-GWAS analyses. (A) the top genomic feature (FDR < 1e-6, log2(fold-change) > 1) in mastitis resistance; (B) the top genomic feature (FDR < 0.01, log2(fold-change) < −1) in milk and fat yield; (C) the top genomic feature (FDR < 1e-3, log2(fold-change) > 2) in protein yield.
Figure 6Scatterplots of enriched (FDR < 0.05) Gene Ontology (GO) terms from the functional enrichment analysis of top genomic features in mastitis resistance and milk production traits.
The scatterplots show the enriched GO terms in a two-dimensional space (x and y coordinates) derived using multidimensional scaling to the matrix of semantic similarities of the GO terms. The closeness of the GO terms on a plot indicates their closeness in the GO graph structure (i.e., semantic similarity). The point size indicates the frequency of the GO term in the EBI GOA database; the colours represent the FDR values of GO terms from the functional enrichment analysis of genes in the top genomic feature using KOBAS2.0. (A) the top genomic feature (FDR < 1e-6, log2(fold-change) > 1) in mastitis resistance; (B) the top genomic feature (FDR < 0.01, log2(fold-change) < −1) in milk and fat yield; (C) the top genomic feature (FDR < 1e-3, log2(fold-change) > 2) in protein yield.
The top three enriched (FDR < 0.05) Gene Ontology (GO) terms relevant to the immune response detected by a functional enrichment analysis of the top genomic feature (FDR < 1e-6, log2(fold-change) > 1) in Holstein (HOL) mastitis resistance.
| GO term | GO ID | FDR | Gene |
|---|---|---|---|
| Cellular response to biotic stimulus | GO:0071216 | 2.79E-3 | |
| Response to lipopolysaccharide | GO:0032496 | 4.29E-3 | |
| Inflammatory response | GO:0006954 | 4.30E-3 |
The three enriched (FDR < 0.05) Gene Ontology (GO) terms relevant to metabolism processes detected by a functional enrichment analysis of the top genomic feature (FDR < 0.01, log2(fold-change) < −1) in Holstein (HOL) milk and fat yield.
| GO term | GO ID | FDR | Gene |
|---|---|---|---|
| long-chain fatty acid metabolic process | GO:0001676 | 2.06E-3 | |
| bile acid biosynthetic process | GO:0006699 | 7.69E-3 | |
| lipid metabolic process | GO:0006629 | 8.10E-3 |
The top six enriched (FDR < 0.05) Gene Ontology (GO) terms relevant to metabolism processes and the immune response detected by a functional enrichment analysis of the top genomic feature (FDR < 1e-3, log2(fold-change) > 2) in Holstein (HOL) protein yield.
| GO term | GO ID | FDR | Gene |
|---|---|---|---|
| negative regulation of cellular protein metabolic process | GO:0032269 | 1.51E-5 | |
| regulation of endopeptidase activity | GO:0052548 | 2.50E-5 | |
| enzyme inhibitor activity | GO:0004857 | 3.02E-5 | |
| inflammatory response | GO:0006954 | 4.78E-7 | |
| detoxification | GO:0098754 | 2.42E-6 | |
| defense response | GO:0006952 | 3.22E-6 |