| Literature DB >> 30185830 |
Duy N Do1,2, Flavio S Schenkel3, Filippo Miglior3,4, Xin Zhao5, Eveline M Ibeagha-Awemu6.
Abstract
This study aimed to identify single nucleotide polymorphisms (SNPs) associated with milk cholesterol (CHL) content via a genome wide association study (GWAS). Milk CHL content was determined by gas chromatography and expressed as mg of CHL in 100 g of fat (CHL_fat) or in 100 mg of milk (CHL_milk). GWAS was performed with 1,183 cows and 40,196 SNPs using a univariate linear mixed model. Two and 20 SNPs were significantly associated with CHL_fat and CHL_milk, respectively. The important regions for CHL_fat and CHL_milk were at 41.9 Mb on chromosome (BTA) 17 and 1.6-3.2 Mb on BTA 14, respectively. DGAT1, PTPN1, INSIG1, HEXIM1, SDS, and HTR5A genes, also known to be associated with human plasma CHL phenotypes, were identified as potential candidate genes for bovine milk CHL. Additional new potential candidate genes for milk CHL were RXFP1, FAM198B, TMEM144, CXXC4, MAML2 and CDH13. Enrichment analyses suggested that identified candidate genes participated in cell-cell signaling processes and are key members in tight junction, focal adhesion, Notch signaling and glycerolipid metabolism pathways. Furthermore, identified transcription factors such as PPARD, LXR, and NOTCH1 might be important in the regulation of bovine milk CHL content. The expression of several positional candidate genes (such as DGAT1, INSIG1 and FAM198B) and their correlation with milk CHL content were further confirmed with RNA sequence data from mammary gland tissues. This is the first GWAS on bovine milk CHL. The identified markers and candidate genes need further validation in a larger cohort for use in the selection of cows with desired milk CHL content.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30185830 PMCID: PMC6125589 DOI: 10.1038/s41598-018-31427-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Genome-wide significant SNPs for milk cholesterol content.
| Traita | SNP ID | BTAb | Positionc | Alleles | MAFd | rs# | Allele_sube | p-value | Consequencef | Gene (nearby gene)g |
|---|---|---|---|---|---|---|---|---|---|---|
| CHL_fat | Hapmap40322-BTA-100742 | 17 | 41965769 | G/T | 0 .340 | rs41600454 | 11.29 | 4.26E-05 | intergenic | (FAM198B) |
| CHL_fat | BTB-01524761 | 17 | 41939826 | C/T | 0.336 | rs42640895 | −11.66 | 2.61E-05 | intergenic |
|
| CHL_milk | Hapmap30383-BTC-005848 | 14 | 1489496 | A/G | 0.423 | rs109752439 | 0.85 | 1.80E-11 | downstream |
|
| CHL_milk | ARS-BFGL-NGS-18858 | 14 | 2909929 | A/G | 0.450 | rs109558046 | 0.71 | 1.76E-08 | intergenic |
|
| CHL_milk | Hapmap30646-BTC-002054 | 14 | 2553525 | C/T | 0.356 | rs110060785 | 0.66 | 1.24E-06 | intergenic |
|
| CHL_milk | ARS-BFGL-NGS-41837 | 6 | 22129886 | C/T | 0.212 | rs110597360 | 0.63 | 4.14E-05 | intergenic |
|
| CHL_milk | ARS-BFGL-NGS-18365 | 14 | 2117455 | C/T | 0.250 | rs110892754 | −0.67 | 2.68E-06 | intergenic |
|
| CHL_milk | Hapmap36620-SCAFFOLD50018_7571 | 14 | 3297177 | C/T | 0.495 | rs29024688 | 0.58 | 8.37E-06 | intergenic |
|
| CHL_milk | Hapmap38637-BTA-88156 | 15 | 13964124 | G/T | 0.450 | rs41596665 | −0.54 | 2.86E-05 | intergenic |
|
| CHL_milk | ARS-BFGL-NGS-4939 | 14 | 1801116 | A/G | 0.336 | rs109421300 | −1.17 | 6.70E-19 | intron |
|
| CHL_milk | Hapmap30374-BTC-002159 | 14 | 2468020 | A/G | 0.490 | rs109529219 | 0.59 | 7.02E-06 | intron |
|
| CHL_milk | ARS-BFGL-NGS-34135 | 14 | 1675278 | A/G | 0.491 | rs109968515 | −0.66 | 2.34E-07 | intron |
|
| CHL_milk | Hapmap30086-BTC-002066 | 14 | 2524432 | A/G | 0.406 | rs110199901 | 0.77 | 5.14E-09 | intron |
|
| CHL_milk | ARS-BFGL-NGS-94706 | 14 | 1696470 | A/C | 0.493 | rs17870736 | −0.70 | 4.27E-08 | intron |
|
| CHL_milk | Hapmap52830-rs29014800 | 17 | 63541690 | A/G | 0.403 | rs29014800 | −0.57 | 1.58E-05 | intron |
|
| CHL_milk | Hapmap39330-BTA-42256 | 18 | 9797478 | A/C | 0.388 | rs41605812 | −0.54 | 3.63E-05 | intron |
|
| CHL_milk | Hapmap30922-BTC-002021 | 14 | 2138926 | C/T | 0.240 | rs110749653 | −0.64 | 1.12E-05 | non_coding_transcript_exon |
|
| CHL_milk | Hapmap52798-ss46526455 | 14 | 1923292 | A/G | 0.396 | rs41256919 | −0.62 | 1.08E-06 | synonymous |
|
| CHL_milk | ARS-BFGL-NGS-57820 | 14 | 1651311 | C/T | 0.340 | rs109146371 | −1.15 | 2.42E-18 | upstream |
|
| CHL_milk | ARS-BFGL-NGS-107379 | 14 | 2054457 | A/G | 0.372 | rs109350371 | −0.94 | 4.06E-13 | upstream |
|
| CHL_milk | BTA-35941-no-rs | 14 | 2276443 | G/T | 0.498 | rs41627764 | −0.64 | 1.03E-06 | upstream |
|
| CHL_milk | UA-IFASA-6878 | 14 | 2002873 | C/T | 0.419 | rs41629750 | −0.62 | 9.06E-07 | upstream |
|
aCHL_fat: mg of cholesterol in 100 g of fat, CHL_milk: mg of cholesterol in 100 g of milk. bBos taurus autosome. cSNP position on the UMD3.1 assembly in base pairs. dMinor allele frequency. eAllelic substitution effect. fSNP consequence obtained from Variant effect predictor (http://www.ensembl.org/Tools/VEP). gGene or nearest gene to the corresponding SNP (obtained from Ensembl gene database: http://www.ensembl.org/Bos_taurus/Info/Index.
Figure 1Manhattan plot of genome-wide significant (p < 5E-05) and suggestive (p < 5E-04) SNP associations for milk cholesterol content in Canadian Holstein cows. The most significant SNPs with their corresponding p-values are indicated. CHL_fat: mg of cholesterol in 100 gram of fat, CHL_milk: mg of cholesterol in 100 gram of milk.
Figure 2Linkage disequilibrium (LD) pattern on a 1.4–3.4 Mb region of BTA 14. LD blocks are marked with triangles; values in boxes are LD (squared correlation coefficient, r2) between SNP pairs; red boxes indicate LOD > 2 and D′ = 1 (LOD is the log of the likelihood odds ratio, a measure of confidence in the value of D′, where D′ is the ratio of the linkage disequilibrium coefficient D to its maximum possible).
Gene ontology and pathways enriched for positional candidate genes of CHL_fata.
| Categoryb | Names | Number of genes | p-value |
|---|---|---|---|
| GO_BP | Negative regulation of cyclin-dependent protein kinase activity | 2 | 0.001 |
| GO_BP | Cell-cell signaling | 5 | 0.001 |
| GO_BP | Cell communication | 6 | 0.004 |
| GO_BP | Regulation of cyclin-dependent protein kinase activity | 2 | 0.006 |
| GO_BP | Regulation of nervous system development | 3 | 0.007 |
| GO_BP | Organic acid catabolic process | 3 | 0.008 |
| GO_BP | Carboxylic acid catabolic process | 3 | 0.008 |
| GO_BP | Regulation of adenylate cyclase activity | 2 | 0.009 |
| GO_BP | G-protein signaling, coupled to cAMP nucleotide second messenger | 2 | 0.009 |
| GO_BP | G-protein signaling, coupled to cyclic nucleotide second messenger | 2 | 0.009 |
| GO_BP | cAMP-mediated signaling | 2 | 0.010 |
| GO_CC | Basolateral plasma membrane | 3 | 0.007 |
| GO_MF | Cyclin-dependent protein kinase regulator activity | 3 | 1.10E-04 |
| GO_MF | snRNA binding | 2 | 4.80E-04 |
| GO_MF | Cyclin-dependent protein kinase inhibitor activity | 2 | 0.001 |
| GO_MF | Protein serine/threonine kinase inhibitor activity | 2 | 0.003 |
| GO_MF | Protein kinase regulator activity | 3 | 0.003 |
| GO_MF | Kinase regulator activity | 3 | 0.005 |
| GO_MF | Protein kinase inhibitor activity | 2 | 0.006 |
| GO_MF | Kinase inhibitor activity | 2 | 0.008 |
| KEGG | Neuroactive ligand-receptor interaction | 5 | 0.015 |
| KEGG | Focal adhesion | 4 | 0.026 |
| KEGG | Leukocyte transendothelial migration | 3 | 0.032 |
| KEGG | Tight junction | 3 | 0.040 |
| KEGG | Basal cell carcinoma | 2 | 0.043 |
aCHL_fat: mg of cholesterol in 100 g of fat. Only gene ontologies with p-values < 0.01 are shown.
bGO_BP: Biological processes gene ontology term, GO_CC: Cellular component gene ontology term and GO_MF: Molecular function gene ontology term.
Gene ontology and pathways enriched for potential candidate genes of CHL_milka.
| Categoryb | Names | Number of genes | p-value |
|---|---|---|---|
| GO_BP | Cardiac muscle tissue development | 4 | 1.00E-04 |
| GO_BP | Positive regulation of cell-matrix adhesion | 2 | 4.30E-04 |
| GO_BP | Heart development | 5 | 0.001 |
| GO_BP | Negative regulation of protein ubiquitination | 2 | 0.002 |
| GO_BP | Striated muscle tissue development | 4 | 0.002 |
| GO_BP | Muscle tissue development | 4 | 0.003 |
| GO_BP | Ribosome biogenesis | 4 | 0.003 |
| GO_BP | Ventricular cardiac muscle morphogenesis | 2 | 0.003 |
| GO_BP | Regulation of cell-matrix adhesion | 2 | 0.005 |
| GO_BP | Cardiac muscle cell differentiation | 2 | 0.005 |
| GO_BP | Negative regulation of translation | 2 | 0.005 |
| GO_BP | Cardiac muscle tissue morphogenesis | 2 | 0.005 |
| GO_BP | Muscle tissue morphogenesis | 2 | 0.005 |
| GO_BP | Cardiac cell differentiation | 2 | 0.005 |
| GO_BP | Ribonucleoprotein complex biogenesis | 4 | 0.006 |
| GO_BP | Nuclear-transcribed mRNA catabolic process, nonsense-mediated decay | 2 | 0.006 |
| GO_BP | Muscle organ development | 4 | 0.006 |
| GO_BP | rRNA processing | 3 | 0.008 |
| GO_BP | Negative regulation of cellular protein metabolic process | 3 | 0.008 |
| GO_BP | rRNA metabolic process | 3 | 0.008 |
| GO_BP | Regulation of protein ubiquitination | 2 | 0.008 |
| GO_BP | Negative regulation of protein metabolic process | 3 | 0.009 |
| GO_BP | Regulation of macromolecule metabolic process | 21 | 0.009 |
| GO_BP | Notch signaling pathway | 2 | 0.009 |
| GO_BP | Regulation of cell proliferation | 7 | 0.009 |
| GO_BP | Negative regulation of cellular process | 11 | 0.009 |
| GO_BP | Anatomical structure formation involved in morphogenesis | 5 | 0.010 |
| GO_CC | Anchored to membrane | 5 | 0.001 |
| GO_CC | Intracellular | 66 | 0.006 |
| GO_MF | Interleukin-2 receptor binding | 2 | 8.60E-05 |
| GO_MF | ATP-dependent helicase activity | 5 | 2.50E-04 |
| GO_MF | Purine NTP-dependent helicase activity | 5 | 2.50E-04 |
| GO_MF | Nucleic acid binding | 31 | 0.002 |
| GO_MF | Helicase activity | 5 | 0.002 |
| GO_MF | ATPase activity, coupled | 6 | 0.006 |
| GO_MF | 3′–5′ exonuclease activity | 2 | 0.006 |
| KEGG | Glycerolipid metabolism | 2 | 0.043 |
| KEGG | Notch signaling pathway | 2 | 0.045 |
aCHL_milk: mg of cholesterol in 100 g of milk. Only gene ontologies with p-values < 0.01 are shown.
bGO_BP: Biological processes gene ontology term, GO_CC: Cellular component gene ontology term and GO_MF: Molecular function gene ontology term.
Figure 3Protein-protein interaction network created using the STRING database for CHL_fat positional candidate genes. Network analysis was set at medium confidence (STRING score = 0.4). The line widths represent the level of interactions (wider lines represent stronger evidence of interactions). CHL_fat: mg of cholesterol in 100 gram of fat.
Figure 4Protein-protein interaction network created using the STRING database for CHL_milk positional candidate genes. Network analysis was set at medium confidence (STRING score = 0.4). Line widths represent the level of interactions (wider lines represent stronger evidence of interactions). CHL_milk: mg of cholesterol in 100 gram of milk.
Significantly enriched transcription factors for positional candidate genes for CHL_fat and CHL_milk.
| Traita | Transcription factor | Overlap | p-value |
|---|---|---|---|
| CHL_fat |
| 40/3057 | 0.002 |
| CHL_fat |
| 11/516 | 0.004 |
| CHL_fat |
| 9/382 | 0.005 |
| CHL_fat |
| 14/797 | 0.006 |
| CHL_fat |
| 16/1000 | 0.009 |
| CHL_fat |
| 9/459 | 0.014 |
| CHL_fat |
| 8/407 | 0.020 |
| CHL_fat |
| 13/840 | 0.022 |
| CHL_fat |
| 19/1404 | 0.023 |
| CHL_fat |
| 25/2000 | 0.023 |
| CHL_fat |
| 25/2000 | 0.023 |
| CHL_fat |
| 25/2000 | 0.023 |
| CHL_fat |
| 24/1904 | 0.024 |
| CHL_fat |
| 23/1875 | 0.035 |
| CHL_fat |
| 57/5578 | 0.036 |
| CHL_fat |
| 26/2189 | 0.036 |
| CHL_fat |
| 14/1001 | 0.038 |
| CHL_fat |
| 24/2000 | 0.039 |
| CHL_fat |
| 13/932 | 0.045 |
| CHL_fat |
| 9/573 | 0.048 |
| CHL_milk |
| 60/2000 | 1.00E-11 |
| CHL_milk | DACH1 | 46/1698 | 1.00E-07 |
| CHL_milk |
| 51/2000 | 1.19E-07 |
| CHL_milk |
| 39/2000 | 0.001 |
| CHL_milk |
| 39/2000 | 0.001 |
| CHL_milk |
| 11/327 | 0.002 |
| CHL_milk |
| 38/2000 | 0.003 |
| CHL_milk |
| 20/935 | 0.008 |
| CHL_milk |
| 91/6207 | 0.010 |
| CHL_milk |
| 35/2000 | 0.013 |
| CHL_milk |
| 7/234 | 0.022 |
| CHL_milk |
| 7/245 | 0.028 |
| CHL_milk |
| 8/321 | 0.039 |
| CHL_milk |
| 19/1029 | 0.039 |
| CHL_milk |
| 14/695 | 0.039 |
| CHL_milk |
| 12/574 | 0.043 |
aCHL_fat: mg of cholesterol in 100 g of fat, CHL_milk: mg of cholesterol in 100 g of milk.
Positional candidate genes for milk cholesterol which are expressed in mammary gland tissues and also significantly correlated to cholesterol concentration in milk fat (CHL_fat)a of the same cows.
| Ensembl Geneb | Gene symbol | Total read counts | cor_CHL_fatc | p_cor_CHL_fat |
|---|---|---|---|---|
|
|
| 3115 | −0.893 | 0.001 |
|
|
| 974 | −0.892 | 0.001 |
|
|
| 1331 | −0.830 | 0.006 |
|
|
| 620 | −0.828 | 0.006 |
|
|
| 11237 | −0.827 | 0.006 |
|
|
| 1190 | −0.822 | 0.007 |
|
|
| 1609 | −0.815 | 0.007 |
|
|
| 2746 | −0.804 | 0.009 |
|
|
| 2049 | −0.802 | 0.009 |
|
|
| 1250 | −0.790 | 0.011 |
|
|
| 1309 | −0.767 | 0.016 |
|
|
| 242 | −0.767 | 0.016 |
|
|
| 3025 | −0.766 | 0.016 |
|
|
| 2544 | −0.763 | 0.017 |
|
|
| 4692 | −0.763 | 0.017 |
|
|
| 509 | −0.763 | 0.017 |
|
|
| 4 | 0.760 | 0.017 |
|
|
| 3333 | −0.758 | 0.018 |
|
|
| 1968 | −0.757 | 0.018 |
|
|
| 4564 | −0.753 | 0.019 |
|
|
| 545 | −0.750 | 0.020 |
|
|
| 20 | 0.749 | 0.020 |
|
|
| 456 | −0.745 | 0.021 |
|
|
| 921 | −0.742 | 0.022 |
|
|
| 196 | −0.741 | 0.022 |
|
|
| 1583 | −0.720 | 0.029 |
|
|
| 6735 | −0.717 | 0.030 |
|
|
| 2232 | −0.714 | 0.031 |
|
|
| 1089 | −0.709 | 0.033 |
|
|
| 4740 | −0.704 | 0.034 |
|
|
| 3270 | −0.704 | 0.034 |
|
|
| 6186 | −0.703 | 0.035 |
|
|
| 1418 | −0.695 | 0.038 |
|
|
| 386 | −0.694 | 0.038 |
|
|
| 20172 | −0.687 | 0.041 |
|
|
| 2763 | −0.683 | 0.042 |
|
|
| 9582 | −0.683 | 0.042 |
|
|
| 4120 | −0.683 | 0.042 |
|
|
| 4821 | −0.681 | 0.043 |
|
|
| 62 | −0.679 | 0.044 |
|
|
| 2635 | −0.679 | 0.044 |
|
|
| 1496 | −0.679 | 0.044 |
|
|
| 19107 | −0.677 | 0.045 |
|
|
| 21 | −0.671 | 0.048 |
|
|
| 34529 | −0.669 | 0.049 |
aCHL_fat: mg of cholesterol in 100 g of fat, CHL_milk: mg of cholesterol in 100 g of milk.
bGenes in bold face are also positional candidate genes for CHL_milk.
cPearson correlation coefficient.
Positional candidate genes for milk cholesterol which are expressed in mammary gland tissues and also significantly correlated to cholesterol concentration in milk (CHL_milk)a of the same cows.
| Ensembl Geneb | Gene symbol | Total read counts | cor_CHL_milkc | p_cor_CHL_milk |
|---|---|---|---|---|
|
|
| 4 | 0.933 | 2.39E-04 |
|
|
| 634 | −0.923 | 3.84E-04 |
|
|
| 44254 | −0.897 | 0.001 |
|
|
| 974 | −0.897 | 0.001 |
|
|
| 7826 | −0.893 | 0.001 |
|
|
| 1524 | −0.892 | 0.001 |
|
|
| 47 | −0.876 | 0.002 |
|
|
| 2308 | −0.871 | 0.002 |
|
|
| 332 | −0.868 | 0.002 |
|
|
| 845 | −0.863 | 0.003 |
|
|
| 555 | −0.857 | 0.003 |
|
|
| 2659 | −0.847 | 0.004 |
|
|
| 2619 | −0.839 | 0.005 |
|
|
| 4706 | −0.835 | 0.005 |
|
|
| 10394 | −0.829 | 0.006 |
|
|
| 287 | −0.828 | 0.006 |
|
|
| 988 | −0.821 | 0.007 |
|
|
| 498 | −0.819 | 0.007 |
|
|
| 751 | −0.814 | 0.008 |
|
|
| 6064 | −0.813 | 0.008 |
|
|
| 465 | −0.811 | 0.008 |
|
|
| 2290 | −0.809 | 0.008 |
|
|
| 6104 | −0.808 | 0.008 |
|
|
| 1032 | −0.807 | 0.009 |
|
|
| 372 | −0.805 | 0.009 |
|
|
| 1730 | −0.805 | 0.009 |
|
|
| 1331 | −0.802 | 0.009 |
|
|
| 2101 | −0.801 | 0.009 |
|
|
| 2811 | −0.799 | 0.010 |
|
|
| 2179 | −0.799 | 0.010 |
|
|
| 4493 | −0.794 | 0.011 |
|
|
| 309 | −0.792 | 0.011 |
|
|
| 1161 | −0.792 | 0.011 |
|
|
| 2079 | −0.790 | 0.011 |
|
|
| 35493 | −0.786 | 0.012 |
|
|
| 3161 | −0.777 | 0.014 |
|
|
| 1067 | −0.773 | 0.015 |
|
|
| 1850 | −0.759 | 0.018 |
|
|
| 6075 | −0.757 | 0.018 |
|
|
| 607 | −0.757 | 0.018 |
|
|
| 6186 | −0.754 | 0.019 |
|
|
| 163 | −0.753 | 0.019 |
|
|
| 6020 | −0.752 | 0.019 |
|
|
| 5533 | −0.751 | 0.020 |
|
|
| 330 | −0.743 | 0.022 |
|
|
| 2268 | −0.739 | 0.023 |
|
|
| 6558 | −0.735 | 0.024 |
|
|
| 6605 | −0.727 | 0.026 |
|
|
| 7819 | −0.727 | 0.027 |
|
|
| 2136 | −0.727 | 0.027 |
|
|
| 44 | −0.727 | 0.027 |
|
|
| 57636 | −0.724 | 0.028 |
|
|
| 2239 | −0.724 | 0.028 |
|
|
| 20 | 0.721 | 0.028 |
|
|
| 1632 | −0.718 | 0.029 |
|
|
| 16551 | −0.711 | 0.032 |
|
|
| 1067 | −0.710 | 0.032 |
|
|
| 3006 | −0.702 | 0.035 |
|
|
| 8527 | −0.701 | 0.035 |
|
|
| 4420 | −0.697 | 0.037 |
|
|
| 14432 | −0.697 | 0.037 |
|
|
| 13022 | −0.696 | 0.037 |
|
|
| 2653 | −0.695 | 0.038 |
|
|
| 4154 | −0.693 | 0.038 |
|
|
| 1451 | −0.692 | 0.039 |
|
|
| 7 | −0.686 | 0.041 |
|
|
| 7 | −0.686 | 0.041 |
|
|
| 1729 | −0.686 | 0.042 |
|
|
| 4800 | −0.679 | 0.044 |
|
|
| 2406 | −0.675 | 0.046 |
|
|
| 12515 | −0.674 | 0.047 |
|
|
| 17674 | −0.668 | 0.049 |
aCHL_fat: mg of cholesterol in 100 g of fat, CHL_milk: mg of cholesterol in 100 g of milk.
bGenes in bold face are also positional candidate genes for CHL_fat.
cPearson correlation coefficient.