Literature DB >> 30451907

Integrated PTR-ToF-MS, GWAS and biological pathway analyses reveal the contribution of cow's genome to cheese volatilome.

Sara Pegolo1, Matteo Bergamaschi2, Flavia Gasperi3, Franco Biasioli3, Alessio Cecchinato2, Giovanni Bittante2.   

Abstract

Volatile organic compounds (VOCs) are small molecules that contribute to the distinctive flavour of cheese which is an important attribute for consumer acceptability. To investigate whether cow's genetic background might contribute to cheese volatilome, we carried out genome-wide association studies (GWAS) and pathway-based analyses for 173 spectrometric peaks tentatively associated with several VOCs obtained from proton-transfer-reaction mass spectrometry (PTR-ToF-MS) analyses of 1,075 model cheeses produced using raw whole-milk from Brown Swiss cows. Overall, we detected 186 SNPs associated with 120 traits, several of which mapped close to genes involved in protein (e.g. CSN3, GNRHR and FAM169A), fat (e.g. AGPAT3, SCD5, and GPAM) and carbohydrate (e.g. B3GNT2, B4GALT1, and PHKB) metabolism. Gene set enrichment analysis showed that pathways connected with proteolysis/amino acid metabolism (purine and nitrogen metabolism) as well as fat metabolism (long-term potentiation) and mammary gland function (tight junction) were overrepresented. Our results provide the first evidence of a putative link between cow's genes and cheese flavour and offer new insights into the role of potential candidate loci and the biological functions contributing to the cheese volatilome.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30451907      PMCID: PMC6242841          DOI: 10.1038/s41598-018-35323-5

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Cheese quality depends on many related, interacting factors, ranging from compositional, functional, sensory and safety characteristics to nutritional, psychological, convenience, processing and economic factors[1]. Consumer acceptability of dairy products is highly dependent on sensory characteristics[2], particularly flavour, an important determinant of quality. During the manufacture and ripening of cheese, enzymes from various sources (native milk enzyme, rennet, lactic acid bacteria, secondary microflora and exogenous enzyme preparations) are responsible for the breakdown of macronutrients (fat, proteins and lactose) into fatty acids, amino acids and lactic acid, the major precursors of volatile organic compounds (VOCs), which play a significant role in determining cheese flavour[3,4]. Several studies aimed at characterizing the volatile fraction of various cheeses have been conducted[5-7], most of them using solid-phase micro-extraction (SPME)-GC-MS equipment. Proton-transfer-reaction time-of-flight mass spectrometry (PTR-ToF-MS), however, is a more time-efficient and sensitive method for characterising the cheese VOC fingerprint[8,9]. Several factors (e.g. dairy system, herd, individual cow characteristics) have been shown to affect the cheese volatilome[9,10] and evidence for the existence of an exploitable genetic variation in the cheese VOC profile has also recently been put forward[11], suggesting there is potential to modify cheese flavour through selective breeding in order to improve cheese quality. Genome-wide association studies (GWAS) have been widely used to disentangle the genomic architecture underlying complex traits in dairy cattle[12-14]. It has become common to couple GWAS with biological pathway analysis to extract biological information from the GWAS data and overcome the limitations of this method, such as the its reduced ability to detect small-effect loci and its poor replication[15-17]. The genomic and biological information thus acquired makes it possible to elucidate the genetic basis and molecular mechanisms underlying complex traits on the one hand, and, on the other hand, to increase the accuracy of genomic prediction when incorporated into prediction models[18,19]. Herein, we investigated whether cow’s genetic background contributes to variability in the cheese volatilome and, therefore, might play a role in determining cheese flavour. The potential existence of a genomic control for VOC profile in cheese would be of considerable significance given the economic importance of cheese quality to the dairy industry. To our knowledge, there is no existing information on whether there is a relationship between the cow’s genome and the cheese VOC profile, nor on the biological functions that may be involved in regulating the cheese volatilome. The aim of this study, therefore, was i) to perform GWAS analyses for milk and cheese composition traits in dairy cows, and for cheese VOC profiles determined by proton-transfer-reaction time-of-flight mass spectrometry (PTR-ToF-MS), and ii) to carry out pathway analyses on the SNP markers, in order to identify genomic regions and biological mechanisms that contribute to the variability in cheese volatilome.

Results

Descriptive statistics and genomic heritability estimates for milk and cheese composition are reported in Table 1. We found milk fat percentage to have a relatively low heritability (0.08), and confirmed protein percentage as being under strong genetic influence (0.40). Lactose percentage was moderately heritable (h2 = 0.22), while heritability estimates were, instead, close to 0 for the milk fat to protein ratio, and cheese fat and protein, which depend mainly on the cheese-making procedure. Table 2 shows the concentrations and heritabilities for some of the spectrometric peaks associated with the VOCs of model cheeses measured by PTR-ToF-MS. Among the tentatively identified spectrometric peaks, those associated with dimethylsulfone m/z 95.017 (0.22), alkyl fragment (terpenes) m/z 81.070 (0.15), butan-1-ol/pentan-1-ol, heptan-1-ol m/z 75.080 (0.14) and hexanal/nonanal m/z 83.086 (0.10) had moderate heritabilities. Among the unknown compounds, the peaks at m/z 85.029 (0.22), m/z 135.134 (0.18), m/z 66.063 (0.18), m/z 48.053 (0.17), m/z 44.980 (0.15), m/z 83.071 (0.15) and m/z 169.044 (0.15) had the highest heritabilities.
Table 1

Descriptive statistics and genomic heritability (h2) for milk and cheese composition.

TraitMeanSDh2#SNPa
Milk yield, kg/d24.267.960.092
Milk composition
  Fat, %4.190.670.085
  Prot, %3.710.420.402
  Fat to prot1.130.18<0.011
  Lactose, %4.850.200.221
Cheese composition
  Fat, %38.114.23<0.012
  Prot, %27.084.04<0.015

a#SNP: number of significant SNP (5 × 10−5) for each trait.

Table 2

Descriptive statistics and genomic heritability (h2) for some spectrometric peaks from PTR-ToF-MS analysis of model cheeses*.

Measured mass(m/z)ppbh2#SNPa
MeanSD
Volatile compounds
44.9804.640.660.151
48.05310.030.820.175
49.0116.820.670.08
49.0285.280.730.102
63.0448.020.490.083
66.0638.610.860.185
75.0275.470.960.083
75.0807.701.140.145
78.0014.230.890.053
79.0756.170.640.072
81.0705.370.460.154
82.9454.480.700.133
83.0717.440.950.153
83.0866.130.690.103
84.0754.410.580.093
84.9424.290.630.085
85.0294.460.400.203
91.0598.470.810.152
92.0616.640.970.052
93.09010.111.370.134
93.4324.540.460.073
94.0395.430.900.132
94.0956.921.370.113
95.0044.720.650.051
95.0175.040.710.223
95.0345.220.640.102
95.0965.440.980.142
105.0394.530.440.075
109.0706.300.700.062
111.1045.080.940.122
113.0294.640.480.061
119.0725.540.660.115
119.0896.441.000.132
121.1225.170.820.135
123.0474.560.350.052
123.0764.560.380.093
123.1174.820.640.062
127.0734.490.380.081
129.0644.370.370.13
129.1274.950.690.06
133.1024.450.380.06
133.1235.600.690.073
135.1026.690.620.07
135.1345.800.750.183
137.1014.710.500.124
137.1325.280.390.162
139.0764.580.350.062
139.1344.120.540.092
149.0456.080.970.166
157.1594.330.460.081
163.0966.551.250.101
169.0445.920.920.155
171.1734.870.760.113
173.1535.200.480.091
189.1844.230.500.07
191.1634.350.320.07
201.1844.340.410.072

*Only the peaks with h2 ≥ 0.05 are presented.

a#SNP: number of significant SNP (5 × 10−5) for each trait.

Descriptive statistics and genomic heritability (h2) for milk and cheese composition. a#SNP: number of significant SNP (5 × 10−5) for each trait. Descriptive statistics and genomic heritability (h2) for some spectrometric peaks from PTR-ToF-MS analysis of model cheeses*. *Only the peaks with h2 ≥ 0.05 are presented. a#SNP: number of significant SNP (5 × 10−5) for each trait. Results of the GWAS analyses of milk and cheese composition and cheese VOCs are summarised in Table 3 and Supplementary Table S1. Overall, we detected 186 significant SNPs (P < 5E-05) across all Bos taurus autosomes (BTAs), which were associated to 120 traits. One SNP had an unknown position on the genome, which was significantly associated with m/z 131.107 (P = 1.04E-05). Most of the significant associations were one SNP-one trait (80%).
Table 3

Summary results of the genome wide association analysis for spectrometric peaks from PTR-ToF-MS analysis of model cheeses.

BTAa#SNPInterval, MbpTop SNPTop SNP P-valueTop SNP location, bpTop SNP MAFTrait(m/z)b
11rs1104704512.43E-0568465850.4261.062
11rs1093607402.50E-0596010180.3675.080, 48.053, 66.063, 84.075, 95.096
11rs1105883944.87E-05445695180.02113.029
1298.34–100.38rs415858108.97E-061003828400.0791.059, 123.047,137.101, 139.076
11rs1100865183.58E-051199318780.1255.055
11rs1104425123.07E-051316840530.25115.077
11rs1102249463.59E-051361276820.4751.044
12146.59–147.58rs1094541923.79E-061475829800.04159.138, 83.052
21rs1106626354.20E-05247340640.3443.054
21rs433046414.95E-05321548060.02103.075
21rs415737592.68E-05526120520.0493.432
21rs416432814.93E-051351673070.4343.054
31rs1098059341.95E-05264381360.45105.071
31rs1106820532.83E-05294618000.22105.091
31rs428242744.31E-05473374770.1563.044
3379.73–81.77rs433498362.87E-05817749720.0133.034, 51.044, 93.037
31rs290112173.87E-05952790130.22109.07, 95.034, 169.044
31rs437122014.87E-05984509190.3095.004
32105.61–106.61rs427913251.35E-051056132300.4695.034, 91.051
41rs1109912476.18E-06131484840.49129.091, 155.144, 173.153, 201.184
4320.69–22.62rs1099009967.78E-06226188150.02113.098, fatMILK
41rs423582651.18E-05354476600.1749.028,131.084
41rs424350594.43E-06392444470.01fatCHEESE
4270.42–70.45rs1107315933.13E-05704199510.0295.081
4293.71–93.74rs1106000593.00E-05937094950.27106.077
41rs433737041.84E-051195524570.05113.098
51rs1094389715.76E-06271306950.0592.061
51rs434393084.73E-05883119480.38105.039
51rs1091739221.30E-051109826400.33105.091
51rs1107588314.06E-051155610040.49105.091
61rs1105874191.64E-05370199720.05MY
61rs420050694.83E-05551479900.2163.044
6273.25–73.82rs416537624.50E-05732548010.16149.045, 123.076
6981.65–88.07rs1102397391.32E-07846899910.16protMILK, MY,163.096,85.029, 149.045, 93.037, 169.044, 75.027, 137.101, 123.047, 109.070, 91.059
61rs415910813.41E-05918402390.2189.060
61rs1109866763.57E-05993415210.14116.078
71rs435065784.65E-05245991670.19115.112
7265.47–67.20rs1105808021.37E-05672021300.09119.107, 111.119
71rs423837872.74E-05766686760.2144.058, 61.062
72rs415747923.42E-051055379810.3157.070, 82.945
81rs430977933.75E-0545032680.06105.039
81rs416546912.74E-05497239240.11137.132
8268.44–69.53rs415898873.13E-05695252760.0948.053, 135.134, 99.039
81rs1100027481.94E-05749976820.28117.047
8298.88–99.05rs1092634942.38E-05988815810.0175.08, 48.053, 66.063, 93.09111.104, 119.089, 46.031
91rs435842582.44E-05160234480.3681.070
9259.57–59.60rs436025973.05E-05595714910.07119.072
91rs416315882.14E-05669978520.05125.132, 157.159, 171.173
91rs415941752.39E-05754844560.18171.173
91rs436060081.87E-05869192660.3894.039
91rs1109504972.29E-05906909910.42109.099, 125.095
10211.42–11.71rs428898443.35E-06117077250.1773.027, 75.044
101rs1094239713.66E-05164608890.3673.027
101rs1093937817.00E-06302653870.0483.086
101rs436260862.89E-05469230300.0295.017
101rs1108423194.13E-05861556730.2745.033
111rs415693282.75E-05498245650.2183.086
111rs416711735.30E-07601506440.0178.001
111rs1098259613.67E-05787552600.0294.074
111rs1106222883.05E-05936358330.4694.039
12215.44–15.46rs290152213.23E-05154627790.41protCHEESE
12356.69–57.90rs415828112.63E-05570201020.4944.98, 123.117, 141.129, 60.045
121rs416202032.04E-05601348100.22105.039
12264.87–65.09rs1105649511.31E-05650924420.3474.051, 75.027
131rs422830064.78E-0566000080.2649.028
131rs416936454.70E-05451121170.15121.122
13272.93–74.05rs417122803.32E-05740542380.2581.070
141rs1103231352.88E-05193167020.38133.123, 201.184
14326.00–26.95rs423047867.12E-06261681470.04fatMILK, fatCHEESE, 67.058
141rs1092302063.33E-06440296340.26121.122, 147.134
141rs290240789.75E-06479967870.1083.086
14281.20–81.31rs1097312854.96E-05813093000.3596.961
151rs1102155342.61E-05234265460.1833.034
151rs437169724.71E-05311938190.2495.017
151rs1094299183.69E-05554883190.01fat_to_protMILK
151rs425308963.40E-05628490700.18113.098
151rs415684183.09E-05823881560.0233.034
161rs1098186962.85E-05129636660.06lactose
161rs417981969.62E-08217729910.05 prot CHEESE
161rs416405662.63E-05572397200.0481.070, 137.132
161rs1090330261.44E-05606150120.0378.001
161rs1103116774.51E-05688892250.13 prot CHEESE
161rs1096594981.14E-05714315410.2584.075, 119.072
161rs416342242.54E-05752329240.1946.031, 44.058
16277.29–77.47rs1103002631.72E-05772865290.0875.08, 48.053, 66.063, 83.071, 93.09,93.432, 94.095, 119.072, 93.090, 94.095
171rs1095011842.86E-05228203440.46107.066
17233.83–35.38rs424362952.02E-05338276410.0482.945, 84.942
171rs415775102.03E-05666718390.3757.07
18312.16–13.35rs416065346.58E-06133467680.11111.119, 121. 122,71.086, 79.075, 121.122, 133.123, 135.134
181rs418677852.40E-07161199850.0571.086, 75.08, 85.101, 48.053, 66.063, 70.064, 79.075, 83.071, 84.075, 93.09, 93.432, 94.095, 95.096, 111.104, 119.072119.089, 121.122, 123.076, 133.123, 135.134,136.022137.101, 139.076, 139.134
181rs415746923.12E-05312263550.0192.061
181rs1103238201.75E-06527390930.18111.119
191rs1106814232.59E-05489336190.02117.047
191rs1109858364.41E-05572137640.03113.057
19660.32–63.14rs415729675.78E-06616306100.2357.033, 57.070, 117.047
2026.94–6.97rs430963545.41E-0669355340.3156.045, 119.107, 40.027
201rs1107811473.66E-05136536620.4453.039
201rs416312763.76E-07462968400.01ProtCHEESE
211rs1107688922.51E-05111449850.01123.076
211rs1105829788.96E-06200283230.0157.033
21329.25–30.49rs416627769.19E-06304855840.4474.051, 125.132, 123.117, 171.173, 143.143
211rs426450464.13E-05368439510.11109.099
21740.72–45.33rs419849268.27E-06431170130.21131.107,105.071, 147.113
211rs110682144619945970.1644.058
221rs415866724.43E-0531266800.2644.022
221rs290103794.78E-05204346430.1495.081
221rs416089343.25E-05251418510.0384.942
221rs415846273.67E-05509260150.46127.073
22256.11–56.92rs1094703292.66E-05569168230.16105.071, 75.08
231rs416336903.39E-05423573620.14FatMILK
23450.14–51.86rs1095795779.47E-06517954960.1282.945, 96.961, 84.942, 100.954, fatCHEESE
241rs1104153883.44E-05577258120.0566.063
251rs1102981461.13E-0569688650.0284.942
251rs1102062312.52E-06163357670.4339.023, 40.027
251rs1092747954.88E-05190823290.26149.045
251rs1093227532.48E-05247850510.01105.039, 169.044
251rs290125992.05E-05290657780.0195.017
261rs1106252874.58E-05205979760.3678.001
261rs420965624.66E-05265855570.01139.134
261rs427041692.10E-05295294920.03113.057
261rs1108584062.35E-05320307740.2863.044
271rs1096638335.00E-06421180370.03fatMILK
281rs1098537064.70E-05227926130.2695.049
281rs1094123941.46E-05288840680.19127.112, 99.081, 44.022
281rs1109298153.91E-05395284210.3083.071
281rs416002364.08E-05424361880.01105.039
281rs1108202523.89E-05461467200.3242.010, 75.044
29226.01–26.24rs421739244.13E-05262437020.3160.021
291rs1110074594.01E-06374401500.1557.033
291rs421922391.31E-06472694190.0260.045
Uc11.04E-050.27131.107

#SNP = number of the single nucleotide polymorphisms significantly associated to the trait; Interval: The region on the chromosome spanned among the significant SNP(s) (in Mb); P-value (range) = The P-value of the highest significant SNP adjusted for genomic control; Top SNP location (bp) = position of the highest significant SNP on the chromosome in base pairs on UMD3.1 (http://www.ensembl.org/index.html); Top SNP MAF = minor allele frequency of the top SNP.

aBTA: Bos taurus autosome.

bMY: milk yield. Cheese volatile compounds are reported as measured mass (m/z). The trait with the highest P-value in each genomic region is bolded.

cU: Undefined chromosome and position on the genome.

Summary results of the genome wide association analysis for spectrometric peaks from PTR-ToF-MS analysis of model cheeses. #SNP = number of the single nucleotide polymorphisms significantly associated to the trait; Interval: The region on the chromosome spanned among the significant SNP(s) (in Mb); P-value (range) = The P-value of the highest significant SNP adjusted for genomic control; Top SNP location (bp) = position of the highest significant SNP on the chromosome in base pairs on UMD3.1 (http://www.ensembl.org/index.html); Top SNP MAF = minor allele frequency of the top SNP. aBTA: Bos taurus autosome. bMY: milk yield. Cheese volatile compounds are reported as measured mass (m/z). The trait with the highest P-value in each genomic region is bolded. cU: Undefined chromosome and position on the genome. We identified significant associations on BTA4, BTA14, BTA23 and BTA27 for milk fat, with the highest peak corresponding to marker rs42435059 (P = 4.43E-06) located at 39,244,447 on BTA4. We also identified significant associations for milk protein and milk yield on BTA6, the highest signal being associated with milk protein and corresponding to the marker rs110239739 (P = 1.32E-07) located at 84,689,991 bp. Only 1 SNP (rs109429918) was significant for the fat-to-protein ratio and this was located on BTA15 at 55,488,319 Mbp. A significant association was found for lactose on BTA16 and corresponded to rs109818696 located at 12,963,666. Significant SNPs for cheese fat were mapped on BTA 14 (~26.17 Mbp) and BTA23 (~42.36 Mbp). We detected very high peaks for cheese protein on BTA16 and BTA20, corresponding to markers rs41798196 located at 21,772,991 on BTA16 (P = 9.62E-08) and rs41631276 located at 46,296,840 on BTA20 (P = 3.76E-07). We found other significant associations for cheese protein on BTA12 at ~15.45 Mbp. Regarding cheese VOCs, we detected the strongest signals on BTA11 and BTA18. Marker rs41671173 located at 60,150,644 bp on BTA11 was significant for the spectrometric peak at m/z 78.001 (P = 5.30E-07). We detected another strong signal at 16,119,985 bp on BTA18 and corresponded to marker rs41867785, which was associated with the peak at m/z 135.134 (P = 1.10E-07). Overall, this marker was significant for 24 spectrometric peaks, three of which were tentatively associated with butan-1-ol/pentan-1-ol, heptan-1-ol[9,11] m/z 75.080 (P = 1.97E-06), 3-methyl-1-butanol/3-methyl-3-buten-1-ol/pentan-1-ol m/z 71.086 (P = 1.20E-05) and hexan-1-ol/hexan-2-ol m/z 85.101 (P = 1.61E-05). The largest regions of consecutive SNPs were located on BTA6 (~81.65–88.07 Mbp) and BTA21 (~40.72–45.33 Mbp). The spectrometric peaks with the highest number of significant SNPs were those associated with ethyl pentanoate (ethyl valerate)/ethyl-2-methylbutanoate/ethyl-3-methylbutanoate (ethyl isovalerate)/heptanoic acid m/z 131.107 (6), m/z 149.045 (5), m/z 48.053 (5), m/z 66.063 (5), the peaks associated with butan-1-ol/pentan-1-ol, heptan-1-ol m/z 75.080 (5), m/z 84.942 (5), m/z 105.039 (5), m/z 117.047 (5), m/z 119.072 (5), m/z 121.122 (5) and m/z 169.044 (5). The chromosomes with the highest number of significant associations were BTA1 (10), BTA3 (9), BTA4 (11), BTA6 (16), BTA16 (9) and BTA21 (14). Based on the similarity matrix generated with ExpressionCorrelation, we identified 8 sub-networks represented by ≥3 nodes (Fig. 1), within which ClusterOne identified 12 densely connected clusters (P < 0.05; Supplementary Table S2). Two clusters were detected in sub-network 1: one with 13 nodes, which included some spectrometric peaks tentatively associated with aldehydes and/or ketones, i.e. hexan-1-one/hexan-2-one/hexanal m/z 101.097, heptan-2-one m/z 115.112, octan-1-one m/z 129.127 and nonan-2-one m/z 143.143; the other with 7 nodes, which included some spectrometric peaks associated with aldehydes, ketones or alcohols, i.e. propan-2-one (acetone) m/z 59.049, 1,2-pentanediol m/z 105.091 and 2-methylbutanal/3-methylbutanal/pentan-2-one m/z 87.080. A cluster of 16 nodes was significant in subnetwork 2, which included the spectrometric peak associated with butan-1-ol m/z 75.080. Sub-network 3 comprised 2 clusters with 7 nodes, which contained some spectrometric peaks associated with esters and/or alcohols, i.e. ethyl hexanoate/octanoic acid m/z 145.123, ethyl butanoate/ethyl-2-methylpropanoate (ethyl isobutyrate) m/z 117.091 and hexanoic acid m/z 99.081. Two clusters were detected in sub-network 4: the first with 4 nodes, including the spectrometric peak associated with acetates/acetic acid m/z 61.028; the second also included the spectrometric peaks tentatively identified as 3-hydroxy-2-butanone (acetoin) m/z 89.060 and butanoic acid m/z 71.049. Two clusters with 5 nodes were significant in sub-network 5, which included spectrometric peaks associated with the alkyl fragment m/z 43.054, m/z 41.039 and m/z 57.070. Sub-network 6 contained a significant cluster which included the spectrometric peak associated with 2,6-dimethyl pyrazine m/z 109.070. Sub-networks 7 and 8 contained clusters of 4 and 3 nodes, respectively, neither of which included any tentatively identified spectrometric peaks.
Figure 1

Similarity network among cheese volatile compounds generated using ExpressionCorrelation. The nodes corresponded to cheese VOCs and the edges represented the similarity between vectors of the additive effects of all SNPs. Only correlations with r > |0.80| and P < 0.01 are represented. Eight sub-networks of ≥3 nodes were identified which contained significantly dense clusters of VOCs (P < 0.05) detected by ClusterOne. The width of the edge indicates the value of the correlation; a wider edge corresponds to a higher correlation in absolute value.

Similarity network among cheese volatile compounds generated using ExpressionCorrelation. The nodes corresponded to cheese VOCs and the edges represented the similarity between vectors of the additive effects of all SNPs. Only correlations with r > |0.80| and P < 0.01 are represented. Eight sub-networks of ≥3 nodes were identified which contained significantly dense clusters of VOCs (P < 0.05) detected by ClusterOne. The width of the edge indicates the value of the correlation; a wider edge corresponds to a higher correlation in absolute value.

Pathway analyses

Of the total 37,568 SNPs used in this study, 17,006 were located 15 kb up- or down-stream of the coding regions. An average of around 900 genes were significant (P < 0.05) for the peaks tentatively associated with cheese VOCs. We carried out pathway analyses to shed light on the biological role of these genes and to identify potentially overrepresented pathways or molecular functions that might help explain the variability in the cheese volatilome. Overall, pathways of 5 of the 45 tentatively identified compounds were significantly enriched (FDR < 0.05) (Fig. 2, Supplementary Table S3). Results showed that purine metabolism was enriched for the peak associated with phenol m/z 95.049 (FDR = 0.00017), while the tight junction pathway was overrepresented for the spectrometric peaks associated with heptan-2-one[20] m/z 115.112 (FDR = 0.00013) and ethyl pentanoate (ethyl valerate)/ethyl-2-methylbutanoate/ethyl-3-methylbutanoate (ethyl isovalerate)/heptanoic acid m/z 131.107. Furthermore, the nitrogen metabolism pathway was significantly enriched for the peak associated with ethyl pentanoate (ethyl valerate)/ethyl-2-methylbutanoate/ethyl-3-methylbutanoate(ethyl isovalerate)/heptanoic acid (FDR = 0.00019) m/z 131.107. Finally, the long-term potentiation pathway was enriched for the peaks associated with octan -1–one m/z 129.127 and nonan-2-one m/z 143.143 (FDR = 0.00023 and FDR = 0.00024, respectively).
Figure 2

Significantly enriched KEGG pathways using genes associated to spectrometric peaks with a tentative identification from PTR-ToF-MS analysis of model cheeses. Only the traits showing significantly enriched terms are reported (FDR < 0.05). EPE_E2MB_E3MB_HA: Ethyl pentanoate (ethyl valerate)-Ethyl-2-methylbutanoate-Ethyl-3-methylbutanoate (ethyl isovalerate)-Heptanoic acid.

Significantly enriched KEGG pathways using genes associated to spectrometric peaks with a tentative identification from PTR-ToF-MS analysis of model cheeses. Only the traits showing significantly enriched terms are reported (FDR < 0.05). EPE_E2MB_E3MB_HA: Ethyl pentanoate (ethyl valerate)-Ethyl-2-methylbutanoate-Ethyl-3-methylbutanoate (ethyl isovalerate)-Heptanoic acid.

Discussion

GWAS analysis

In recent years, there has been growing concern about food quality and safety from both the demand and the supply sides. Given that flavour attributes play a crucial role in cheese quality[21], better knowledge of the key flavour components and pathways involved in the development and characterisation of cheese VOCs would provide a useful basis for defining cheese-making procedures more precisely, and improving cheese sensory characteristics. There is also increasing interest in the authentication of traditional cheeses with EU protected designation of origin classification, which are often linked to local breeds and help maintain farm animal biodiversity[22]. In this study, therefore, we first sought to investigate whether the cow’s genome organisation significantly impacts on the cheese volatilome, and possibly cheese flavour. Although plenty of GWAS studies for milk production traits[13,23,24] and cheese-making properties[25,26] in dairy cows have now been published, to our knowledge none has focused on identifying the genomic regions associated with cheese composition and quality traits. Despite the lack of GWAS analyses for cheese VOCs, the estimates of genomic heritability found in this study confirm previous findings supporting the existence of an exploitable genetic variation in cheese VOCs[11]. The main pathways involved in the formation of cheese VOCs are glycolysis (metabolism of lactose, lactate and citrate), lipolysis (and metabolism of fatty acids) and proteolysis (and catabolism of amino acids)[3]. Accordingly, our GWAS analyses revealed a contribution of cow’s genes related to protein, fat and carbohydrate metabolism.

Protein metabolism

A region of 9 SNPs on BTA6 covered the cluster of casein genes (~87.14–87.38 Mbp) and showed significant association with 12 traits, including milk protein. In particular, 4 spectrometric peaks - m/z 85.029, m/z 149.045, m/z 163.096 and m/z 169.044 - were associated with the marker rs41567942, which was located 0.4 Mb from the gene encoding for k-casein (CSN3), which is essential for milk coagulation and therefore largely influences milk coagulation properties[27]. Moreover, the marker rs29001782 was located on BTA6 at 4 kb from GNRHR, which signalling pathway has been shown to play a role in controlling milk protein synthesis and metabolism[17]. Interestingly, the markers rs110300263 and rs111018457, which had significant associations with 10 traits, were located in the region at ~77.29–77.47 Mbp on BTA16, which was close to a quantitative trait locus (QTL) for the milk protein and k-casein percentages[28]. Two markers, which were associated to m/z 56.045 and m/z 119.107 (rs43096354) and m/z 40.027 (rs42353243), mapped on BTA20 at ~0.3 Mb from FAM169A which has been suggested to be a key regulator of milk protein synthesis in dairy cattle[17]. Additionally, the region of 7 SNPs on BTA21 included a known QTL for milk fat and protein yield and percentage from the Cattle QTL database information[28].

Fat metabolism

The contribution of fatty acid metabolism to cheese VOCs is corroborated by several significant associations. For instance, rs43283349, which was significant for 3-methylbutyl butanoate (isoamyl butyrate)/nonanoic acid m/z 159.138, was located on BTA1 at ~0.1 Mb from AGPAT3, a positional candidate gene for milk FA[29]. The marker rs110986676, which was located on BTA6 and was significant for m/z 116.078, corresponded to an intron variant of SCD5 which was associated to variation in milk FA composition in dairy cattle[16,30]. The marker rs110681423, which was associated with m/z 117.047, was located on BTA19 at ~0.2 Mb from GH1 which has been put forward as candidate gene for milk fat percentage and fat composition[12,31]. The marker rs110858406, associated to m/z 63.044, mapped on BTA26 at ~0.9 Mb from GPAM which is involved in the regulation of milk fat synthesis and composition in dairy cattle[32,33]. Finally, rs110820252, which had significant associations with the spectrometric peaks associated with the alkyl fragment m/z 42.01 and propanoic acid/ propanoic ester m/z 75.044 mapped on BTA28 within 2 kb 5′ to AGT, which is the sole precursor of all angiotensin peptides. Interestingly, the renin-angiotensin system is believed to impact body-fat storage as well as lipid and carbohydrate metabolism[34,35].

Carbohydrate metabolism

A significant association was found between rs110002748 and m/z 117.047, which mapped on BTA8 at ~1 Mb from B4GALT1. This gene encodes an enzyme that participates in glyconjugation and lactose biosynthesis, which occurs exclusively in the mammary gland[36]. An increase in the expression of B4GALT1 was observed in transition milk samples, and is reflected in an increase in lactose biosynthesis during the earlier stages of lactation[37]. The high signal detected on BTA11 (rs41671173) was located on BTA11 at ~0.5 Mb from B3GNT2, which synthesizes a unique structure known as poly-N-acetyllactosamine (polyLacNAc), a linear carbohydrate polymer composed of alternating N-acetylglucosamine and galactose residues[38]. This SNP explained ~60% of additive genetic variance for m/z 78.001. Finally, the high signals on BTA18 corresponded to the marker rs41867785, which is annotated as an intron variant of PHKB. This gene has been associated with the carbohydrate metabolic process, the generation of precursor metabolites and energy, and energy reserve[39].

Correlations among VOCs based on SNP additive effects

A greater level of detail concerning the shared genomic basis of cheese VOCs might form the basis for more accurate prediction models to be developed in the context of genomic selection for possible modulation of cheese flavour. In a previous work, we estimated the genetic relationships among cheese VOCs based on pedigree information[11]. Here, we used ExpressionCorrelation to calculate pairwise correlations between VOCs based on the SNP additive genetic effects, and we clearly identified groups of VOCs sharing a common behaviour. Having tentatively identified some compounds, we sought to associate the largest sub-networks to biochemical pathways and possibly associated flavour notes. Sub-network 1 contained mostly ketones and aldehydes and might, therefore, represent catabolism of amino acids and fatty acids. Branched-chain aldehydes originate from AA degradation, in particular 2-methylbutanal from isoleucine and 3-methylbutanal from leucine[40], while ketones can be produced from β-ketoacids derived from β-oxidation of fatty acids[41]. Green/fruity/floral notes are mostly associated with the compounds included in this group[20,40,42]. The reaction between free fatty acids and alcohols from lactose and AA degradation yield esters[43], common cheese VOCs, and this pathway might be represented in sub-network 3, including the spectrometric peaks associated with hexanoic acid, ethyl hexanoate/octanoic acid and ethyl butanoate/ethyl-2-methylpropanoate (ethyl isobutyrate). Most esters (e.g. ethyl butanoate, ethyl hexanoate, ethyl-2-methylpropanoate) are associated with the sweet, fruity and floral characteristics of cheese[44-46]. Finally, sub-network 4 might represent the glycolysis pathway, and, in particular, lactate or citrate metabolism, since it included the spectrometric peaks associated with the acetate ester fragment/acetic acid, 3-hydroxy-2-butanone(acetoin) and butanoic acid. Lactose is metabolised by starter bacteria, mostly through the glycolytic pathway, into lactate, which might be further metabolised into acetate by lactococci or into butyrate by Clostridium sp.[47]. Acetate is also the main flavour compound originating from citrate metabolism as well as acetoin[47,48]. Cheesy, rancid and sour milk notes are associated with 3-hydroxy-2-butanone(acetoin) and butanoic acid[45,49], while acetic acid has a typical vinegar odour[50].

Pathway analysis

Standard GWAS analysis allows individual loci and genes likely to play a role in controlling the investigated traits. However, it lacks the power to establish whether the detected genes act in cooperation as part of a complex network to control specific biological functions. We therefore carried out pathway analyses to prioritize genes in associated loci that are part of the biological pathways and processes potentially contributing to the cheese volatilome. These pathway analyses confirmed the importance of proteolysis and amino acid metabolism for the formation of cheese VOCs (i.e. nitrogen and purine metabolism). Phenol in cheese originates from the metabolism of protein (casein) and, in particular, from the catabolism of tyrosine[3]. Besides sugar and fat metabolism, amino acid metabolism also provides substrates for ester formation, which might explain the enrichment of nitrogen metabolism for the spectrometric peaks associated with ethyl pentanoate (ethyl valerate)/ethyl-2-methylbutanoate/ethyl-3-methylbutanoate(ethyl isovalerate)/heptanoic acid m/z 132.109. The tight junction pathway was enriched for the spectrometric peaks associated with heptan-2-one and ethyl pentanoate (ethyl valerate)-ethyl-2-methylbutanoate-ethyl-3-methylbutanoate (ethyl isovalerate)-heptanoic acid. In the mammary gland, the tight junction (TJ) state is closely linked to milk secretion[51], as they are involved in the transcellular transport of lactose and K+ to the extracellular fluid, while Na+ and Cl− are transported to the milk[52]. TJ integrity is compromised during mammary involution and also as a result of mastitis and periods of mammary inflammation[53]. Among the genes identified within this pathway, we found three protein kinase C (PKC) family members: alpha (PRKCA), beta (PRKCB) and epsilon (PRKCE). Several PKC inhibitors affect both the assembly and disassembly of TJs, which means that PKCs may regulate the dynamics of TJ formation[54]. Interestingly, this pathway was enriched for the energy of the curd as a percentage of the energy of the milk processed, which is an indicator of cheese-making efficiency[55]. Finally, enrichment of the long-term potentiation pathway for the spectrometric peaks associated with two ketones, octan-1-one and nonan-2-one, might be connected to their biosynthetic pathway, which is related to fatty acid metabolism; indeed, this pathway was significantly overrepresented in a recent GWAS and pathway-based analysis of milk fatty acids in dairy cows[16]. Moreover, this pathway contained several genes coding for glutamate ionotropic receptors (GRI), including GRIA1; it is of note that previous findings assigned to this gene a significant SNP for C14:1[15]. In our study, we exploited the potential of PTR-ToF-MS to provide detailed spectral information to characterise food quality and authentication, and this was integrated with the genomic and biological information provided by GWAS and pathway analyses. Results obtained increase our understanding of the metabolic pathways and biological functions likely involved in the formation of cheese VOCs, providing unprecedented insights into the potential contribution of the cow’s genes to cheese flavour. A more effective approach might be to more accurately identify compounds using PTR-MS and to improve the quality of cattle genome annotations.

Methods

Ethics statement

The cows in the current study belonged to commercial private herds and were not subjected to any invasive procedures. Milk and blood samples were previously collected during routine milk recording coordinated by technicians from the Breeders’ Association of Trento Province (Italy), hence certified by the local authority.

Phenotypes and genotypes

Individual milk samples were collected from 1,075 Italian Brown Swiss cows from 72 commercial herds located in the Alpine province of Trento (Italy). Details of the animals used in this study and the characteristics of the area are reported in Cipolat-Gotet et al.[56] and Cecchinato et al.[57] Gross milk composition was measured using a MilkoScan FT6000 (Foss Electric A/S Hillerød, Denmark). Model cheeses were manufactured from the raw milk of individual cows, as described in detail in Cipolat-Gotet et al.[56]. We used a commercial starter culture at a concentration 8 times higher than recommended in order to reduce the acidification time to 90 min and minimise the role of milk microflora. After ripening (60d), the model cheeses were weighed and analysed for fat and protein contents using a FoodScan apparatus (Foss Electric, Hillerød, Denmark). The headspace gas of each model cheese (n = 1,075) was measured with a commercial PTR-ToF-MS 8000 instrument supplied by Ionicon Analytik GmbH, Innsbruck (Austria), as described in detail in Bergamaschi et al.[10] Internal calibration and peak extraction was performed according to the procedure described by Cappellin et al.[58] Absolute headspace VOC concentrations, expressed as parts per billion by volume (ppbv), were estimated using the formula described by Lindinger et al.[59] Given that the distribution of all spectrometric peaks showed a strong positive skewness, the data were transformed: the fraction of each peak plus one was multiplied by 106 and expressed as a natural logarithm to obtain a Gaussian-like data distribution. After filtering out all peaks below a threshold of 1 ppbv and interfering ions, 240 spectrometric peaks remained for the analyses. The fragmentation pattern of 61 relevant compounds, representing 78.0% of the total spectral intensity of the compressed data set without interfering ions, were retrieved from available GC-MS data on the same model cheeses[10] and from the literature[60-62]. Isotope removal (r > 0.95, P < 0.001) yielded 173 spectrometric peaks, of which 45 were tentatively associated with VOCs. The Illumina BovineSNP50 v.2 BeadChip (Illumina Inc., San Diego, CA) was used to genotype 1,152 cows (blood samples were not available for all the phenotyped animals). Quality control excluded markers with call rates >95%, with minor allele frequencies >0.5%, and without extreme deviation from Hardy-Weinberg equilibrium (P > 0.001, Bonferroni corrected). After filtering, 1,011 cows and 37,568 SNPs were retained for subsequent analyses.

Genome-wide association study

Genome-wide association analyses (GWAS) were conducted using single-marker regression and the three-step Genome-wide Association using the Mixed Model and Regression-Genomic Control (GRAMMAR-GC) approach[63] implemented in the GenABEL R package[64]. In the first step, an additive polygenic model with a genomic relationship matrix is fitted; secondly, the residuals obtained from this model are regressed on the SNPs to test for associations; in the third step, genomic control corrects for the conservativeness of the procedure[65]. The polygenic model was:where y is a vector of the observed response (milk fat, protein and fat-to-protein ratio; cheese fat and protein; cheese VOCs); β is a vector with the fixed effects of (i) days in milk of the cow (classes of 30 days each), (ii) the parity of each cow (classes of 1, 2, 3, ≥4), and (iii) the herd-date effect (n = 72); X is an incidence matrix connecting each observation to specific levels of the factors in β. The two random terms in the model were the animal and the residuals, which were assumed to be normally distributed as and , where G is the genomic relationship matrix, I is the identity matrix, is the additive genomic variance and the residual variance. The G matrix was built in GenABEL[64] using identity-by-state coefficients. We adopted a threshold of P < 5 × 10−5 to declare significant SNPs[66]. The proportion of genomic variance explained by the SNPs was calculated as 2pqa2, where p and q were the allele frequencies and a was the allele substitution effect. Model (1) was also used to estimate the variance components and the genomic heritability of the traits based on the genomic relationship matrix. Heritability was estimated as . The results of the GWAS analysis without filtering for the P-value threshold were used to build a matrix of row-wise SNPs (n = 37,568) and column-wise phenotypes (i.e. cheese VOCs, n = 173) in which the value in the cell corresponded to the SNP additive effect. This matrix was fed into the ExpressionCorrelation plugin of Cytoscape[67] to create a correlation matrix of pair-wise Pearson correlations between phenotypes based on the effect across all the SNPs included in the analysis. Only the high-confidence correlations with P < 0.01 and >|0.80| were selected. A similarity network was generated by ExpressionCorrelation, where the nodes corresponded to the phenotypes and the edges represented the similarity between vectors of the additive effects of all SNPs. This network was analysed with the ClusterOne plugin of Cytoscape[68] to identify significantly dense clusters of VOCs (Mann-Whitney test, P < 0.05).

Gene-set enrichment and pathway analyses

Pathway analyses were carried out on the tentatively identified spectrometric peaks (n = 45) to shed light on the biological functions underlying the synthesis and/or metabolism of cheese VOCs. As detailed in Dadousis et al.[55], the GWAS results were filtered for significance with a P-value < 0.05 to identify “relevant” and “non-relevant” SNPs. Using the BiomaRt R package[69,70], we assigned “relevant” SNPs to genes if they were located within the gene or within 15 kb up- or down-stream of the gene[71] based on the Ensembl Bos taurus UMD 3.1 assembly. This made it possible to also capture those SNPs that are missed by standard GWAS, due to its stringent significance threshold, but that may help explain the variability in the observed phenotypes, which may play a role in organised pathways or biological functions. The Kyoto Encyclopaedia of Genes and Genomes (KEGG)[72] and the Gene Ontology (GO) databases[73] were used to define the functional categories associated with the gene sets. To avoid testing broad or narrow functional categories, only GO and KEGG terms with >10 and <1000 genes were considered. A Fisher’s exact test was used to test for overrepresentation of functional categories (FDR < 0.05). The gene-set enrichment analysis was performed with the R package goseq[74]. Supplementary Dataset 1 Supplementary Dataset 2 Supplementary Dataset 3
  52 in total

1.  PTR-TOF-MS and data-mining methods for rapid characterisation of agro-industrial samples: influence of milk storage conditions on the volatile compounds profile of Trentingrana cheese.

Authors:  Alessandra Fabris; Franco Biasioli; Pablo M Granitto; Eugenio Aprea; Luca Cappellin; Erna Schuhfried; Christos Soukoulis; Tilmann D Märk; Flavia Gasperi; Isabella Endrizzi
Journal:  J Mass Spectrom       Date:  2010-09       Impact factor: 1.982

2.  Extending the dynamic range of proton transfer reaction time-of-flight mass spectrometers by a novel dead time correction.

Authors:  Luca Cappellin; Franco Biasioli; Erna Schuhfried; Christos Soukoulis; Tilmann D Märk; Flavia Gasperi
Journal:  Rapid Commun Mass Spectrom       Date:  2011-01-15       Impact factor: 2.419

3.  Rapid variance components-based method for whole-genome association analysis.

Authors:  Gulnara R Svishcheva; Tatiana I Axenovich; Nadezhda M Belonogova; Cornelia M van Duijn; Yurii S Aulchenko
Journal:  Nat Genet       Date:  2012-09-16       Impact factor: 38.330

4.  Genome-wide association and pathway-based analysis using latent variables related to milk protein composition and cheesemaking traits in dairy cattle.

Authors:  Christos Dadousis; Sara Pegolo; Guilherme J M Rosa; Giovanni Bittante; Alessio Cecchinato
Journal:  J Dairy Sci       Date:  2017-08-24       Impact factor: 4.034

5.  Polymorphisms in genes in the SREBP1 signalling pathway and SCD are associated with milk fatty acid composition in Holstein cattle.

Authors:  Gonzalo Rincon; Alma Islas-Trejo; Alejandro R Castillo; Dale E Bauman; Bruce J German; Juan F Medrano
Journal:  J Dairy Res       Date:  2011-11-25       Impact factor: 1.904

6.  Beta3GnT2 (B3GNT2), a major polylactosamine synthase: analysis of B3GNT2-deficient mice.

Authors:  Akira Togayachi; Yuko Kozono; Atsushi Kuno; Takashi Ohkura; Takashi Sato; Jun Hirabayashi; Yuzuru Ikehara; Hisashi Narimatsu
Journal:  Methods Enzymol       Date:  2010       Impact factor: 1.600

7.  A travel guide to Cytoscape plugins.

Authors:  Rintaro Saito; Michael E Smoot; Keiichiro Ono; Johannes Ruscheinski; Peng-Liang Wang; Samad Lotia; Alexander R Pico; Gary D Bader; Trey Ideker
Journal:  Nat Methods       Date:  2012-11-06       Impact factor: 28.547

8.  Transcriptome profiling of bovine milk oligosaccharide metabolism genes using RNA-sequencing.

Authors:  Saumya Wickramasinghe; Serenus Hua; Gonzalo Rincon; Alma Islas-Trejo; J Bruce German; Carlito B Lebrilla; Juan F Medrano
Journal:  PLoS One       Date:  2011-04-25       Impact factor: 3.240

9.  Gene networks driving bovine milk fat synthesis during the lactation cycle.

Authors:  Massimo Bionaz; Juan J Loor
Journal:  BMC Genomics       Date:  2008-07-31       Impact factor: 3.969

10.  Genomic regions associated with bovine milk fatty acids in both summer and winter milk samples.

Authors:  Aniek C Bouwman; Marleen H P W Visker; Johan A M van Arendonk; Henk Bovenhuis
Journal:  BMC Genet       Date:  2012-10-29       Impact factor: 2.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.