| Literature DB >> 23874393 |
Inês Mendes1, Ricardo Franco-Duarte, Lan Umek, Elza Fonseca, João Drumonde-Neves, Sylvie Dequin, Blaz Zupan, Dorit Schuller.
Abstract
Saccharomyces cerevisiae strains from diverse natural habitats harbour a vast amount of phenotypic diversity, driven by interactions between yeast and the respective environment. In grape juice fermentations, strains are exposed to a wide array of biotic and abiotic stressors, which may lead to strain selection and generate naturally arising strain diversity. Certain phenotypes are of particular interest for the winemaking industry and could be identified by screening of large number of different strains. The objective of the present work was to use data mining approaches to identify those phenotypic tests that are most useful to predict a strain's potential for winemaking. We have constituted a S. cerevisiae collection comprising 172 strains of worldwide geographical origins or technological applications. Their phenotype was screened by considering 30 physiological traits that are important from an oenological point of view. Growth in the presence of potassium bisulphite, growth at 40 °C, and resistance to ethanol were mostly contributing to strain variability, as shown by the principal component analysis. In the hierarchical clustering of phenotypic profiles the strains isolated from the same wines and vineyards were scattered throughout all clusters, whereas commercial winemaking strains tended to co-cluster. Mann-Whitney test revealed significant associations between phenotypic results and strain's technological application or origin. Naïve Bayesian classifier identified 3 of the 30 phenotypic tests of growth in iprodion (0.05 mg/mL), cycloheximide (0.1 µg/mL) and potassium bisulphite (150 mg/mL) that provided most information for the assignment of a strain to the group of commercial strains. The probability of a strain to be assigned to this group was 27% using the entire phenotypic profile and increased to 95%, when only results from the three tests were considered. Results show the usefulness of computational approaches to simplify strain selection procedures.Entities:
Mesh:
Year: 2013 PMID: 23874393 PMCID: PMC3713011 DOI: 10.1371/journal.pone.0066523
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Geographical location of 172 yeast strains.
Underlined identifiers indicate the original designation of sequenced strains [12]. Symbols represents the strains technological applications or origin: black star – wine and vine; grey star – commercial wine strain; black square – clinical; grey square – natural isolates; black circle – sake; grey circle – other fermented beverages; black pentagon – beer; grey pentagon- baker; black rectangle – laboratory; grey rectangle – unknown biological origin.
Number of strains belonging to different phenotypic classes, regarding values of optical density (Class 0: A640 = 0.1; Class 1: 0.2
| Phenotypic test | Type of medium | Phenotypic class of growth | |||
| 0 | 1 | 2 | 3 | ||
| 30°C | liquid (must) | 0 | 0 | 3 | 168 |
| 18°C | liquid (must) | 51 | 120 | 1 | 0 |
| 40°C | liquid (must) | 28 | 14 | 80 | 50 |
| pH 2 | liquid (must) | 101 | 68 | 3 | 0 |
| pH 8 | liquid (must) | 0 | 0 | 19 | 153 |
| KCl (0.75 M) | liquid (must) | 0 | 2 | 146 | 24 |
| NaCl (1.5 M) | liquid (must) | 84 | 79 | 9 | 0 |
| CuSO4 (5 mM) | liquid (must) | 124 | 45 | 3 | 0 |
| SDS (0.01% w/v) | liquid (must) | 139 | 32 | 1 | 0 |
| Ethanol 6% (v/v) | liquid (must) | 0 | 2 | 36 | 134 |
| Ethanol 10% (v/v) | liquid (must) | 17 | 28 | 85 | 42 |
| Ethanol 14% (v/v) | liquid (must) | 82 | 35 | 50 | 5 |
| Ethanol 12% (v/v) | solid (MEA) | 150 | 20 | 1 | 1 |
| Ethanol 12% (v/v) + Na2S2O5 (75 mg/L) | solid (MEA) | 159 | 14 | 0 | 0 |
| Ethanol 12% (v/v) + Na2S2O5 (100 mg/L) | solid (MEA) | 169 | 3 | 0 | 0 |
| Ethanol 14% (v/v) + Na2S2O5 (50 mg/L) | solid (MEA) | 148 | 24 | 0 | 0 |
| Ethanol 16% (v/v) + Na2S2O5 (50 mg/L) | solid (MEA) | 163 | 9 | 0 | 0 |
| Ethanol 18% (v/v) + Na2S2O5 (50 mg/L) | solid (MEA) | 165 | 7 | 0 | 0 |
| KHSO3 (150 mg/L) | liquid (must) | 34 | 11 | 26 | 101 |
| KHSO3 (300 mg/L) | liquid (must) | 57 | 19 | 29 | 67 |
| Wine supplemented with glucose (0.5% w/v) | liquid | 103 | 45 | 24 | 0 |
| Wine supplemented with glucose (1% w/v) | liquid | 115 | 41 | 16 | 0 |
| Iprodion (0.05 mg/mL) | liquid (must) | 1 | 0 | 28 | 143 |
| Iprodion (0.1 mg/mL) | liquid (must) | 1 | 1 | 13 | 157 |
| Procymidon (0.05 mg/mL) | liquid (must) | 0 | 0 | 7 | 165 |
| Procymidon (0.1 mg/mL) | liquid (must) | 1 | 0 | 9 | 162 |
| Cycloheximide (0.05 | liquid (must) | 3 | 0 | 7 | 162 |
| Cycloheximide (0.1 | liquid (must) | 2 | 1 | 19 | 150 |
| H2S production | solid (BiGGY) | 1 | 11 | 105 | 55 |
| Galactosidase activity | liquid (YNB) | 0 | 21 | 98 | 53 |
MEA: Malt Extract Agar.
Figure 2Principal component analysis of phenotypic data for 172 strains.
(a) −30 phenotypic tests (loadings). Numbers indicate phenotypic tests, as mentioned in Table 1: (1) −30°C; (2) −18°C; (3) −40°C; (4) – pH 2; (5) – pH 8; (6) – KCl (0.75 M); (7) – NaCl (1.5 M); (8) – CuSO4 (1.5 M); (9) – SDS (0.01%); (10) – ethanol 6% (v/v) liquid medium; (11) – ethanol 10% (v/v) liquid medium; (12) – ethanol 14% (v/v) liquid medium; (13) – ethanol 12% (v/v) solid medium; (14) – ethanol 12% (v/v) solid medium + Na2S2O5 (75 mg/L); (15) – ethanol 12% (v/v) solid medium + Na2S2O5 (100 mg/L); (16) – ethanol 14% (v/v) solid medium + Na2S2O5 (50 mg/L); (17) – ethanol 16% (v/v) solid medium + Na2S2O5 (50 mg/L); (18) – ethanol 18% (v/v) solid medium + Na2S2O5 (50 mg/L); (19) – KHSO3 (150 mg/L); (20) – KHSO3 (300 mg/L); (21) – wine supplemented with glucose 0.5% (w/v); (22) – wine supplemented with glucose 1% (w/v); (23) – Iprodion (0.05 mg/mL); (24) – Iprodion (0.1 mg/mL); (25) – Procymidon (0.05 mg/mL); (26) –Procymidon (0.1 mg/mL); (27) – Cycloheximide (0.05 µg/mL); (28) – Cycloheximide (0.1 µg/mL); (29) – H2S production; (30)– galactosidase activity. (b) – 172 strains (scores) distribution. Symbols represents the strains technological applications or origin: black star – wine and vine; grey star – commercial wine strain; black square – clinical; grey square – natural isolates; black circle – sake; grey circle – other fermented beverages; black pentagon – beer; grey pentagon- baker; black rectangle – laboratory; grey rectangle – unknown biological origin.
Phenotypic tests mostly contributing for the division of strains into three clusters, in terms of information gain, obtained with k-means clustering algorithm.
| Phenotypic test | Information gain | Cluster | ||
| 1 | 2 | 3 | ||
| 18°C | 0,33 | 1 | 0 | 1 |
| 40°C | 0,33 | 2 | 0 | 3 |
| NaCl (1.5M) | 0,26 | 0 | 0 | 1 |
| KHSO3 (300 mg/L) | 0,23 | 3 | 0 | 3 |
| Ethanol 6% (v/v) – liquid medium | 0,23 | 3 | 2 | 3 |
| pH 2 | 0,21 | 0 | 0 | 1 |
| KHSO3 (150 mg/L) | 0,21 | 3 | 0 | 3 |
|
| 38 | 90 | 44 | |
Numbers in the last three columns represent the most characteristic value in terms of phenotypic class of strains included in the clusters, for the mentioned phenotypic tests.
Relevant associations (adjusted p<0.1) between phenotypic results and strain's technological application or origin, obtained using Mann-Whitney test and after Bonferroni correction.
| Phenotypic test | Class of phenotypic result | Technological group/origin | Adjusted | % of strains sharing positive association |
| Iprodion (0.05 mg/mL) | 2 | Commercial | 3.24×10−8 | 82.0 |
| Iprodion (0.05 mg/mL) | 3 | Wine and vine | 0.015 | 56.4 |
| KHSO3 (150 mg/L) | 2, 3 | Commercial | 0.001 | 59.3 |
| Wine supplemented with glucose (0.5%, w/v) | 0 | Commercial | 0.075 | 57.0 |
| Wine supplemented with glucose (0.5%, w/v) | 2 | Natural isolate | 0.002 | 87.2 |
| Wine supplemented with glucose (1%, w/v) | 2 | Natural isolate | 0.041 | 89.5 |
| Ethanol 14% (v/v) – liquid medium | 0 | Commercial | 0.004 | 64.5 |
| Cycloheximide (0.1 | 2 | Commercial | 0.007 | 75.6 |
| Procymidon (0.1 mg/mL) | 2 | Other fermented beverages | 0.005 | 92.4 |
| SDS (0.01%, w/v) | 0 | Commercial | 0.078 | 45.3 |
| CuSO4 (5 mM) | 0 | Commercial | 0.075 | 50.6 |
Percentage of strains that share the phenotypic result and belong to the described group or that didn't share the phenotypic result nor belong to that group.
Confusion matrix indicating the technological application or origin prediction of 172 strains and their predictions as obtained with naïve Bayesian classifier (AUC = 0.70).
| Predicted technological application or origin | ||||||||||||
| Total number of strains | Beer | Bread | Clinical | Commercial wine strain | Laboratory | Natural isolate | Other fermented beverages | Sake | Unknown biological origin | Wine and vine | ||
|
| Beer | 1 |
| 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| Bread | 4 | 0 |
| 0 | 0 | 0 | 3 | 0 | 0 | 0 | 1 | |
| Clinical | 9 | 0 | 0 |
| 2 | 0 | 1 | 0 | 0 | 1 | 5 | |
| Commercial wine strain | 47 | 0 | 0 | 3 |
| 0 | 2 | 1 | 0 | 0 | 5 | |
| Laboratory | 3 | 0 | 0 | 1 | 0 |
| 0 | 1 | 0 | 1 | 0 | |
| Natural isolate | 12 | 0 | 1 | 2 | 2 | 0 |
| 2 | 0 | 0 | 3 | |
| Other fermented beverages | 12 | 0 | 0 | 1 | 1 | 0 | 2 |
| 1 | 0 | 4 | |
| Sake | 6 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 0 | 2 | |
| Unknown biological origin | 4 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |
| 1 | |
| Wine and vine | 74 | 0 | 1 | 3 | 8 | 1 | 2 | 3 | 1 | 1 |
| |
Figure 3Nomogram showing naïve Bayesian classifier results for the prediction of commercial strains based on phenotypic classes of growth for each test.
(a) Performance of three phenotypic tests that contributed in a positive way to predict commercial strains; (b) Probability of predicting commercial strains when considering the entire phenotypic profile (grey circle), or only the three phenotypic tests mentioned in panel (a) by the blue dots (black circle).