| Literature DB >> 31396264 |
Emilien Peltier1,2, Anne Friedrich3, Joseph Schacherer3, Philippe Marullo1,2.
Abstract
The budding yeast Saccharomyces cerevisiae is certainly the prime industrial microorganism and is related to many biotechnological applications including food fermentations, biofuel production, green chemistry, and drug production. A noteworthy characteristic of this species is the existence of subgroups well adapted to specific processes with some individuals showing optimal technological traits. In the last 20 years, many studies have established a link between quantitative traits and single-nucleotide polymorphisms found in hundreds of genes. These natural variations constitute a pool of QTNs (quantitative trait nucleotides) that modulate yeast traits of economic interest for industry. By selecting a subset of genes functionally validated, a total of 284 QTNs were inventoried. Their distribution across pan and core genome and their frequency within the 1,011 Saccharomyces cerevisiae genomes were analyzed. We found that 150 of the 284 QTNs have a frequency lower than 5%, meaning that these variants would be undetectable by genome-wide association studies (GWAS). This analysis also suggests that most of the functional variants are private to a subpopulation, possibly due to their adaptive role to specific industrial environment. In this review, we provide a literature survey of their phenotypic impact and discuss the opportunities and the limits of their use for industrial strain selection.Entities:
Keywords: QTG; QTL; QTN; aroma; biotechnology; fermentation; variant; yeast
Year: 2019 PMID: 31396264 PMCID: PMC6664092 DOI: 10.3389/fgene.2019.00683
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Overview of identified quantitative trait genes (QTGs). (A) Proportion of genetic variants according to three main categories. Resistance to stress (RTS), central metabolism (MET), and organoleptic compound (OC). (B) Proportion of QTGs according to the context of the study in which they were identified.
Figure 2Distribution of the QTGs along the genome. Each dot represents the position of one QTG. No QTG hotspot was found (hypergeometric distribution, 1,000 permutations test, FDR = 5%).
Figure 3Enrichment of QTGs. Only significantly over-represented or under-represented categories are represented (chi-squared test, p-value < 0.05, with Bonferroni adjustment for multiple tests). The percentage of genes in the genome and in the QTGs is indicated in blue and red, respectively. F, P, and C represent molecular function, biological process, and cellular component, respectively.
Figure 4Proportion of dispensable/core gene and frequency of functional alleles. (A) Dispersion of QTGs allele frequency among 1,011 isolates. For each QTG, the frequency of the favorable alleles is used. (B) Proportion of dispensable and core genes among the pangenome (all) and among QTGs (functional).
Nature of SNP grouped by phenotypes.
| Phenotype categories | Genes involved (experimentally validated) | Number and nature of the molecular cause involved | No data | References | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Total | nsSNP | InDels | 5′ or 3′UTR positions | Translocation | STR | sSNP | ||||
| Central metabolism | ||||||||||
| Nitrogen requirement, growth rate on specific nitrogen sources, ammonium and amino acid uptake, vitamin biosynthesis |
| 29 | 18 | 2 | 2 | 0 | 0 | 0 | 7 | ( |
| Sugar catabolism, fructose, glucose, maltose, and maltotriose uptake, glucose–galactose switch, diauxic switch |
| 10 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | ( |
| Glycerol metabolism |
| 10 | 7 | 1 | 0 | 0 | 0 | 0 | 2 | ( |
| Acetic acid production |
| 4 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | ( |
| Fermentation rate and completion, |
| 18 | 15 | 1 | 1 | 1 | 0 | 0 | 0 | ( |
| Resistance to toxins and stresses | ||||||||||
| Ethanol accumulation capacity, ethanol tolerance, growth on ethanol |
| 14 | 12 | 0 | 2 | 0 | 0 | 0 | O | ( |
| High-temperature growth, temperature tolerance, low temperature adaptation, freezing tolerance |
| 18 | 14 | 0 | 2 | 0 | 0 | 1 | 1 | ( |
| Stress tolerance, oxidative, osmotic |
| 12 | 8 | 4 | 0 | 0 | 0 | 0 | 0 | ( |
| Toxins resistance: acidic, basic, phenolic compounds, SO2 |
| 31 | 12 | 0 | 4 | 2 | 0 | 3 | 11 | ( |
| Organoleptic compounds production | ||||||||||
| Acids, higher alcohols, and relative ester production |
| 19 | 19 | 0 | 0 | 0 | 0 | 0 | 0 | ( |
| Sulfur compounds |
| 9 | 5 | 1 | 0 | 1 | 0 | 0 | 2 | ( |
| Other aromas |
| 3 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | ( |
| Adhesion, flocculation, clumpiness |
| 17 | 13 | 0 | 0 | 0 | 3 | 1 | 0 | ( |
Figure 5Overview of QTG involved in nitrogen metabolism. Proteins highlighted in red are coded by allelic variants that were experimentally validated for their contribution to variation in nitrogen source consumption in S. cerevisiae. Indirect relationships are shown in dashed lines. Proteins are between brackets when only the protein of interest of a pathway is shown.
Figure 6Overview of QTG involved in organoleptic compound production. Proteins highlighted in red are coded by allelic variants that were experimentally validated for their contribution to variation in organoleptic compound production in S. cerevisiae. Indirect relationships are shown in dashed lines. Proteins are between brackets when only the protein of interest of a pathway is shown. TCA = tricarboxylic acid cycle. NCR = nitrogen catabolite repression of transcription.
Figure 7Proportion of genetic background used for QTL mapping studies. The percentage of time a genetic background was used in reported studies.