| Literature DB >> 31070742 |
Guilherme Borelli1, Mateus Bernabe Fiamenghi1, Leandro Vieira Dos Santos2, Marcelo Falsarella Carazzolle1,2, Gonçalo Amarante Guimarães Pereira1,2, Juliana José1.
Abstract
Xylose assimilation and fermentation are important traits for second generation ethanol production. However, some genomic features associated with this pentose sugar's metabolism remain unknown in yeasts. Comparative genomics studies have led to important insights in this field, but we are still far from completely understanding endogenous yeasts' xylose metabolism. In this work, we carried out a deep evolutionary analysis suited for comparative genomics of xylose-consuming yeasts, searching for of positive selection on genes associated with glucose and xylose metabolism in the xylose-fermenters' clade. Our investigation detected positive selection fingerprints at this clade not only among sequences of important genes for xylose metabolism, such as xylose reductase and xylitol dehydrogenase, but also in genes expected to undergo neutral evolution, such as the glycolytic gene phosphoglycerate mutase. In addition, we present expansion, positive selection marks, and convergence as evidence supporting the hypothesis that natural selection is shaping the evolution of the little studied methylglyoxal reductases. We propose a metabolic model suggesting that selected codons among these proteins caused a putative change in cofactor preference from NADPH to NADH that alleviates cellular redox imbalance. These findings provide a wider look into pentose metabolism of yeasts and add this previously overlooked piece into the intricate puzzle of oxidative imbalance. Although being extensively discussed in evolutionary works the awareness of selection patterns is recent in biotechnology researches, rendering insights to surpass the reached status quo in many of its subareas.Entities:
Keywords: Saccharomycotina; comparative genomics; natural selection; phylogenetics; xylitol dehydrogenase; xylose reductase
Mesh:
Substances:
Year: 2019 PMID: 31070742 PMCID: PMC6637916 DOI: 10.1093/gbe/evz036
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
List of Chosen Species for Comparative Analysis
| Collections | ||||||||
|---|---|---|---|---|---|---|---|---|
| Species | Prefix | Xyl | Genome Size (Mb) | # Genes | Source | ATCC | CBS | References |
|
| Aa | + | 11.8 | 6,119 | JGI | Y-17692 K | 8244 |
|
|
| Ca | + | 13.2 | 5,861 | JGI | YB-2248 | 8468 |
|
|
| Cb | + | 19.4 | 5,978 | NCBI | Y-2332 K | 2428 |
|
|
| Cs | + | 11.9 | 5,231 | NCBI | Y-17909 K | 7871 |
|
|
| Cn | − | 13.1 | 5,895 | JGI | Y-17324 K | 7422 |
|
|
| Ce | + | 10.7 | 5,533 | JGI | Y-1498 K | 615 |
|
|
| Ct | + | 14.6 | 6,254 | NCBI | Y-12968 K | 94 |
|
|
| Db | − | 13.4 | 5,636 | JGI | Y-12961 K | 74 |
|
|
| Ka | − | 11.1 | 5,378 | JGI | Y-8276 | 2517 |
|
|
| Kl | +/− | 10.7 | 5,076 | JGI | Y-8279 K | 683 |
|
|
| Ls | + | 21.3 | 8,192 | JGI | Y-11557 K | 1807 |
|
|
| Pp | −/+ | 9.2 | 5,040 | JGI | Y-1603 K | 704 |
|
|
| Os | + | 15.4 | 5,841 | JGI | Y-7124 K | 5773 |
|
|
| Sc | − | 12.1 | 6,575 | JGI | Y-12632 N K | 1171 |
|
|
| So | − | 12.6 | 5,134 | JGI | Y-12796 K | 356 |
|
|
| Sp | + | 13.2 | 5,983 | JGI | Y-27907 | 10155 |
|
|
| Wa | + | 14.1 | 6,423 | JGI | Y-8168 | 5759 |
|
|
| Yl | − | 20.5 | 6,447 | JGI | YB-423 K | 6124 |
|
Species names prefixes.
Information about xylose consumption retrieved from CBS-KNAW website (+ stands for detected xylose consumption, − for no detection, +/− and −/+ for variation among strains, being the first sign the description of the used strain).
Genome attributes and yeasts type strains of two international collections.
. 1.—Maximum likelihood species phylogenetic inference for 18 yeasts. S. pombe was used as outgroup of the 17 species from Saccharomycotina subphylum. Tree was reconstructed in RAxML using 1,255 single-copy ortholog groups and branch supports were obtained using 5,000 bootstrap randomizations. The group referred to as fermenters’ clade is highlighted. Phylogeny is scaled in substitution per positions.
. 2.—Rate of gene gains and losses along species of Saccharomycotina. Inferences were made by maximum likelihood estimates based on gene birth and death models of evolution. Branch rates were calculated as the difference ratio between gene gains and losses to its inferred ancestor gene number. The tree branches’ width and colors indicate the percentage of gene gain/loss as shown in caption. Two families were significantly expanded in the fermenters’ ancestral branch, shown by the dark gray box. The actual number of genes is indicated in the side table. Light gray box highlights xylose-fermenters. Phylogeny is scaled in substitutions per site.
. 3.—Methylglyoxal Bayesian gene tree inference (fam7 and fam167). Columns show amino acid alignment for sites which have evidence for positive selection or parallel evolution. Highlighted boxes represent clades with positive selection evidence on fermenters’ group. Red gene names indicate gene copies from fam167, and red amino acids indicate that for this gene the respective amino acid is distinct from others at the alignment site. Amino acids with convergence between families on the alignment are shown in orange. Branches are colored relative to their posterior probabilities. Tree is scaled by substitutions per site.
Characterization of Enzymes from Different Metabolic Pathways Associated with Xylose Fermentation
| Metabolic Pathway | Enzyme | Family | Omega (d | Number of Genes | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| m0 | m2 | Wa | Kl | Sc | Ka | Ca | Db | Cb | Pp | Cs | Ct | Sp | Cn | Ps | Ce | Yl | Aa | Ls | So | |||
| Xylose | Xylose reductase (XR) | 634 | 0.13 | 18.64 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 2 | 0 |
| Xylitol dehydrogenase (XDH) | 33 | 0.12 | 1.00 | 1 | 2 | 2 | 2 | 1 | 1 | 2 | 1 | 3 | 2 | 2 | 3 | 3 | 4 | 5 | 2 | 3 | 2 | |
| 34 | 0.08 | 1.00 | 6 | 1 | 1 | 1 | 2 | 4 | 2 | 1 | 2 | 2 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 2 | ||
| Xylulose kinase (XK) | 578 | 0.06 | 1.00 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Glycose | Glucokinase (GLK) | 436 | 0.06 | 1.00 | 1 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Glucose-6P-isomerase (GPI) | 2728 | 0.08 | 11.32 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 6-Phosphofructokinase (PFK) | 86 | 0.07 | 1.00 | 2 | 2 | 2 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | |
| Fructose-1, 6-bisphosphatase (FBP) | 1995 | 0.04 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Fructose-bisphosphate-aldolase (FBA) | 828 | 0.14 | 999.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| GA3P-dehydrogenase (TDH) | 160 | 0.12 | 1.00 | 1 | 2 | 3 | 3 | 1 | 2 | 1 | 1 | 0 | 1 | 1 | 1 | 3 | 2 | 1 | 1 | 1 | 2 | |
| Phosphoglycerate kinase (PGK) | 570 | 0.11 | 3.01 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Phosphoglycerate mutase (PGM) | 807 | 0.05 | 3.01 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 1849 | 0.04 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| Phosphopyruvate hydratase (ENO) | 175 | 0.11 | 5.08 | 2 | 1 | 5 | 2 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | |
| Pyruvate kinase (PYK) | 728 | 0.09 | 1.00 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Glycerol | Glycerol kinase (GUT) | 2636 | 0.04 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 |
| Glycerol 3-phosphate (GPP) | 3186 | 0.07 | 1.00 | 1 | 1 | 2 | 2 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 2 | 1 | 1 | 0 | 2 | 1 | 0 | |
| Glycerol 3P-dehydrogenase (GPDH) | 159 | 0.08 | 1.00 | 2 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 2 | |
| Methylglyoxal | Triose-phosphate isomerase (TPI) | 2006 | 0.11 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Lactoyl-glutathione-lyase (GLO1) | 2421 | 0.05 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Hydroxyacylglutathione hydrolase (GLO2) | 425 | 0.03 | 1.00 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | |
| D-lactate dehydrogenase (DLD) | 45 | 0.05 | 1.00 | 5 | 3 | 1 | 0 | 3 | 4 | 3 | 1 | 2 | 2 | 1 | 2 | 3 | 3 | 1 | 2 | 1 | 0 | |
| 556 | 0.06 | 1.00 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| Methylglyoxal reductase (MGR) | 7 | 0.19 | 1.00 | 8 | 2 | 4 | 3 | 2 | 0 | 5 | 6 | 3 | 6 | 4 | 7 | 3 | 2 | 9 | 4 | 1 | 0 | |
| 7_167 | 0.11 | 10.95 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 17 | 1 | 5 | 0 | 0 | 0 | 0 | 1 | ||
| Ethanol | Pyruvate decarboxylase (THI–PDC) | 64 | 0.09 | 1.00 | 1 | 1 | 4 | 3 | 2 | 1 | 2 | 1 | 2 | 3 | 2 | 3 | 2 | 2 | 1 | 2 | 1 | 2 |
| 1974 | 0.07 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| Aldehyde dehydrogenase (ALD) | 35 | 0.13 | 186.16 | 5 | 3 | 3 | 3 | 2 | 3 | 2 | 3 | 1 | 3 | 1 | 2 | 1 | 1 | 2 | 1 | 3 | 1 | |
| 128 | 0.07 | 1.00 | 2 | 2 | 2 | 1 | 3 | 3 | 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 1 | 1 | ||
| 223 | 0.10 | 1.00 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 4 | 2 | 0 | 0 | ||
| Alcohol dehydrogenase (ADH) | 23 | 0.12 | 1.00 | 5 | 4 | 4 | 2 | 3 | 3 | 3 | 1 | 2 | 4 | 1 | 2 | 2 | 1 | 4 | 1 | 1 | 1 | |
| 149 | 0.08 | 10.13 | 1 | 1 | 1 | 4 | 3 | 2 | 5 | 1 | 1 | 1 | 1 | 2 | 2 | 2 | 0 | 0 | 1 | 0 | ||
| 6178 | 0.13 | – | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | ||
| 508 | 0.06 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | ||
| PPP (oxidative) | Glucose-6-phosphate dehydrogenase (ZWF) | 836 | 0.09 | 1.00 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 6-phosphogluconolactonase (SOL) | 376 | 0.06 | 1.00 | 1 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 3366 | 0.09 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | ||
| 6-phosphogluconate dehydrogenase (GND) | 422 | 0.08 | 1.00 | 1 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| PPP (nonoxidadive) | Ribose-5-phosphate isomerase (RPI) | 296 | 0.04 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 3 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Ribulose-phosphate 3-epimerase (RPE) | 3347 | 0.06 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Transketolase (TKL) | 79 | 0.12 | 1.00 | 2 | 1 | 2 | 2 | 3 | 1 | 3 | 3 | 1 | 1 | 1 | 1 | 2 | 3 | 1 | 3 | 3 | 1 | |
| Sedoheptulose-7-phosphate (TAL/NQM) | 1014 | 0.10 | 15.14 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| PPP (accessory) | Ribose phosphate diphosphokinase (PRS) | 540 | 0.03 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 824 | 0.03 | 1.00 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 2251 | 0.04 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 2882 | 0.02 | 32.70 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | ||
| Ribokinase (RBK) | 1802 | 0.05 | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Control | 628 single-copy orthologs | 0.05 (+/−0, 02) | 1.00 (95% = 1) | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
Gene groups retrieved from OrthoMCL analysis.
Values for CODEML model 2 and gene copies are shown in a color gradient from low value/copies (gray) and high value/copies (red).
Omega mean and standard deviation for model 0, and mean and fourth quartile for model 2, calculated for 628 single-copy orthologs families.
. 4.—Phylogenetic inference for 179 genes (not shown) from 3 gene families of putative xylose reductase (black dots) genes and 4 families of xylitol dehydrogenase genes (gray dots), carried out with maximum likelihood inference on RAxML with branch support obtained with 1,000 bootstraps. Tree is scaled in substitutions per site.
. 5.—Xylose reductase Bayesian gene tree inference of fam634. Posterior probabilities are shown above the branches. The three columns show amino acid alignment for sites which have positive selection evidence. Red amino acids are distinct when compared with other species. Highlighted boxes represent clades with positive selection evidence on fermenter groups. Tree is scaled by substitutions per site.
. 6.—Inferred probabilities of evolutionary models along protein sequences. Plots of posterior probabilities (in y axis) for each tested evolutionary model (negative selection, neutral evolution, positive selection) in codeml model 2, for each protein site (x axis). Sites with higher probability than 0.95 for positive selection are indicated with the model code above graphs. Cofactor binding sites and proton donor sites known for S. cerevisiae are also shown.
. 7.—Saccharomyces cerevisiae metabolic pathways related to xylose and glucose consumption. Enzymes are colored according to omega (dN/dS, CODEML model 2, NsSites 1–2) values (light red—low values, dark red—high values). (A) and (B) refer to reactions where the cofactor preference is modified in S. passalidarum or S. stipitis. (C) refers to methylglyoxal reductase hypothetical cofactor preference change in paralog proteins with evidence for positive selection in our analysis.