| Literature DB >> 29079624 |
Dong-Dong Yang1,2,3, Gustavo M de Billerbeck4,5, Jin-Jing Zhang2, Frank Rosenzweig3, Jean-Marie Francois1,6.
Abstract
Homology searches indicate that Saccharomyces cerevisiae strain BY4741 contains seven redundant genes that encode putative aryl-alcohol dehydrogenases (AAD). Yeast AAD genes are located in subtelomeric regions of different chromosomes, and their functional role(s) remain enigmatic. Here, we show that two of these genes, AAD4 and AAD14, encode functional enzymes that reduce aliphatic and aryl-aldehydes concomitant with the oxidation of cofactor NADPH, and that Aad4p and Aad14p exhibit different substrate preference patterns. Other yeast AAD genes are undergoing pseudogenization. The 5' sequence of AAD15 has been deleted from the genome. Repair of an AAD3 missense mutation at the catalytically essential Tyr73 residue did not result in a functional enzyme. However, ancestral-state reconstruction by fusing Aad6 with Aad16 and by N-terminal repair of Aad10 restores NADPH-dependent aryl-alcohol dehydrogenase activities. Phylogenetic analysis indicates that AAD genes are narrowly distributed in wood-saprophyte fungi and in yeast that occupy lignocellulosic niches. Because yeast AAD genes exhibit activity on veratraldehyde, cinnamaldehyde, and vanillin, they could serve to detoxify aryl-aldehydes released during lignin degradation. However, none of these compounds induce yeast AAD gene expression, and Aad activities do not relieve aryl-aldehyde growth inhibition. Our data suggest an ancestral role for AAD genes in lignin degradation that is degenerating as a result of yeast's domestication and use in brewing, baking, and other industrial applications.IMPORTANCE Functional characterization of hypothetical genes remains one of the chief tasks of the postgenomic era. Although the first Saccharomyces cerevisiae genome sequence was published over 20 years ago, 22% of its estimated 6,603 open reading frames (ORFs) remain unverified. One outstanding example of this category of genes is the enigmatic seven-member AAD family. Here, we demonstrate that proteins encoded by two members of this family exhibit aliphatic and aryl-aldehyde reductase activity, and further that such activity can be recovered from pseudogenized AAD genes via ancestral-state reconstruction. The phylogeny of yeast AAD genes suggests that these proteins may have played an important ancestral role in detoxifying aromatic aldehydes in ligninolytic fungi. However, in yeast adapted to niches rich in sugars, AAD genes become subject to mutational erosion. Our findings shed new light on the selective pressures and molecular mechanisms by which genes undergo pseudogenization.Entities:
Keywords: AKR superfamily; aryl-alcohol dehydrogenases; evolution; lignin; pseudogenization; subtelomeric
Mesh:
Substances:
Year: 2017 PMID: 29079624 PMCID: PMC5734042 DOI: 10.1128/AEM.01553-17
Source DB: PubMed Journal: Appl Environ Microbiol ISSN: 0099-2240 Impact factor: 4.792
FIG 1ScAad4p and ScAad14p have aryl-aldehyde reductase activity. Activities were assayed in morpholineethanesulfonic acid (MES) buffer (50 mM [pH 6.1]) containing 0.3 mM NADPH and 0.3 mM substrate. The standard activity of PcAad1p in reducing 3,4-dimethoxybenzaldehyde (being 5.4 μmol · min−1 · mg−1) was set at 100%. Data represent the means ± standard deviations of the results from triplicate experiments.
Yeast aldehyde reductases and their kinetic constants toward preferred substrates
| Enzyme (reference) | Cofactor preference | Aliphatic aldehyde substrate | Aryl-aldehyde substrate | |||||
|---|---|---|---|---|---|---|---|---|
| Substrate | Substrate | |||||||
| NADPH | 39 | Hexanal | NA | 247 | 3,4-Dimethoxybenzaldehyde | 12 | 530 | |
| Heptaldehyde | NA | 138 | Benzaldehyde | 1,700 | 430 | |||
| Cinnamaldehyde | 3,400 | 670 | ||||||
| Aad4 | NADPH | NA | Hexanal | NA | 172 | 3,4-Dimethoxybenzaldehyde | NA | 142 |
| Heptaldehyde | NA | 64 | 4-Nitrobenzaldehyde | NR | NR | |||
| Aad14 | NADPH | NA | Hexanal | NA | 50 | 4-Nitrobenzaldehyde | NA | 22 |
| Heptaldehyde | NA | 46 | Cinnamaldehyde | NR | NR | |||
| Aad10-35C | NADPH | NA | Hexanal | NR | NR | 4-Nitrobenzaldehyde | NR | NR |
| Heptaldehyde | NR | NR | Cinnamaldehyde | NR | NR | |||
| Aad6518G | NADPH | NA | Hexanal | NR | NR | 4-Nitrobenzaldehyde | NR | NR |
| Heptaldehyde | NR | NR | Cinnamaldehyde | NR | NR | |||
| Adh6 ( | NADPH | 29 | Hexanal | 152 | 21,270 | Cinnamaldehyde | 172 | 18,400 |
| Pentanal | 60 | 22,700 | Veratraldehyde | 73 | 15,800 | |||
| Adh7 ( | NADPH | NA | Pentanal | 49 | 11,915 | Cinnamaldehyde | 43 | 7,913 |
| 3-Methylbutanal | 48 | 9,581 | Veratraldehyde | 58 | 6,000 | |||
| Gre3 ( | NADPH | 13 | Hexanal | 3,100 | 109 | 4-Nitrobenzaldehyde | 120 | 142 |
| Gcy1 ( | NADPH | 12 | Butyraldehyde | 5,400 | 81 | 4-Nitrobenzaldehyde | 130 | 71 |
| Benzaldehyde | 5,200 | 58 | ||||||
| Ypr1 ( | NADPH | 8.7 | Hexanal | 390 | 354 | 4-Nitrobenzaldehyde | 1,070 | 1,776 |
| 2-Methylbutyraldehyde | 1,090 | 524 | 9,10-Phenanthrequinone | 2,600 | 272 | |||
| Yjr096w ( | NADPH | 370 | Butyraldehyde | 1,800 | 0.5 | 4-Nitrobenzaldehyde | 500 | 88 |
| Benzaldehyde | 4,700 | 4 | ||||||
| Ydl124w ( | NADPH | 23 | Butyraldehyde | 210,000 | 14 | 4-Nitrobenzaldehyde | 30 | 3.3 |
| Benzaldehyde | 240 | 4 | ||||||
kcat as reported or normalized based on reported Vmax. NA, data not available; NR, not reactive.
Results of this study.
FIG 2Single-base-pair substitutions, truncations, and deletions in the AAD gene family of yeast S288C. (A) Amino acid sequence alignments of PcAad1p (reference protein) and the seven putative ScAadp proteins. Blue asterisks (*) denote positions of four strictly conserved essential amino acids in PcAad1p: Asp71, Tyr76, Lys103, and His152. Structural details of the reference protein are provided in Fig. S4. Red asterisks (*) denote inferred point mutations to relative ancestral ScAAD genes. Green color denotes the truncated part of the ancestral gene. Point mutations result in (i) substitution of the conserved Tyr73 in ScAad3p, (ii) truncation of β-sheet 1 in ScAad4p, (iii) truncation of two α-helices and two β-sheets in ScAad10p, and (iv) the split of one ORF into two ORFs (ScAad6p and ScAad16p). The crossed-out ScAad16 indicates that this previously annotated ORF is a truncated part of AAD6 (in chromosome VI), not an independent ORF. The 5′ coding sequence of the ScAad15p was completely deleted from the genome of yeast lab strain S288C (in red). (B) Positions of the corresponding mutations at the nucleic acid and amino acid levels. Minus symbol (−) indicates nucleotide positions upstream of the ORFs as they were annotated in SGD at the time of submission.
Recovery of enzyme activity from pseudogenized ScAad10p and ScAad6/16p following ancestral state reconstruction
| Enzyme | ||||||
|---|---|---|---|---|---|---|
| 4-Methoxybenzaldehyde | 6.63 ± 0.12 | 87.3 ± 116 | 0.08 ± 0.01 | 157 ± 39.8 | 0.55 ± 0.02 | 217 ± 26.1 |
| Hydroxymethylfurfural | 2.40 ± 0.27 | 171 ± 36.7 | 0.37 ± 0.07 | 608 ± 246 | NR | |
| Phenylacetaldehyde | 9.00 ± 0.27 | 527 ± 72.5 | 0.30 ± 0.08 | 17.6 ± 2.20 | 0.49 ± 0.02 | 296 ± 29.6 |
Vmax (in micromoles per minute per milligram) and K (micromolar) values are shown as the mean ± standard error (SE) of the results from triplicate experiments. NR, not reactive.
AAD ORFs are highly varied among sequenced S. cerevisiae genomes
| Strain | Nucleic acid sequence similarity/amino acid sequence similarity (%) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Lab | Pathogen | Wine | Beer | Sake | |||||||
| BY4741 | Sigma1278b | YJM789 | EC1118 | AWRI1631 | AWRI796 | Lalvin QA23 | VL3 | Fosters B | Fosters O | Kyokai no. 7 | |
| 100/100 | 98.2/82.7 | 96.4/93.7 | 99.8/99.5 | 99.9/99.5 | 99.4/82.1 | 99.7/67.6 | 99.8/85.2 | NR | NR | NR | |
| 99.9/83.5 | NR | 93.2/93.9 | 94.3/84.6 | 94.4/84.3 | 94.3/84.3 | 94.2/71.3 | 94.2/84.3 | 95.9/84.3 | 94.3/84.3 | 93.4/94.7 | |
| 99.9/- | NR | NR | NR | NR | NR | NR | NR | NR | NR | 99.8/- | |
| 99.9/76.4 | 87.3/55.2 | 99.9/76.4 | 99.7/76.4 | 99.8/76.1 | 99.6/99.5 | NR | 92.6/72.9 | NR | NR | */* | |
| 100/100 | 100/100 | 99.2/99.5 | 97.1/97.6 | NR | 97.1/97.6 | 96.9/52.0 | 96.9/65.3 | 97.3/97.1 | 97.5/97.3 | 100/100 | |
| 100/100 | 100/100 | NR | 97.7/95.1 | NR | NR | NR | NR | NR | NR | NR | |
Nucleic acid/amino acid sequence similarities are relative to query sequences from lab strain S288C. NR, AAD homologs were not retrieved. *, percentage similarity not calculated when only partial sequences were found at the end of a sequencing contig with missing 3' or 5' coding sequences. (Note: high similarity in nucleic acids can result in low amino acid similarity due to ORF truncation.)
FIG 3AAD orthologs are distributed within Basidiomycota and Ascomycota fungi typically associated with plant habitats. Orthologs were identified by BLAST using yeast AAD14 as the query sequence against the NCBI database. Phylogeny of the resulting sequences was constructed using a k-mer (k = 15)-based neighbor-joining method.
FIG 4Hypothesized evolution of fungal aryl-aldehyde reductases, enzymes that detoxify lignin by-products. AAD orthologs are observed among wood and leaf litter saprophytes but not among fungi adapted to animal niches. Four hundred million years ago, split of Basidiomycota (blue branch) and the Ascomycota (red branches) (37) coincides with emergence of woody plants in the form of gymnosperms (56–58); 350 mya, divergence of S. pombe and Saccharomyces spp. (42); 150 mya, divergence of L. waltii and S. cerevisiae (44); 50 to 100 mya, adaptation to sugar-rich niches via Adh neofunctionalization (black asterisk) and ethanologenesis (22, 68, 69, 70). Green nodes denote species found in ligninolytic habitats and red nodes denote species found in nonligninolytic habitats. Text along the branches signifies AAD copy numbers and their average nucleotide similarity to ScAAD14.
Primers and plasmids for cloning, plasmid construction, and site-directed mutagenesis
| Primer | Primer sequence | Purpose(s) | Yielded plasmid(s) |
|---|---|---|---|
| AAD3_pGS_F | pGS-21a- | ||
| AAD3_pGS_R | |||
| AAD4_pGS_F | pGS-21a- | ||
| AAD4_pGS_R | |||
| AAD14_pGS_F | pGS-21a- | ||
| AAD14_pGS_R | |||
| PC_AAD_ORF1_F1_HR | GTAATTATCTACTTTTTACAACAAATATAAAACAAGATCTCGACTCTAGA | Subcloning | YEplac195PGK/CYC1-JL52-URA3- |
| PC_AAD_ORF1_R1_HR | CCAAAGGCCATCTTGGTACCGGGCCCCCCCTCGAGGTCGACGGTATCGAT | ||
| SC_AAD3_F1_HR | GTAATTATCTACTTTTTACAACAAATATAAAACAAGATCTCGACTCTAGA | Subcloning | YEplac195PGK/CYC1-JL52-URA3- |
| SC_AAD3_R1_HR | CCAAAGGCCATCTTGGTACCGGGCCCCCCCTCGAGGTCGACGGTATCGAT | ||
| SC_AAD4_F1_HR | GTAATTATCTACTTTTTACAACAAATATAAAACAAGATCTCGACTCTAGA | Subcloning | YEplac195PGK/CYC1-JL52-URA3- |
| SC_AAD4_R1_HR | CCAAAGGCCATCTTGGTACCGGGCCCCCCCTCGAGGTCGACGGTATCGAT | ||
| SC_AAD14_F1_HR | GTAATTATCTACTTTTTACAACAAATATAAAACAAGATCTCGACTCTAGA | Subcloning | YEplac195PGK/CYC1-JL52-URA3- |
| SC_AAD14_R1_HR | CCAAAGGCCATCTTGGTACCGGGCCCCCCCTCGAGGTCGACGGTATCGAT | ||
| PcAad1Tyr76mut2Cys_F1 | AACTTCATTGATACCGCTAATGTC | pGS-21a- | |
| PcAad1Tyr76mut2Cys_R1 | AAATTCCTCGGATGTCTCGTCTTG | ||
| ScAad3MutCys2Tyr_BamHI_A1 | ATCGCGC | pGS-21a- | |
| ScAad3MutCys2Tyr_B1 | CCATTCTTCTGATTGCTCGTTTTGG | ||
| ScAad3MutCys2Tyr_C1 | AATTTCATTGATGCCGCAAACAACT | ||
| ScAad3MutCys2Tyr_XhoI_D1 | ATCGCCG | ||
| ScAad6InserG_F1_A | ATCGATCGCGC | Fusion of | pGS-21a-ScAad6518G |
| ScAad6InserG_R1_B | AGACACACCCAAATAGAGGACCTTG | ||
| ScAad6InserG_F2_C | AGTTTACACATTCTAGTGCAGCAGG | ||
| ScAad6InserG_R2_D | ATCGATCGCCG | ||
| ScAad10Mute_F1_A | ATCGATCGCGC | pGS-21a- | |
| ScAad10Mute_R1_B | ATCCAAGTCTCTGACTGCTCATACT | ||
| ScAad10Mute_F2_C | GGAAATTTCATTGATACTGCAAATAATTATCAGTATGAGCAGTCAGAGACTTGGAT | ||
| ScAad10Mute_R2_D | ATCGATCGCCG |
Italics indicate the enterokinase-coding sequence, boldface indicates start and stop codons, and underlining indicates restriction sites.
List of primers for quantitative PCR for expression studies