| Literature DB >> 35261926 |
Bo He1, Xue Bai1, Yumeng Tan1, Wentao Xie1, Yan Feng1, Guang-Yu Yang1.
Abstract
UDP-Glycosyltransferases (UGTs) catalyze the transfer of nucleotide-activated sugars to specific acceptors, among which the GT1 family enzymes are well-known for their function in biosynthesis of natural product glycosides. Elucidating GT function represents necessary step in metabolic engineering of aglycone glycosylation to produce drug leads, cosmetics, nutrients and sweeteners. In this review, we systematically summarize the phylogenetic distribution and catalytic diversity of plant GTs. We also discuss recent progress in the identification of novel GT candidates for synthesis of plant natural products (PNPs) using multi-omics technology and deep learning predicted models. We also highlight recent advances in rational design and directed evolution engineering strategies for new or improved GT functions. Finally, we cover recent breakthroughs in the application of GTs for microbial biosynthesis of some representative glycosylated PNPs, including flavonoid glycosides (fisetin 3-O-glycosides, astragalin, scutellarein 7-O-glucoside), terpenoid glycosides (rebaudioside A, ginsenosides) and polyketide glycosides (salidroside, polydatin).Entities:
Keywords: Biosynthesis; Enzyme mining; Glycosylated plant natural products; Glycosyltransferases; Protein engineering
Year: 2022 PMID: 35261926 PMCID: PMC8883072 DOI: 10.1016/j.synbio.2022.01.001
Source DB: PubMed Journal: Synth Syst Biotechnol ISSN: 2405-805X
Fig. 1Representative substrates of functionally characterized GTs in the synthesis of glycosylated PNPs. Chemical groups that are typically glycosylated were colored red.
Fig. 2Phylogenetic tree analysis of the characterized plant GTs from the GT1 family. The substrate specificities of the characterized plant GTs were depicted by different colors in peripheral circle. The glycosidic bond catalytic by the characterized plant GTs were depicted by different colors in interlayer circle. The characterized plant GTs with co-crystal structures reported were highlighted by red lines. GTs mentioned in the text were marked and numbered in red.
Fig. 3The workflow for GT mining and application in biosynthesis of glycosylated PNPs.
The recent five years characterized GTs identified by high-throughput sequencing strategies.
| Enzyme Name | Accession No. | Taxonomy | Systems biology tool | Reference |
|---|---|---|---|---|
| UGT75D1 | Genomic | [ | ||
| UGT71C3 | Genomic | [ | ||
| UGT1 | QGI57841.1 | Genomics | [ | |
| MdUGT88F1 | Genomics | [ | ||
| UGT84A23 | Genomics | [ | ||
| MdPh-4-OGT | Genomics | [ | ||
| DcUCGalT1 | Genomics | [ | ||
| GuCGTa, GuCGTb | QLF98865.1, QLF98866.1 | Genomic | [ | |
| UGT72AD1, UGT72AH1, | Genomic | [ | ||
| MdP2GT | Genomic | [ | ||
| MdUGT88F, MdUGT88F4 | Malus domestica | Genomic | [ | |
| MdPh-4-OGT | Genomic | [ | ||
| AgUCGalT1 | Transcriptomic | [ | ||
| UGT71B5 | Transcriptomic | [ | ||
| UGT76C1, UGT76C2 | Transcriptomic | [ | ||
| UGT76E12 | Transcriptomic | [ | ||
| UGT85A1 | Trancsriptomic | [ | ||
| AeCGTb, AeCGTa | QLF98868.1, QLF98867.1 | Transcriptomic | [ | |
| CsUGT73A20, CsUGT75L12, CsUGT78A14, CsUGT78A15 | Transcriptomic | [ | ||
| UGT84A57 | Transcriptomic | [ | ||
| GgCGT | QGL05036.1 | Transcriptomic | [ | |
| Pn1-31, Pn3-29, Pn3-31, Pn3-32 | QOJ43864.1, QOJ43865.1, QOJ43866.1, QOJ43868.1 | Transcriptomic | [ | |
| RyUGT3A | QSB46663.1 | Transcriptomic | [ | |
| RsUGT75L20, RsUGT75T4 | Transcriptomic | [ | ||
| UGT76E2 | Transcriptomic | [ | ||
| TcCGT1 | QCZ42162.1 | Transcriptomic | [ | |
| GuGT1 | QDM38894.1 | Transcriptomic | [ | |
| ApUFGT1 | QDA11331.1 | Transcriptomic | [ | |
| CcUGT3 | QDH43895.1 | Transcriptomic | [ | |
| Sb3GT1 | QBL54224.1 | Transcriptomic | [ | |
| PgUGT95B2 | Transcriptomic | [ | ||
| OcUGT1 | Transcriptomic | [ | ||
| Pq3-O-UGT1 | Transcriptomic | [ | ||
| Pq3-O-UGT2 | Transcriptomic | [ | ||
| UGT72B3 | Transcriptomic | [ | ||
| UGT73F17 | Transcriptomic | [ | ||
| EpPF3RT | MG264429.1 | Transcriptomic | [ | |
| UGT76E11 | Transcriptomic | [ | ||
| PgUGT71A27 | A0A0A7HB61.1 | Transcriptomic | [ | |
| UGT75B2 | Genomic & Trancsriptomic | [ | ||
| ChCGT, CuCGT, FcCGT | Genomic & Trancsriptomic | [ | ||
| CtUGT1 | MW629113 | Genomic and transcriptomic | [ | |
| GmSGT2 | Genomic & Transcriptomic | [ | ||
| UGT71K3 | Genomics & Transcriptomic | [ | ||
| Pq-PPT-6/20-O-UGT, Pq-PPT-6-O-UGT | QEV87497.1, QEV87498.1 | Genomic and Transcriptomic | [ | |
| PsCGTa, PsCGTb | QLF98871.1, QLF98872.1 | Genomic and Transcriptomic | [ | |
| SbGT30, SbGT34 | Genomic & Trancsriptomic | [ | ||
| UGT90F1, UGT73B26, UDPG1 | Genomic and Transcriptomic | [ | ||
| ZmCGTb | QLF98873.1 | Genomic and Transcriptomic | [ | |
| UGT99D1 | Transcriptomic & Protemic | [ | ||
| UGT74H5, UGT74H6 | Transcriptomic & Protemic | [ | ||
| UGT73AD1 | Transcriptomic and Proteomic | [ | ||
| GmSSAT1 | Transcriptomic & Proteomic | [ |
The structure-based rational design and directed evolution of GTs that involved in the biosynthesis of glycosylated plant natural products.
| Protein name | Organism | Genbank | PDB code | Representative schematic reaction | Key residues | Engineering | Reference |
|---|---|---|---|---|---|---|---|
| YjiC | 7BOV | Ser277 is critical for Nucleoside Diphosphate (NDP) recognition Glu317 | V108A increase 5-fold for UDP glycosylation activity and improve 35% for pterostilbene glycosylation under the existence UDPG. L320A improve 2-fold for | [ | |||
| UGT72B1 | 2VCE | His19 is positioned to act as a Brønsted base Gln389 and Glu388 interact with the glucose moiety of donors Glu83, Ile86, Leu118, Phe119, Phe148, Leu183, and Leu197 are predominant in the acceptor binding | – | [ | |||
| UGT74F2 | 5U6M | His18 shows a central role in catalysis Tyr180 is important for ligand recognition or binding Met274 could be crucial for orientation of the salicylic acid | – | [ | |||
| UGT89C1 | 6IJ7 | Asp356, His357, Pro147 and Ile148 are key residues for sugar donor recognition and specificity for UDP-β- His21 is a key residue as the catalytic base and the only catalytic residue involved in catalysis. | H357Q exhibited activity with both UDP-β- | [ | |||
| UGT78K6 | 3WC4 | Pro78, Asp181 and Asp367 are involved in the acceptor binding. Asn137 could participate in the recognition of the glucose moiety. His17 is the key catalytic residue. | – | [ | |||
| UGT708C1 | 6LLG | Asp382, Gln383, Thr151 and Thr150 play important role in the recognition of sugar moiety. Phe130, Tyr102 and Phe198 bound and stabilize the acceptor. Arg280 and Asp96 play important roles in the catalytic activity. | – | [ | |||
| GgCGT | QGL05036.1 | 6L5P | R285, T145, D390 and Q391 determine the sugar donor preference. The flopropione unit is the minimum required unit for the di- The spacious substrate-binding tunnel near G389 is critical for the di- | G389K mutation could switch di- to mono-C-glycosylation | [ | ||
| UGT73P12 | 7C2X | Arg32 is the essential residue to provide high specificity for UDP-glucuronic acid. | – | [ | |||
| LpCGTa | QLF98869.1 | 6LG1 | – | – | [ | ||
| LpCGTb | QLF98870.1 | 6LFN | – | – | [ | ||
| SbCGTa | QLF98861.1 | 6LG0 | His24 is critical to initiate the catalytic reaction through deprotonation of the substrate. | – | [ | ||
| SbCGTb | QLF98862.1 | 6LFZ | His23 is critical to initiate the catalytic reaction through deprotonation of the substrate. | R94 M/I143 M/V144T/T145S/H194D/G275T/P374Q mutant had switched the function of SbCGTb to SbCGTa | [ | ||
| UGT708A6 | 6LF6 | – | – | [ | |||
| UGT71G1 | 2ACV | His22 is the catalytic base. Asp121 is a key residue that may assist deprotonation of the acceptor. Glu381 is the key residue in recognition of the sugar donor. | – | [ | |||
| UGT85H2 | 2PQ6 | His21 and Asp125 are essential for catalytic activity. | – | [ | |||
| UGT78G1 | 3HBF | Glu192 is the key residue for the reverse reaction. His26 act as the catalytic residue. Asp124 also plays an essential role in catalysis. | – | [ | |||
| Os79 | 5TMB | His27 activate the trichothecene O3 hydroxyl for nucleophilic attack at C1’ of the UDP-glucose donor. Thr291 plays a critical role in catalysis as a catalytic acid or to position the UDP moiety during the nucleophilic attack. | – | [ | |||
| PtUGT1 | 5NLM | E88 could play a major role in indoxyl specificity and turnover. H26 is expected to be the Brønsted base. D122 is believed to balance the charge on the catalytic histidine. | – | [ | |||
| PaGT2 | 6JEL | His18 and His81 are recognized as the catalytic residues. | C142A and C142F mutants formed resveratrol 3-O-β-glucoside and resveratrol 4′-O-β-glucoside, respectively, with high regioselectivity. | [ | |||
| PaGT3 | 6LZX | His20 is the active-site residue. Trp417 and Arg419 are actively participate in the formation of the acceptor-binding pocket. | – | [ | |||
| UGT51 | 5GL5 | Asp752 serve as a catalytic base. Met851 is important for UGT51 activity. Gln1094, Asp1093 and Ser1072 make several critical interactions with the glucose moiety of donor. | A mutant M7_1 (S81A/L82A/V84A/K92A/E96K/S129A/N172D) presented an ∼1800-fold activity improvement toward an unnatural substrate protopanaxadiol. | [ | |||
| UGT74AC1 | 6L8W | His18 is the general base abstracts a proton from the 3-hydroxyl group of mogrol. Asp111 stabilizes the catalytic conformation and balance the charge. | Mutant M4 (T79Y/R28H/L48 M/L109I) showed ∼200-fold higher activity than WT. Mutant M5 (T79Y/R28H/L48 M/L109I/S15A) showed ∼3.8-fold higher than M4. | [ | |||
| UGT74AC2 | 7BV3 | The uracil ring forms hydrogen bonds and parallel π-stacking interactions with A353 and W352, respectively, and the ribose ring interacts with the enzyme through hydrogen bonds with E378 and Q355, while the α-phosphate forms hydrogen bonds with H370, N374 and S375 The acceptor binding pocket is constructed by 13 residues, where P12, L43 and V91 are located at the entrance, and V190, M196 as well as L200 are situated at the bottom | Mutant G11Y is found that shows 75% selectivity and >99% conversion towards silybin A-3,7-O-diglucoside. Three variants show enhanced regioselectivity toward silybin A-7-O-glucoside, P12Y (81% selectivity and 68% conversion), L200W (92% selectivity and 61% conversion) and Y145W (89% selectivity and 75% conversion) | [ | |||
| UGT76G1 | 6INF | His25 is the general base, which deprotonates the 3-hydroxyl of the accepting glucose A to activate it as a nucleophile. Asp124 plays an important catalytic role in relaying protons off and on His25. | – | [ | |||
| TcCGT1 | QCZ42162.1 | 6JTD | H24 acts to stabilize both the deprotonated substrate and the product sugar, though it is not indispensable for the glycosylation activity. E396 plays an important role to stabilize and orient the UDP–Glc sugar. | I94E and G284K switch C- to O-glycosylation. | [ | ||
| VvGT1 | 2C1X | Asp374, Gln375 and Thr141 are key players in sugar recognition. | – | [ | |||
| UGTPg45 | – | – | A mutant UGTPg45-HV with two missense mutations (Q222H and A322V) gave 70% increase of ginsenoside Rh2 yield | [ |
The recent glycosylated plant natural products synthesis by microbial sources.
| Compound | Microbial sources | GTs | Titer | Reference |
|---|---|---|---|---|
| Cyanidin 3-O-glucoside | 3 GT | 0.35 g/L | [ | |
| Luteolin-7-O-glucuronide, Quercetin-3-O-glucuronide, Quercetin 3-O-galactoside | AmUGT10, VvUGT, PhUGT | 0.30, 0.69, 0.28 g/L | [ | |
| Fisetin 3-O-glucoside, Fisetin 3-O-rhamnoside | UGT78K1, | 0.39, 0.34 g/L | [ | |
| Quercetin 3-O-galactoside, Quercetin 3-O-rhamnoside | RhaGT | 0.94, 1.12 g/L | [ | |
| Scutellarein 7-O-glucoside | SbGT34 | 1.20 g/L | [ | |
| Kaempferol 3-O-glucoside | AtUGT78D2 | 3.60 g/L | [ | |
| Rebaudioside A | UGT76G1 | 1.16 g/L | [ | |
| Tyrosol glucoside | UGT72B14 | 6.7 mg/L | [ | |
| Ginsenoside Rh1, Ginsenoside F1 | UGTPg1, UGTPg100 | 0.10, 0.04 g/L | [ | |
| Ginsenoside Rh2, Ginsenoside Rg3 | UGTPg29, UGTPg45 | 0.02, 0.05 g/L | [ | |
| Ginsenoside Rh2 | UGT51 | 0.30 g/L | [ | |
| 3β,12β-Di-O-Glc-PPD, -PPT, DM | UGT109A1 | 9.05, 4.57, 11.5 mg/L | [ | |
| 3β-O-Glc-DM | UGT74AE2 | 5.60 g/L | [ | |
| Ginsenoside Rh2, PPD, DM | UGTPn50 | 2.25, 9.05, 8.09 g/L | [ | |
| Ginsenoside Compound K | UGTPg1 | 0.16 g/L | [ | |
| Ginsenoside Compound K | UGTPg1 | 5.74 g/L | [ | |
| Ginsenoside Rg1, Notoginsenoside R1, Notoginsenoside R2 | PgUGT71A53, PgUGT94Q13, PgUGT71A54 | 1.95, 1.62, 1.25 g/L | [ | |
| Crocetin | YjiC, YdhE, YojK | 4.42 mg/L | [ | |
| Kaempferol, astragalin | AtUGT78D2 | 1.18, 1.74 g/L | [ | |
| Geranyl glucoside | VvGT14a | 0.93 g/L | [ | |
| Glycyrrhizin and Glycyrrhetic acid 3-O-mono-β-D-glucuronide | UGT1A1 | 5.98, 2.31 mg/L | [ | |
| Polydatin | PcR3GAT | 0.55 g/L | [ |