| Literature DB >> 29463020 |
Jia Li1, Qin Liang2,3, Changfu Li4, Mengdi Liu5,6, Yansheng Zhang7.
Abstract
Dioscorea zingiberensis is a perennial herb native to China. The rhizome of D. zingiberensis has long been used as a traditional Chinese medicine to treat rheumatic arthritis. Dioscin is the major bioactive ingredient conferring the medicinal property described in Chinese pharmacopoeia. Several previous studies have suggested cholesterol as the intermediate to the biosynthesis of dioscin, however, the biosynthetic steps to dioscin after cholesterol remain unknown. In this study, a comprehensive D. zingiberensis transcriptome derived from its leaf and rhizome was constructed. Based on the annotation using various public databases, all possible enzymes in the biosynthetic steps to cholesterol were identified. In the late steps beyond cholesterol, cholesterol undergoes site-specific oxidation by cytochrome P450s (CYPs) and glycosylation by UDP-glycosyltransferases (UGTs) to yield dioscin. From the D. zingiberensis transcriptome, a total of 485 unigenes were annotated as CYPs and 195 unigenes with a sequence length above 1000 bp were annotated as UGTs. Transcriptomic comparison revealed 165 CYP annotated unigenes correlating to dioscin biosynthesis in the plant. Further phylogenetic analysis suggested that among those CYP candidates four of them would be the most likely candidates involved in the biosynthetic steps from cholesterol to dioscin. Additionally, from the UGT annotated unigenes, six of them were annotated as 3-O-UGTs and two of them were annotated as rhamnosyltransferases, which consisted of potential UGT candidates involved in dioscin biosynthesis. To further explore the function of the UGT candidates, two 3-O-UGT candidates, named Dz3GT1 and Dz3GT2, were cloned and functionally characterized. Both Dz3GT1 and Dz3GT2 were able to catalyze a C3-glucosylation activity on diosgenin. In conclusion, this study will facilitate our understanding of dioscin biosynthesis pathway and provides a basis for further mining the genes involved in dioscin biosynthesis.Entities:
Keywords: Dioscorea zingiberensis; cytochrome P450; dioscin biosynthesis; glycosyltransferase; transcriptome
Mesh:
Substances:
Year: 2018 PMID: 29463020 PMCID: PMC6017347 DOI: 10.3390/molecules23020454
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Putative pathway for dioscin biosynthesis in D. zingiberensis. Abbreviations used are: AACT, acetyl CoA acetyltransferase; HMGS, hydroxymethyl glutaryl CoA synthase; HMGR, 3-hydroxy-3-methylglutaryl CoA reductase; MK, mevalonate kinase; PMK, phosphomevalonate kinase; MDC, mevalonate diphosphosphate decarboxylase; IPI, IPP diphosphateisomerase; DXS, 1-deoxy-D-xylulose-5-phosphate synthase; DXR, 1-deoxy-D-xylulose-5-phosphate reductoisomerase; MCT, 2-C-methyl-D-erythritol 4-phosphate cytidylyl transferase; CMK, 4-diphospho-cytidyl-2-C-methyl-D-erythritol kinase; MECPS, 2-C-methyl-D-erythritol-2, 4-cyclodiphosphate synthase; HDS, 4-hydroxy-3-methylbut-2-(E)-enyl diphosphate synthase; HDR, 4-hydroxy-3-methylbut-2-enyldiphosphate reductase; GPPS, geranyl diphosphate synthase; FPPS, farnesyl diphosphate synthase; SS, squalene synthase; SE, squalene epoxidase; CAS, cycloartenol synthase; SSR, sterol side chain reductase; SMO1, sterol 4a–methyl oxidase 1; SMO2, sterol 4a-methyl oxidase 2; CPI, cycloeucalenol cycloisomerase; CYP51, sterol C-14 demethylase; C14-R, sterol C-14 reductase; 8,7-SI, sterol 8,7-isomerase; C5-SD, sterol C-5(6) desaturase; 7-DR, 7-dihydrocholesterol reductase.
Figure 2Phytochemical analysis of the leaf and rhizome tissues of D. zingiberensis. (A,B) representative pictures are shown for the D. zingiberensis plants collected in August (A) and November (B) of 2015; (C), the content of diosgenin and dioscin in the different plant parts, including the leaves harvested in August (Aug_L), the rhizomes harvested in August (Aug_R), and the rhizomes harvested in November (Nov_R). Two biological replicates were included for this experiment. Different letters on the bar indicate a significant difference at p < 0.05 based on generalized linear models with Bonferroni multicomparison tests.
Summary of the sequencing data.
| Item | Sample | Number | Q20(%) | Q30(%) | GC(%) | N50(bp) |
|---|---|---|---|---|---|---|
| Raw read | Aug_L | 55,798,190 | ND | ND | ND | ND |
| Aug_R | 45,367,360 | ND | ND | ND | ND | |
| Nov_R | 42,564,054 | ND | ND | ND | ND | |
| Clean read | Aug_L | 53,330,426 | 97.68 | 94.42 | 49.52 | ND |
| Aug_R | 43,234,636 | 97.68 | 94.44 | 47.54 | ND | |
| Nov_R | 40,686,226 | 97.72 | 94.51 | 47.87 | ND | |
| Total | Unigenes | 143,245 | ND | ND | ND | 814 |
ND: no data.
Transcriptomic comparison of putative cholesterol pathway genes across different tissues of D. zingiberensis.
| Enzyme Name | Gene ID | FPKM | FPKM | FPKM | log2ratio | q-Value | log2ratio | q-Value | |
|---|---|---|---|---|---|---|---|---|---|
| MVA | |||||||||
| AACT-1 | c46893_g1 | 4.86 | 0 | 0.17 | −5.1594 | 0.13758 | −3.6187 | 0.18281 | |
| AACT-2 | c72984_g1 | 38.7 | 32.07 | 67.57 | 0.59302 | 1.30E-05 | −0.50544 | 0.99456 | |
| AACT-3 | c102447_g1 | 3.08 | 0.05 | 0 | −0.40148 | 0.99421 | −0.67435 | 0.99456 | |
| HMGS | c61824_g1 | 4.19 | 0 | 0.71 | −2.9209 | 0.35955 | −3.6026 | 0.1887 | |
| HMGR | c66823_g1 | 560.62 | 515.37 | 476.23 | −0.58795 | 0.0030532 | −0.48117 | 0.2103 | |
| MK-1 | c58223_g1 | 5.82 | 10.08 | 6.3 | −0.20776 | 0.99421 | 0.43378 | 0.75911 | |
| MK-2 | c76083_g1 | 2.81 | 0 | 0 | −3.3916 | 0.34457 | −3.3936 | 0.27635 | |
| PMK | c67242_g5 | 17.33 | 16.48 | 16.47 | −0.42636 | 0.99421 | −0.43222 | 0.99456 | |
| MVD | c56352_g1 | 32.56 | 27.55 | 71.17 | 0.77398 | 0.00014198 | −0.60355 | 0.99456 | |
| IDI-1 | c11491_g1 | 110.38 | 67.91 | 168.01 | −1.2525 | 0.99421 | −0.71196 | 0.99456 | |
| IDI-2 | c74391_g1 | 3.27 | 0 | 0 | 0.25038 | 0.0045152 | −1.0658 | 0.0053014 | |
| MEP | |||||||||
| DXS-1 | c72095_g4 | 7.95 | 29.55 | 13.16 | 0.44095 | 0.27058 | 1.4152 | 1.02E-06 | |
| DXS-2 | c54473_g1 | 1.48 | 16.01 | 2.96 | 0.64894 | 0.99421 | 3.0798 | 8.03E-07 | |
| DXR | c71143_g1 | 21.4 | 111.68 | 22.72 | −0.26747 | 0.99421 | 2.0223 | 8.93E-24 | |
| MCT | / | / | / | / | |||||
| CMK | c61103_g1 | 7.83 | 20.07 | 7.43 | −0.43182 | 0.99421 | 0.9928 | 0.072373 | |
| MECPS | c61331_g1 | 24.64 | 102.4 | 32.58 | 0.045319 | 0.99421 | 1.6868 | 6.12E-10 | |
| HDS | c71989_g1 | 12.68 | 81.59 | 15.17 | −0.093038 | 0.99421 | 2.3319 | 4.61E-48 | |
| HDR | c67492_g1 | 20.44 | 116.92 | 19.53 | −0.41845 | 0.99421 | 2.1561 | 2.70E-30 | |
| IPP to Cholesterol | |||||||||
| GPPS-1 | c97556_g1 | 9.45 | 45.39 | 5.51 | −1.1357 | 0.98019 | 1.8974 | 3.81E-06 | |
| GPPS-2 | c90599_g1 | 11.17 | 56.51 | 12.66 | −0.17417 | 0.99421 | 1.9746 | 3.10E-09 | |
| FPPS | c67887_g1 | 25.07 | 33.68 | 18.17 | −0.81943 | 0.87666 | 0.061922 | 0.76188 | |
| SS | c58461_g1 | 72.56 | 69.42 | 26.82 | −1.7907 | 4.01E-06 | −0.42756 | 0.99456 | |
| SE | c56806_g1 | 247.1 | 11.46 | 48.8 | −2.6935 | 6.27E-52 | −4.7909 | 1.01E-90 | |
| CAS | c73551_g1 | 66.58 | 244.71 | 39.38 | −1.1029 | 0.00062765 | −0.15504 | 0.70716 | |
| SSR-1 | c71749_g1 | 114.37 | 24.43 | 63.32 | −1.2053 | 9.8524E-07 | −2.5864 | 2.0831E-28 | |
| SSR-2 | c70423_g1 | 333.58 | 118.07 | 110.38 | −1.9485 | 1.2352E-46 | −1.8588 | 4.3029E-43 | |
| SMO1 | c62143_g1 | 787.76 | 15.24 | 131 | −2.938 | 2.493E-172 | −6.0534 | 2.5658E-285 | |
| CPI | c57629_g1 | 16.77 | 11.25 | 42.91 | 1.0016 | 0.010324 | −0.9308 | 0.92087 | |
| CYP51 | c56203_g1 | 1248.79 | 77.57 | 445.7 | −1.8394 | 9.6484E-154 | −4.3694 | 0 | |
| C14-R | c69698_g1 | 62.54 | 21.13 | 43.66 | −0.87364 | 0.26089 | −1.9294 | 3.8533E-06 | |
| 8,7 SI | c69931_g1 | 57.39 | 36.66 | 70.93 | −0.046848 | 0.48601 | −1.0062 | 0.0167 | |
| SMO2 | c50487_g1 | 4.47 | 0 | 0.25 | −4.5004 | 0.3795 | NA | NA | |
| C5-SD | c67458_g1 | 580.81 | 245.51 | 146.88 | −2.3389 | 8.7667E-70 | −1.607 | 4.0749E-36 | |
| 7-DR | c60553_g1 | 74.97 | 111.84 | 64.56 | −0.56945 | 0.94196 | 0.21529 | 0.0091335 | |
Transcript abundance was calculated by Fragments Per kb per Million fragments (FPKM) values and only the unigenes at full length were subjected to this gene expression analysis. The “/” represents that the unigene was not at full length presented in the transcriptome.
Figure 3Phylogenetic analysis of potential CYP450 candidates from D. zingiberensis transcriptome with previously published CYP450s from various metabolic pathways.
Figure 4In vitro C3-glycosylation of diosgenin and cholesterol by the purified recombinant Dz3GT1 and Dz3GT2. HPLC profiles were shown for the assays with diosgenin (A) and cholesterol (B). CK means the control reaction. For the controls, the purified Dz3GTs were excluded from the reaction mixtures. Both Dz3GT1 and Dz3GT2 were able to convert diosgenin to one new product (peak 1), corresponding to trillin (diosgenin 3-O-glucoside), which was not produced in the controls. When cholesterol was used as the substrate, a new product (peak2) was produced by both the UDP-glycosyltransferase (UGT) proteins but was not observed from the control assays. The peak 2 was assumed to be cholesterol 3-O-glucoside based on the C3-glucosylating activities of Dz3GTs.