| Literature DB >> 29954065 |
Xiu Yun Lei1, Jing Xia2, Jian Wen Wang3, Li Ping Zheng4.
Abstract
Cyanotis arachnoidea contains a rich array of phytoecdysteroids, including 20-hydroxyecdysone (20E), which displays important agrochemical, medicinal, and pharmacological effects. To date, the biosynthetic pathway of 20E, especially the downstream pathway, remains largely unknown. To identify candidate genes involved in 20E biosynthesis, the comparative transcriptome of C. arachnoidea leaf and root was constructed. In total, 86.5 million clean reads were obtained and assembled into 79,835 unigenes, of which 39,425 unigenes were successfully annotated. The expression levels of 2427 unigenes were up-regualted in roots with a higher accumulation of 20E. Further assignments with Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways identified 49 unigenes referring to the phytoecdysteroid backbone biosynthesis (including 15 mevalonate pathway genes, 15 non-mevalonate pathway genes, and 19 genes for the biosynthesis from farnesyl pyrophosphate to cholesterol). Moreover, higher expression levels of mevalonate pathway genes in roots of C. arachniodea were confirmed by real-time quantitative PCR. Twenty unigenes encoding CYP450s were identified to be new candidate genes for the bioreaction from cholesterol to 20E. In addition, 90 transcription factors highly expressed in the roots and 15,315 unigenes containing 19,158 simple sequence repeats (SSRs) were identified. The transcriptome data of our study provides a valuable resource for the understanding of 20E biosynthesis in C. arachnoidea.Entities:
Keywords: 20-hydroxyecdysone; Cyanotis arachnoidea; biosynthesis; leaves; roots; transcriptome
Mesh:
Substances:
Year: 2018 PMID: 29954065 PMCID: PMC6073978 DOI: 10.3390/ijms19071885
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1(A) Typical high-performance liquid chromatography (HPLC) chromatograms and (B) 20E contents in leaves and roots of C. arachnoidea. 20E was isolated from four-month old C. arachnoidea. Data are expressed as means ± standard deviation of three replicates. Statistical significance in comparison with the corresponding control values is indicated by ** p < 0.01.
Summary of the sequencing data of the leaf and root transcriptome of C. arachnoidea.
| The Sequencing Data | Leaf | Root |
|---|---|---|
| Clean bases | 5,671,005,500 | 5,136,914,500 |
| Clean reads | 45,368,044 | 41,095,316 |
| Q30 (%) | 90.63 | 89.90 |
| GC content (%) | 43.50 | 44.50 |
| Unigenes (≥ 300 bp) | 69,782 | 70,556 |
| All unigenes (≥ 300 bp) | 79,835 | |
| 1268 | ||
| Average length (bp) | 894 | |
Numbers of unigene related to the secondary metabolism biosynthetic pathway in C. arachnoidea.
| Secondary Metabolites Biosynthesis Pathway | Total | Up-Regulated Unigenes | Down-Regulated Unigenes |
|---|---|---|---|
| Phenylpropanoid biosynthesis (ko00940) | 289 | 62 | 17 |
| Stilbenoid, diarylheptanoid, and gingerol biosynthesis (ko00945) | 85 | 9 | 5 |
| Terpenoid backbone biosynthesis (ko00900) | 79 | 1 | 1 |
| Flavonoid biosynthesis (ko00941) | 79 | 2 | 12 |
| Limonene and pinene degradation (ko00903) | 68 | 4 | 8 |
| Ubiquinone and other terpenoid-quinone biosynthesis (ko00130) | 67 | 4 | 12 |
| Isoquinoline alkaloid biosynthesis (ko00950) | 63 | 8 | 2 |
| Steroid biosynthesis (ko00100) | 59 | 4 | 2 |
| Carotenoid biosynthesis (ko00906) | 43 | 1 | 6 |
| Tropane, piperidine and pyridine alkaloid biosynthesis (ko00960) | 39 | 1 | 2 |
| Brassinosteroid biosynthesis (ko00905) | 37 | 7 | 0 |
| Diterpenoid biosynthesis (ko00904) | 35 | 3 | 2 |
| Zeatin biosynthesis (ko00908) | 34 | 3 | 1 |
| Flavone and flavonol biosynthesis (ko00944) | 24 | 0 | 1 |
| Isoflavonoid biosynthesis (ko00943) | 16 | 0 | 1 |
| Monoterpenoid biosynthesis (ko00902) | 14 | 1 | 4 |
| Sesquiterpenoid and triterpenoid biosynthesis (ko00909) | 12 | 3 | 1 |
| Anthocyanin biosynthesis (ko00942) | 8 | 0 | 0 |
| Betalain biosynthesis (ko00965) | 6 | 0 | 0 |
| Geraniol degradation (ko00281) | 5 | 0 | 0 |
| Glucosinolate biosynthesis (ko00966) | 4 | 1 | 0 |
| Caffeine metabolism (ko00232) | 2 | 0 | 0 |
| Polyketide sugar unit biosynthesis (ko00523) | 1 | 0 | 0 |
| Indole alkaloid biosynthesis (ko00901) | 1 | 1 | 0 |
| Total | 1070 | 115 | 77 |
The transcript abundance of each unigene was calculated through the FPKM method, and those with |log2 FoldChange| ≥ 1, with a p value ≤ 0.05, were considered as DEGs. Positive and negative log2 FoldChange values indicates up- and down-regulated unigenes, respectively.
The FPKM value of unigenes which encode enzymes involved in 20E biosynthesis.
| Enzymes | Enzyme Commission Number | Unigenes ID | FPKM Value | |
|---|---|---|---|---|
| Leaf | Root | |||
|
| ||||
| acetyl CoA acetyltransferase (AACT) | 2.3.1.9 | comp60224_c0_seq7 | 1.35 | 4.97 |
| comp70319_c0_seq8 | 36.31 | 114.60 | ||
| 3-hydroxy-3-methyl-glutaryl coenzyme A synthase (HMGS) | 2.3.3.10 | comp69323_c0_seq6 | 15.40 | 26.53 |
| 3-hydroxy-3-methyl-glutaryl coenzyme A reductase (HMGR) | 1.1.1.34 | comp68996_c0_seq2 | 59.92 | 55.43 |
| comp52491_c0_seq3 | 0.29 | 2.98 | ||
| comp31434_c0_seq1 | 0 | 1.82 | ||
| mevalonate kinase (MVK) | 2.7.1.36 | comp69732_c0_seq13 | 10.58 | 5.18 |
| comp120320_c0_seq1 | 0 | 1.10 | ||
| phosphomevalonate kinase (PMK) | 2.7.4.2 | comp65552_c0_seq7 | 35.22 | 69.05 |
| diphosphomevalonate decarboxylase (PMD) | 4.1.1.33 | comp33484_c0_seq1 | 29.43 | 28.81 |
| comp59477_c0_seq1 | 30.56 | 72.50 | ||
|
| ||||
| 1-deoxy- | 2.2.1.7 | comp66526_c0_seq5 | 17.96 | 9.56 |
| comp24131_c0_seq1 | 2.04 | 0.25 | ||
| comp71989_c1_seq3 | 6.86 | 6.63 | ||
| comp59095_c0_seq1 | 3.92 | 12.75 | ||
| 1-deoxy- | 1.1.1.267 | comp47417_c0_seq1 | 41.16 | 0 |
| comp71987_c0_seq5 | 31.46 | 22.10 | ||
| comp74054_c0_seq1 | 59.57 | 35.26 | ||
| 2- | 2.7.7.60 | comp71928_c2_seq77 | 21.02 | 15.51 |
| 4-diphosphocytidyl-2- | 2.7.1.148 | comp74100_c0_seq1 | 24.09 | 10.73 |
| 2- | 4.6.1.12 | comp30840_c0_seq1 | 88.79 | 22.58 |
| hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase (HDS) | 1.17.7.1 | comp33211_c0_seq1 | 30.23 | 4.26 |
| comp53197_c0_seq1 | 7.95 | 2.19 | ||
| comp67980_c0_seq1 | 91.56 | 39.75 | ||
| hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate reductase (HDR) | 1.17.1.2 | comp90900_c0_seq1 | 0.82 | 0.69 |
| comp73401_c0_seq1 | 331.58 | 174.81 | ||
| isopentenyl diphosphate isomerase (IDI) | 5.3.3.2 | comp64868_c0_seq4 | 24.51 | 13.51 |
| comp61897_c0_seq2 | 8.51 | 10.42 | ||
| comp29765_c0_seq1 | 106.24 | 337.84 | ||
| comp74567_c0_seq1 | 19.86 | 25.58 | ||
|
| ||||
| farnesyl diphosphate synthase (FDS) | 2.5.1.10 | comp96291_c0_seq1 | 0.47 | 0.57 |
| comp60714_c0_seq9 | 6.56 | 3.03 | ||
| comp28732_c0_seq1 | 71.68 | 87.98 | ||
| squalene synthase (SQS) | 2.5.1.21 | comp68019_c0_seq1 | 105.92 | 144.18 |
| squalene monooxygenase (SQLE) | 1.14.13.132 | comp35298_c0_seq1 | 201.88 | 387.62 |
| sterol 14-demethylase (CYP51) | 1.14.13.72 | comp67692_c0_seq1 | 33.34 | 24.69 |
| comp67692_c1_seq3 | 335.08 | 250.89 | ||
| δ14-sterol reductase (ERG24) | 1.3.1.70 | comp52043_c0_seq5 | 21.53 | 69.73 |
| sterol-4α-carboxylate 3-dehydrogenase (ERG26) | 1.1.1.170 | comp40656_c0_seq1 | 1.25 | 1.91 |
| comp63129_c0_seq5 | 8.84 | 18.50 | ||
| comp48599_c0_seq1 | 22.32 | 23.31 | ||
| comp69306_c0_seq26 | 7.47 | 29.19 | ||
| δ24-sterol reductase (DHCR24) | 1.3.1.72 | comp29143_c0_seq1 | 136.31 | 215.60 |
| comp48248_c0_seq1 | 1.45 | 272.54 | ||
| cholestenol δ-isomerase (EBP) | 5.3.3.5 | comp47084_c0_seq3 | 10.87 | 59.19 |
| comp60181_c0_seq1 | 40.63 | 54.68 | ||
| δ7-sterol 5-desaturase (ERG3) | 1.14.19.20 | comp50441_c0_seq1 | 5.89 | 5.25 |
| comp61289_c0_seq4 | 28.21 | 24.24 | ||
| 7-dehydrocholesterol reductase (DHCR7) | 1.3.1.21 | comp71639_c2_seq1 | 55.04 | 56.70 |
Figure 2(A) The putative pathway of 20E biosynthesis in C. arachnoidea. (B) The gene expression was validated by RT-qPCR. The solid arrows denote known steps while the dashed arrows denote multi-step and incomplete reactions. Enzymes related to the different steps are shown between the reactions catalyzed. The expression level of unigenes related to these enzymes in leaf and root is shown by a heatmap. Names of the enzymes are provided in Table 3.
Figure 3RT-qPCR analysis of selected genes involved in 20E biosynthesis. (A) Unigenes involved in terpenoid backbone biosynthesis. (B) Unigenes involved in steroid biosynthesis. Leaves and roots of four-month old C. arachnoidea were collected and their total RNAs were used for RT-qPCR analysis. Relative expression level of unigenes in roots was calculated by the 2−ΔΔ method with actin as an internal control and was compared to those in leaves, which were all set to be 1. The negative value represents higher expression levels of unigenes in leaves than in roots. The error bar indicated standard deviations of three biological replicates. Asterisks represented significant differences: * p < 0.05 and ** p < 0.01. The full names of the abbreviations (unigenes) are provided in Table 3.
Validation of the expression levels of CYP450 unigenes by using RT-qPCR
| Unigene ID | Description | Foldchange | RT-qPCR |
|---|---|---|---|
| comp36620_c0_seq1 | cytochrome P450 94C1-like | Inf | (1.67 ± 0.13) × 103 ** |
| comp56632_c0_seq1 | cytochrome P450 90B1-like | Inf | 38.29 ± 10.74 * |
| comp60467_c1_seq1 | cytochrome P450 710A1-like | Inf | 49.69 ± 0.24 ** |
| comp70580_c0_seq1 | cytochrome P450 704C1 | Inf | 32.05 ± 7.01 * |
| comp28108_c0_seq1 | cytochrome P450 CYP736A12 | 1013.73 | (1.26 ± 0.27) × 102 * |
| comp30002_c0_seq1 | cytochrome P450 90A1 isoform X2 | 388.30 | (1.83 ± 0.63) × 102 |
| comp67837_c0_seq4 | cytochrome P450 86B1-like isoform X2 | 147.93 | (5.98 ± 0.71) × 102 ** |
| comp57563_c0_seq2 | cytochrome P450 71A1-like | 124.00 | 19.39 ± 1.42 ** |
| comp61518_c0_seq1 | cytochrome P450 90A1-like | 115.05 | (4.67 ± 0.34) × 102 ** |
| comp71681_c0_seq1 | cytochrome P450 71A1-like | 101.70 | (4.15 ± 0.10) × 102 ** |
| comp69723_c0_seq2 | cytochrome P450 86B1-like | 82.30 | 43.66 ± 8.29 * |
| comp60312_c0_seq1 | cytochrome P450 734A1-like | 56.02 | (1.12 ± 0.07) × 102 ** |
| comp60108_c0_seq3 | cytochrome P450 90B1-like | 47.42 | 73.56 ± 3.60 ** |
| comp60517_c0_seq1 | cytochrome P450 734A6-like | 39.25 | 2.10 ± 0.19 ** |
| comp66356_c0_seq1 | cytochrome P450 71A1-like | 33.79 | 18.72 ± 2.78 ** |
| comp70580_c1_seq4 | cytochrome P450 704C1-like | 22.36 | 24.68 ± 3.93 ** |
| comp60316_c0_seq2 | cytochrome P450 714B3-like | 21.86 | 33.65 ± 4.75 ** |
| comp66583_c0_seq3 | cytochrome P450 714D1-like | 19.16 | 19.30 ± 7.73 |
| comp67012_c0_seq1 | cytochrome P450 734A6-like | 16.32 | 17.48 ± 4.38 ** |
| comp70760_c4_seq11 | Cytochrome P450 86A1 | 12.32 | (2.99 ± 0.07) × 102 ** |
Foldchange equals the ratio of S1/S2. S1 means the FPKM value of a unigene in roots while S2 means the FPKM value of a unigene in leaves. RT-qPCR represents the relative expression level of unigenes in roots measured by RT-qPCR compared to those in leaves. Values are reported as mean ± SD from three independent experiments. Asterisks represented significant differences: * p < 0.05 and ** p < 0.01.
Summary of transcription factor (TF) unigenes in C. arachnoidea
| TF Family | Number of Unigenes Detected | Up-Regulated in Roots | Down-Regulated in Roots |
|---|---|---|---|
| bHLH | 146 | 13 | 10 |
| ERF | 80 | 3 | 1 |
| C2H2 | 79 | 6 | 3 |
| NAC | 78 | 4 | 4 |
| WRKY | 75 | 10 | 5 |
| MYB-related | 74 | 8 | 4 |
| bZIP | 63 | 7 | 0 |
| G2-like | 60 | 4 | 2 |
| MYB | 60 | 15 | 1 |
| HD-ZIP | 49 | 2 | 7 |
| Others | 548 | 18 | 21 |
| Total | 1312 | 90 | 58 |
The transcript abundance of each unigene was calculated through the FPKM method, and those with |log2 FoldChange| ≥ 1 with a p value ≤ 0.05 were considered as DEGs. Positive and negative log2 FoldChange values indicates up- and down-regulated unigenes, respectively.
Statistics of simple sequence repeats (SSRs) identified from C. arachnoidea transcriptome data.
| SSR Statistics | Number |
|---|---|
| Total number of sequences examined | 79,835 |
| Total size of examined sequences (bp) | 71,342,686 |
| Total number of identified SSRs | 19,158 |
| Number of SSR containing sequences | 15,313 |
| Sequences containing more than 1 SSR | 3067 |
| SSRs present in compound formation | 1194 |
| Mononucleotide repeats | 14,598 |
| Di-nucleotide repeat | 2100 |
| Tri-nucleotide repeat | 2270 |
| Tetra-nucleotide repeat | 155 |
| Penta-nucleotide repeat | 17 |
| Hexa-nucleotide repeat | 18 |