| Literature DB >> 25642758 |
Hongzhe Cao1, Mohammed Nuruzzaman2, Hao Xiu3, Jingjia Huang4, Kunlu Wu5, Xianghui Chen6, Jijia Li7, Li Wang8, Ji-Hak Jeong9, Sun-Jin Park10, Fang Yang11, Junli Luo7, Zhiyong Luo12.
Abstract
The Panax ginseng C.A. Meyer belonging to the Araliaceae has long been used as an herbal medicine. Although public databases are presently available for this family, no methyl jasmonate (MeJA) elicited transcriptomic information was previously reported on this species, with the exception of a few expressed sequence tags (ESTs) using the traditional Sanger method. Here, approximately 53 million clean reads of adventitious root transcriptome were separately filtered via Illumina HiSeq™2000 from two samples treated with MeJA (Pg-MeJA) and equal volumes of solvent, ethanol (Pg-Con). Jointly, a total of 71,095 all-unigenes from both samples were assembled and annotated, and based on sequence similarity search with known proteins, a total of 56,668 unigenes was obtained. Out of these annotated unigenes, 54,920 were assigned to the NCBI non-redundant protein (Nr) database, 35,448 to the Swiss-prot database, 43,051 to gene ontology (GO), and 19,986 to clusters of orthologous groups (COG). Searching in the Kyoto encyclopedia of genes and genomes (KEGG) pathway database indicated that 32,200 unigenes were mapped to 128 KEGG pathways. Moreover, we obtained several genes showing a wide range of expression levels. We also identified a total of 749 ginsenoside biosynthetic enzyme genes and 12 promising pleiotropic drug resistance (PDR) genes related to ginsenoside transport.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25642758 PMCID: PMC4346879 DOI: 10.3390/ijms16023035
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
An overview of assembled sequences produced by “Trinity” following Illumina/Solexa sequencing.
| Length of Nucleotides (bp) | Contigs | Unigenes | All-unigenes | ||
|---|---|---|---|---|---|
| Pg-Con | Pg-MeJA | Pg-Con | Pg-MeJA | ||
| 0–199 | 85,621 | 82,631 | 14,443 | 12,401 | 0 |
| 200–299 | 16,761 | 17,177 | 12,719 | 13,661 | 13,923 |
| 300–399 | 9576 | 9502 | 9213 | 9269 | 9976 |
| 400–499 | 5430 | 5063 | 5909 | 5715 | 6472 |
| 500–599 | 3330 | 3359 | 4428 | 4400 | 4916 |
| 600–699 | 2581 | 2429 | 3568 | 3622 | 4180 |
| 700–799 | 1897 | 1806 | 3039 | 2882 | 3459 |
| 800–899 | 1611 | 1538 | 2738 | 2608 | 3089 |
| 900–999 | 1351 | 1270 | 2356 | 2283 | 2789 |
| 1000–1099 | 1129 | 1049 | 2081 | 2033 | 2476 |
| 1100–1199 | 969 | 910 | 1899 | 1870 | 2321 |
| 1200–1299 | 864 | 761 | 1642 | 1582 | 2006 |
| 1300–1399 | 753 | 670 | 1572 | 1435 | 1862 |
| 1400–1499 | 686 | 641 | 1387 | 1276 | 1731 |
| 1500–1599 | 593 | 544 | 1179 | 1218 | 1558 |
| 1600–1699 | 519 | 482 | 1111 | 1083 | 1441 |
| 1700–1799 | 496 | 384 | 997 | 932 | 1273 |
| 1800–1899 | 395 | 383 | 905 | 875 | 1189 |
| 1900–1999 | 322 | 285 | 688 | 639 | 920 |
| 2000–2099 | 257 | 234 | 597 | 537 | 748 |
| 2100–2199 | 248 | 193 | 481 | 425 | 638 |
| 2200–2299 | 194 | 159 | 478 | 340 | 571 |
| 2300–2399 | 177 | 154 | 357 | 306 | 482 |
| 2400–2499 | 119 | 109 | 255 | 296 | 407 |
| 2500–2599 | 120 | 96 | 272 | 237 | 374 |
| 2600–2699 | 108 | 99 | 239 | 223 | 311 |
| 2700–2799 | 88 | 64 | 184 | 154 | 242 |
| 2800–2899 | 81 | 56 | 173 | 133 | 223 |
| 2900–2999 | 56 | 57 | 125 | 133 | 192 |
| ≥3000 | 444 | 369 | 921 | 871 | 1326 |
| Total Number | 136,776 | 132,474 | 75,956 | 73,439 | 71,095 |
| Total length (bp) | 43,209,082 | 40,612,192 | 52,997,248 | 50,685,207 | 61,942,217 |
| Mean length (bp) | 316 | 307 | 698 | 690 | 871 |
| N50 (bp) | 525 | 488 | 1161 | 1130 | 1307 |
Figure 1Comparison of unigene length between hit and no-hit unigenes in Nr database.
Figure 2Searching characterization of the assembled unigenes against Nr and Swiss-prot databases. (A) E-value distribution of Blast hits in Nr database; (B) E-value distribution of Blast hits in Swiss-prot database; (C) Distribution of assembled unigene similarities based on the top Blast hits in Nr database; (D) Distribution of assembled unigene similarities based on the top Blast hits in Swiss-prot database; and (E) Homology distribution among other plant species based on the top Blast hits in Nr database.
Figure 3Histogram presentation of gene ontology (GO) functional classification. The results are summarized in three GO ontologies: biological process, cellular component and molecular function. The right y-axis indicates the number of unigenes in a category and the left y-axis indicates the percentage in the same category.
Figure 4Histogram presentation of clusters of orthologous groups (COG) functional classification.
Figure 5(A) Unigenes of 11 sub-categories under “metabolism” category; (B) Unigenes of 13 pathways under “other secondary metabolites biosynthesis” sub-category.
Figure 6FPKM values of all-unigenes expressed in Pg-Con and Pg-MeJA. Each dot indicates one unigene. The dots above the yellow line indicate up-regulated genes and below this line are down-regulated genes. The horizontal coordinates represent the FPKM values of Pg-Con and the vertical coordinates the Pg-MeJA FPKM values. The X and Y-axis are in log-scale.
Differentially expressed unigenes encoding enzymes in the ginsenoside biosynthesis pathway.
| Ginsenoside Biosynthesis Steps | Name of Enzymes | Number of Unigenes | Up-Regulated | Down-Regulated | Not DEGs |
|---|---|---|---|---|---|
| Step I | AACT | 3 | 1 | 2 | - |
| HMGS | 9 | 2 | 7 | - | |
| HMGR | 15 | 4 | 11 | - | |
| MVK | 20 | 14 | 6 | - | |
| PMK | 16 | 10 | 6 | - | |
| MDD | 2 | 1 | 1 | - | |
| IDI | 2 | 1 | 1 | - | |
| Step II | GPS | 24 | 8 | 16 | - |
| FPS | 1 | 1 | 0 | - | |
| GGPPS | 14 | 6 | 8 | - | |
| SS | 10 | 10 | 0 | - | |
| SE | 11 | 7 | 4 | - | |
| Step III | DDS | 1 | 1 | 0 | - |
| β-AS | 9 | 2 | 7 | - | |
| CAS | 17 | 1 | 16 | - | |
| LS | 2 | 0 | 2 | - | |
| P450 | 335 | 161 | 168 | 6 | |
| GT | 142 | 48 | 92 | 2 | |
| UGT | 116 | 71 | 45 | - |
AACT, acetyl-CoA acetyltransferase; HMGS, HMG-CoA synthase; HMGR, HMG-CoA reductase; MVK, mevalonate kinase; PMK, phosphomevalonate kinase; MDD, mevalonate-5-diphosphate decarboxylase; IDI, isopentenyl diphosphate isomerase; GPS, geranyl diphosphate synthase; FPS, farnesyl diphosphate synthase; GGPPS, geranylgeranyl diphosphate synthase; SS, squalene synthase; SE, squalene epoxidase; DDS, dammarenediol synthase; β-AS, beta-amyrin synthase; CAS, cycloartenol synthase; LS, lanosterol synthase; P450, cytochrome P450; GT, glycosyltransferase; UGT, UDP-glycosyltransferase; DEGs, differentially expressed genes.
Figure 7Putative ginsenoside biosynthesis pathway in P. ginseng. Enzymes encoded by unigenes found in this study are circled. MVA, mevalonate; MVAP, mevalonate phosphate; MVAPP, mevalonate diphosphate; IPP, isopentenyl diphosphate; DMAPP, dimethylallyl diphosphate; GPP, geranyl diphosphate; FPP, farnesyl diphosphate.
Figure 8Phylogenetic analysis of 10 identified SS unigenes from our dataset and other plant SSs. Phylogenetic tree was generated using the neighbor-joining (NJ) method in MEGA4. indicates the unigenes from our current study. The box in red outline indicates one group of genes that are more similar with each other.
Figure 9Phylogenetic analysis of 12 potential PDR unigenes of our dataset and other plant PDRs. Among two kinds of unigenes, some having 70% similarity among them, are under one cluster (CL) initiated with “CL” along with the same number such as “CL1490”, followed by different number of contigs, whereas others start directly with “Unigene” such as “Unigene12975”. indicates the unigenes from our current study. The box in red outline indicates one group of genes that are more similar with each other.