| Literature DB >> 26543847 |
Hongling Tian1, Xiaoshuang Xu2, Fusheng Zhang3, Yaoqin Wang1, Shuhong Guo1, Xuemei Qin3, Guanhua Du4.
Abstract
Radix polygalae, the dried roots of Polygala tenuifolia and P. sibirica, is one of the most well-known traditional Chinese medicinal plants. Radix polygalae contains various saponins, xanthones, and oligosaccharide esters and these compounds are responsible for several pharmacological properties. To provide basic breeding information, enhance molecular biological analysis, and determine secondary metabolite biosynthetic pathways of P. tenuifolia, we applied Illumina sequencing technology and de novo assembly. We also applied this technique to gain an overview of P. tenuifolia transcriptome from samples with different years. Using Illumina sequencing, approximately 67.2% of unique sequences were annotated by basic local alignment search tool similarity searches against public sequence databases. We classified the annotated unigenes by using Nr, Nt, GO, COG, and KEGG databases compared with NCBI. We also obtained many candidates CYP450s and UGTs by the analysis of genes in the secondary metabolite biosynthetic pathways, including putative terpenoid backbone and phenylpropanoid biosynthesis pathway. With this transcriptome sequencing, future genetic and genomics studies related to the molecular mechanisms associated with the chemical composition of P. tenuifolia may be improved. Genes involved in the enrichment of secondary metabolite biosynthesis-related pathways could enhance the potential applications of P. tenuifolia in pharmaceutical industries.Entities:
Year: 2015 PMID: 26543847 PMCID: PMC4620389 DOI: 10.1155/2015/782635
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Figure 1Length distribution of unigenes.
Functional annotation of P. tenuifolia.
| Database | Total |
| Database version | |
|---|---|---|---|---|
| Number of annotated | Percent (%) | |||
| Total unigenes | 115,477 | |||
| Nt | 69,211 | 59.9 | 1.00 | 201301 |
| Nr | 77,599 | 67.2 | 1.00 | 201301 |
| Swissprot | 54,582 | 47.3 | 1.00 | 201301 |
| COG | 30,548 | 26.5 | 1.00 | No version |
| KEGG | 72,636 | 62.9 | 1.00 | Release58 |
| Interpro | 49,873 | 43.19 | Interproscan 4.8 | v36 |
| GO | 41,823 | 36.22 | ||
Figure 2Gene ontology classification assigned to the unigenes.
Figure 3COG classification assigned to the unigenes.
Pathway classification of P. tenuifolia.
| Category | Pathway | Count |
|---|---|---|
| Metabolism | Carbohydrate metabolism | 3702 |
| Amino acid metabolism | 2618 | |
| Enzyme families | 1781 | |
| Energy metabolism | 1762 | |
| Lipid metabolism | 1610 | |
| Glycan biosynthesis and metabolism | 1390 | |
| Nucleotide metabolism | 1055 | |
| Metabolism of cofactors and vitamins | 879 | |
| Metabolism of other amino acids | 716 | |
| Metabolism of terpenoids and polyketides | 620 | |
| Xenobiotics biodegradation and metabolism | 562 | |
| Biosynthesis of other secondary metabolites | 522 | |
|
| ||
| Genetic information processing | Folding, sorting, and degradation | 4312 |
| Translation | 4029 | |
| Transcription | 3191 | |
| Replication and repair | 3061 | |
|
| ||
| Environmental information processing | Signal transduction | 2050 |
| Membrane transport | 342 | |
| Signaling molecules and interaction | 327 | |
|
| ||
| Cellular processes | Cell growth and death | 1540 |
| Transport and catabolism | 1399 | |
| Cell motility | 507 | |
| Cell communication | 473 | |
|
| ||
| Human diseases | Neurodegenerative diseases | 1398 |
| Infectious diseases | 1371 | |
| Cancers | 1146 | |
| Immune system diseases | 236 | |
| Cardiovascular diseases | 103 | |
Figure 4Classifications of unigenes involved in the metabolism of terpenoids and polyketides (a) and biosynthesis of other secondary metabolites (b).
Figure 5Schematic of the putative biosynthetic pathways of two major classes of active compounds in P. tenuifolia. Saponin biosynthesis pathway (a) and phenylpropanoid biosynthesis pathway (b). Enzyme abbreviations: AACT, acetyl CoA C-acetyltransferase or acetoacetyl-CoA thiolase; HMGS, 3-hydroxy-3-methylglutaryl CoA synthase; HMGR, 3-hydroxy-3-methylglutaryl CoA reductase; MVK, mevalonate kinase; PMK, phosphomevalonate kinase; PMD, mevalonate pyrophosphate decarboxylase; DXS, 1-deoxy-d-xylulose-5-phosphate synthase; DXR, 1-deoxy-d-xylulose-5-phosphate reductoisomerase; MCT, 2-C-methyl-erythritol 4-phosphate cytidylytransferase; CMK, 4-(Cytidine 5′-diphospho)-2-C-methyl-d-erythritol kinase; MDS, 2-C-methyl-d-erythritol 2,4-cyclodiphosphate synthase; HDS, 1-hydroxy-2-methyl-butenyl-4-diphosphatesynthase; HDR, isopentenyl pyrophosphate (IPP)/3, 3-dimethylallyl pyrophosphate (DMAPP) synthase; IPPI, isopentenyl pyrophosphate isomerase; GPPS, geranyl diphosphate synthase; FPPS, farnesyl diphosphate synthase; SQS, squalene synthase; SE, squalene epoxidase; CAS, cycloartenol synthase; β-AS, β-amyrin synthase; DS, dammarenediol synthase; PAL, phenylalanine ammonia lyase; C4H, cinnamate 4-hydroxylase; 4CL, 4-coumarate CoA ligase; CHS, chalcone synthase; CHI, chalcone isomerase; F3′5′H, flavonoid 3′,5′-hydroxylase; F3H, flavanone 3-hydroxylase; FLS, flavonol synthase; HCT, hydroxycinnamoyl-transferase; C3′H, p-coumaroyl shikimate 3′ hydroxylase; CCoAoMT, caffeoyl CoA 3-O-methyltransferase; CCR, cinnanoyl-CoA reductase; CAD, cinnamyl alcohol dehydrogenase; CYPs, cytochrome P450s; UGTs, glycosyltransferases.
Frequency of identified SSR motifs in P. tenuifolia.
| Motif length | Repeat numbers | Total | % | ||||||
|---|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | 10 | >10 | |||
| Mononucleotide | — | — | — | — | — | 1554 | 1890 | 3444 | 51.75 |
| Dinucleotide | — | 638 | 374 | 333 | 333 | 317 | 125 | 2120 | 31.85 |
| Trinucleotide | 646 | 218 | 117 | 10 | 1 | 2 | 2 | 996 | 14.97 |
| Tetranucleotide | 64 | 12 | 0 | 0 | 0 | 0 | 0 | 76 | 1.14 |
| Pentanucleotide | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 3 | 0.045 |
| Hexanucleotide | 12 | 2 | 2 | 0 | 0 | 0 | 0 | 16 | 0.24 |
| Total | 724 | 871 | 493 | 343 | 334 | 1873 | 2017 | 6655 | |
| % | 10.87 | 13.09 | 7.41 | 5.15 | 5.02 | 4.79 | 1.91 | ||