| Literature DB >> 33281883 |
Jiexi Hou1, Jie Zhang1, Beihong Zhang1,2, Xiaofang Jin1, Haiyan Zhang1, Zhinong Jin1.
Abstract
The roots, bark, and leaves of Cinnamomum camphora are rich in essential oils, which mainly comprised monoterpenes and sesquiterpenes. Although the essential oils obtained from C. camphora have been widely used in pharmaceutical, medicinal, perfume, and food industries, the molecular mechanisms underlying terpenoid biosynthesis are poorly understood. To address this lack of knowledge, we performed transcriptome analysis to investigate the key regulatory genes involved in terpenoid biosynthesis in C. camphora. High-oil-yield trees of linalool type and low-oil-yield trees were used to assemble a de novo transcriptome of C. camphora. A total of 121,285 unigenes were assembled, and the total length, average length, N50, and GC content of unigenes were 87,869,987, 724, 1,063, and 41.1%, respectively. Comparison of the transcriptome profiles of linalool-type C. camphora with trees of low oil yield resulted in a total of 3,689 differentially expressed unigenes, among them 31 candidate genes had annotations associated with metabolism of terpenoids and polyketides, including four in the monoterpenoid biosynthesis pathway and three in the terpenoid backbone biosynthesis pathway. Collectively, this genome-wide transcriptome provides a valuable tool for future identification of genes related to essential oil biosynthesis. Additionally, the identification of a cohort of genes in the biosynthetic pathways of terpenoids provides a theoretical basis for metabolic engineering of essential oils in C. camphora.Entities:
Keywords: Cinnamomum camphora; chemotypes; leaves; terpenoid biosynthesis; transcriptional analysis
Year: 2020 PMID: 33281883 PMCID: PMC7689033 DOI: 10.3389/fgene.2020.598714
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Chemical composition of the essential oils from leaf extracts of C. camphora.
| No. | Compound | N-1 (%) | N-2 (%) | N-3 (%) | LI-1(%) | LI-2(%) | LI-3 (%) |
| 1 | α-Thujene | 1.51 | — | — | — | 0.1 | — |
| 2 | Pinene | 8.28 | 4.63 | — | — | 0.36 | 0.19 |
| 3 | Camphene | — | 1.49 | — | — | — | 0.14 |
| 4 | Phellandrene | 26.19 | 0.88 | — | 0.19 | 0.12 | 0.47 |
| 5 | β-Myrcene | 2.26 | — | — | 0.16 | 0.16 | — |
| 6 | — | — | — | 0.15 | 0.26 | 0.26 | |
| 7 | Eucalyptol | 37.7 | 2.14 | — | 0.59 | — | 0.46 |
| 8 | β-Ocimene | 0.56 | 4.87 | — | — | — | 0.68 |
| 9 | Terpinene | 3.48 | — | — | — | — | |
| 10 | β-Linalool | — | 2.06 | — | 90.38 | 86.8 | 89.16 |
| 11 | Borneol acetate | — | 1.95 | — | — | — | — |
| 12 | Camphor | — | — | — | — | 1.36 | — |
| 13 | Terpineol | 10.57 | — | — | — | 0.13 | 0.07 |
| 14 | Citronellol | — | — | — | — | 0.13 | — |
| 15 | Citral | — | — | — | 0.43 | 1.16 | 0.54 |
| 16 | Caryophyllene | 2.2 | 13.16 | 8.58 | 2.07 | 1.71 | 1.03 |
| 17 | Alloaromadendrene | — | 1.24 | 0.18 | — | — | — |
| 18 | Germacrene D | 1.37 | 1.28 | 3.11 | 1.26 | 0.24 | 0.59 |
| 19 | Naphthalene | — | 1.36 | — | — | — | |
| 20 | Humulene | 1.76 | 3.41 | 5.49 | — | — | 0.59 |
| 21 | ç-Elemene | 1.25 | 15.01 | 4.43 | 0.86 | 3.36 | 1.18 |
| 22 | Nerolidol | — | 22.2 | 44.95 | 0.11 | 0.14 | 0.24 |
| 23 | (−)-Spathulenol | — | 1.76 | 3.72 | — | 0.94 | 0.41 |
| 24 | Nerolidyl acetate | — | 1.29 | — | — | — | — |
| 25 | Guaiol | 0.9 | — | 0.24 | — | — | — |
| 26 | Caryophyllene oxide | — | 1.17 | 1.1 | 0.28 | 0.34 | 0.58 |
| 27 | Others and Unknown | 1.97 | 20.1 | 28.2 | 3.52 | 2.69 | 3.41 |
Statistics for de novo assembly of the C. camphora transcriptome.
| Total assembled bases | 87.9M |
| Total no. of unigenes | 121,285 |
| N50 (bp) | 1,063 |
| Shortest unigene length (bp) | 201 |
| Longest unigene length (bp) | 15,713 |
| Average unigene length (bp) | 724 |
| Mapped reads (%) | >98% |
FIGURE 1Functional annotation and GO classification of assembled unigenes in C. camphora. (A) Functional annotation of assembled unigenes by BLASTx against Nr and Swissprot databases with an E-value threshold of 1e–5. A total of 34,789 unigenes were commonly annotated in the two databases. (B) GO terms of C. camphora unigenes determined by Blast2GO fell into the three categories of biological process, cellular component, and molecular function, where metabolic process, cell part, and catalytic activity were the three most prevalent subcategories, respectively. (C) KEGG classification of annotated unigenes. The x-axis shows the number of unigenes annotated, and the y axis shows the KEGG metabolic pathways the unigenes fell into, including organismal system (O), metabolism (M), genetic information processing (G), environmental information processing (E), and cellular processes (C).
FIGURE 2Transcriptional cascade of DEGs in terpenoid backbone biosynthesis in linalool-type C. camphora. (A) KEGG pathway annotations of DEGs. The x-axis indicates the number of unigenes annotated in each corresponding pathway. The five categories of KEGG pathways are divided and color-coded. (B) DEGs in terpenoid backbone biosynthesis. The transcriptional cascade of terpenoid backbone biosynthesis is shown in the form of protein-coding enzymes or nodes annotated by KEGG pathway analysis. Enzyme expression patterns are indicated with logRPKM values, and DEGs are in azure. The left three and right three columns represent low oil content type (N) and linalool type (LI), respectively. AACT, acetyl-CoA C-acetyltransferase; DXS, 1-deoxy-D-xylulose-5-phosphate synthase; HMGS, hydroxymethylglutaryl-CoA synthase; PMK, phosphomevalonate kinase; MVD, mevalonate diphosphate decarboxylase; DXR, 1-deoxy-d-xylulose-5-phosphate reductoisomerase; HDS, hydroxymethylbutenyl 4-diphosphate synthase; HDR, (E)-4-hydroxy-3-methyl-but-2-enyl diphosphate reductase; IDI, isopentenyl-diphosphate delta-isomerase; GPPS, geranyl diphosphate synthase; FPPS, farnesyl pyrophosphate synthase; GGPPS, geranylgeranyl pyrophosphate synthase; LIS, linalool synthase; MNR1, (+)-neomenthol dehydrogenase-like isoform; GerS, geraniol synthase; GerD, (−)-germacrene D synthase; CLS, ent-copalyl diphosphate synthase; CYP88D6, cytochrome P450 88D6; GA2OX, Gibberellin 2-beta-dioxygenase. (C) qRT-PCR validation of RNA-Seq data. The black bars represent relative gene expression determined by qRT-PCR (left y-axis) and the dotted gray bars represent gene expression levels detected by RNA-Seq (right y-axis). Standard errors of the three technical replicates of each sample of the qRT-PCR analysis and three biological replicates in the two experimental groups of the RNA-Seq analysis are indicated by error bars.