| Literature DB >> 25072156 |
Mohit Verma1, Rajesh Ghangal1, Raghvendra Sharma1, Alok K Sinha1, Mukesh Jain1.
Abstract
The medicinal plant, Catharanthus roseus, accumulates wide range of terpenoid indole alkaloids, which are well documented therapeutic agents. In this study, deep transcriptome sequencing of C. roseus was carried out to identify the pathways and enzymes (genes) involved in biosynthesis of these compounds. About 343 million reads were generated from different tissues (leaf, flower and root) of C. roseus using Illumina platform. Optimization of de novo assembly involving a two-step process resulted in a total of 59,220 unique transcripts with an average length of 1284 bp. Comprehensive functional annotation and gene ontology (GO) analysis revealed the representation of many genes involved in different biological processes and molecular functions. In total, 65% of C. roseus transcripts showed homology with sequences available in various public repositories, while remaining 35% unigenes may be considered as C. roseus specific. In silico analysis revealed presence of 11,620 genic simple sequence repeats (excluding mono-nucleotide repeats) and 1820 transcription factor encoding genes in C. roseus transcriptome. Expression analysis showed roots and leaves to be actively participating in bisindole alkaloid production with clear indication that enzymes involved in pathway of vindoline and vinblastine biosynthesis are restricted to aerial tissues. Such large-scale transcriptome study provides a rich source for understanding plant-specialized metabolism, and is expected to promote research towards production of plant-derived pharmaceuticals.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25072156 PMCID: PMC4114786 DOI: 10.1371/journal.pone.0103583
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Assembly optimization/validation of Illumina data of C. roseus.
| Total high quality reads (best k-mer) | Non-redundant high quality reads (best k-mer) | MPGR assembly | Merged assembly | |||||||
| ABySS(k_87) | Velvet(k_93) | Oases(k_61) | Oases(k_61) | ABySS(k_87) | Velvet(k_93) | Oases(k_57) | Oases(k_57) | |||
|
| 133650 | 39010 | 128716 | 106736 | 71190 | 39276 | 53017 | 42909 | 86726 | 59220 |
|
| 72.78 | 25.38 | 165.01 | 160.92 | 57.62 | 25.62 | 51.86 | 49.81 | 107.79 | 76.03 |
|
| 100 | 185 | 100 | 250 | 100 | 185 | 107 | 250 | 251 | 250 |
|
| 15524 | 10893 | 17071 | 17071 | 15524 | 10913 | 17112 | 17112 | 12048 | 17141 |
|
| 544.6 | 650.7 | 1282 | 1507.8 | 809.5 | 652.3 | 978.2 | 1160.8 | 1243 | 1283.9 |
|
| 1087 | 848 | 2161 | 2205 | 1400 | 841 | 1911 | 1990 | 1683 | 2115 |
|
| 63706 | 28142 | 76866 | 73292 | 41643 | 28477 | 22593 | 21275 | 66788 | 32666 |
Similarity search was done against TAIR10 proteome.
Assembly generated by merging best k-mer with MPGR transcriptome using TGICL.
Figure 1Number of transcripts representing different transcription factor families in C. roseus transcriptome.
Figure 2Identification of simple sequence repeats (SSRs) in C. roseus transcriptome.
(A) Distribution of SSRs in different classes (B) Frequency of most common SSRs motifs is shown by bar graph.
Mapping of non-redundant high-quality reads on C. roseus transcriptome.
| Tissue samples | High quality reads | Total mapped reads (%) | Uniquely mapped reads (%) |
|
| 79025564 | 70974531 (89.81) | 68635709 (86.85) |
|
| 78728416 | 69955703 (88.86) | 67774853 (86.09) |
|
| 72961718 | 63571877 (87.13) | 61747179 (84.63) |
Figure 3Differential expression analysis of C. roseus transcriptome.
(A) Number of differentially expressed genes in different tissues in pairwise comparisons. Number of up-regulated genes are in bold, while down-regulated are in normal font. (B) Venn diagram showing number of up- and down-regulated (in parentheses) genes in leaf and root tissues as compared to flower. Asterisk represents genes up-regulated in one tissue and down-regulated in another tissue. (C) Heat-map showing expression patterns of differentially up-regulated TF encoding genes in different tissues. The scale at the bottom represents log2 fold change. (D) Graphical view showing GO terms associated with biological process enriched in up-regulated genes of leaf. The GO enrichment was performed using BiNGO. Node size is proportional to the number of genes in each category and shades represent the scale denoting significance level (white- no significant difference).
Figure 4Expression patterns of transcripts involved in TIA biosynthesis.
(A) Vindoline biosynthetic pathway showing important enzymes involved in different reactions. The IDs of C. roseus transcript encoding for the respective enzymes are also indicated. The important intermediates have been highlighted in bold font. (B) Heat-map showing expression patterns of TIA genes in different tissues and treatment. The scale at the bottom of each study represents log2 value of RPKM. Transcript IDs are given at left side and their putative annotation is on right side. (C) The correlation of gene expression results obtained from RNA-seq and real time RT-PCR analysis (D) Heat-map showing expression pattern of TF encoding genes in different tissues and treatment. The scale at the bottom of each study represents log2 value of RPKM. Transcript IDs are given at left side and their putative annotation is on right side.