| Literature DB >> 35482775 |
Tanzeem Fatima1, Rangachari Krishnan2, Ashutosh Srivastava1, Vageeshbabu S Hanur3, M Srinivasa Rao4.
Abstract
East Indian Sandalwood (Santalum album L.) is highly valued for its heartwood and its oil. There have been no efforts to comparative study of high and low oil yielding genetically identical sandalwood trees grown in similar climatic condition. Thus we intend to study a genome wide transcriptome analysis to identify the corresponding genes involved in high oil biosynthesis in S. album. In this study, 15 years old S. album (SaSHc and SaSLc) genotypes were targeted for analysis to understand the contribution of genetic background on high oil biosynthesis in S. album. A total of 28,959187 and 25,598869 raw PE reads were generated by the Illumina sequencing. 2.12 million and 1.811 million coding sequences were obtained in respective accessions. Based on the GO terms, functional classification of the CDS 21262, & 18113 were assigned into 26 functional groups of three GO categories; (4,168; 3,641) for biological process (5,758;4,971) cellular component and (5,108;4,441) for molecular functions. Total 41,900 and 36,571 genes were functionally annotated and KEGG pathways of the DEGs resulted 213 metabolic pathways. In this, 14 pathways were involved in secondary metabolites biosynthesis pathway in S. album. Among 237 cytochrome families, nine groups of cytochromes were participated in high oil biosynthesis. 16,665 differentially expressed genes were commonly detected in both the accessions (SaHc and SaSLc). The results showed that 784 genes were upregulated and 339 genes were downregulated in SaHc whilst 635 upregulated 299 downregulated in SaSLc S. album. RNA-Seq results were further validated by quantitative RT-PCR. Maximum Blast hits were found to be against Vitis vinifera. From this study, we have identified additional number of cytochrome family in high oil yielding sandalwood accessions (SaHc). The accessibility of a RNA-Seq for high oil yielding sandalwood accessions will have broader associations for the conservation and selection of superior elite samples/populations for further genetic improvement program.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35482775 PMCID: PMC9049570 DOI: 10.1371/journal.pone.0252173
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Summary of cDNA library, RNA-Seq and de novo sequence assembly of combined (SaSHc and SaSLc) S. album.
| Description | ||
|---|---|---|
| cDNA library size (bp) | 252–662 | 232–571 |
| Average cDNA size (bp) | 416 | 375 |
| No of raw reads | 32,959187 | 29,598869 |
| No. of PE reads | 2.99 billion | 2.55 billion |
| Number of bases | 435.67 billion | 384.98 billion |
| Total data in GB | 4.4 | 3.89 |
Samples wise Gene ontology (GO) category distribution of coding sequences (CDS) in S. album.
| SI No. | Biological Process | Cellular Component | Molecular Function |
|---|---|---|---|
|
| 5,108 | 4,168 | 5,758 |
|
| 4,441 | 3,641 | 4,971 |
Fig 1(A). Comparative GO biological regulation i. High oil yielding (SaSHc) and ii. low oil yielding (SaSHc) in S. album. (B). Comparative GO Cellular component i. High oil yielding (SaSHc) and ii. low oil yielding (SaSHc) in S. album. (C). Comparative GO Molecular function between i. High oil yielding (SaSHc) and ii. low oil yielding (SaSLc) in S. album.
Fig 2(A). Histogram of gene ontology classification (Wego plot); High oil yielding Sandalwood (SaSHc). (B). Histogram of gene ontology classification (Wego plot); Low oil yielding Sandalwood (SaSLc).
Comparative KEGG pathway classification of coding sequences in high oil (SaSHc) and low oil (SaSLc) yielding S. album.
| Pathways | |||||
|---|---|---|---|---|---|
|
| |||||
| Carbohydrate Metabolism | 556 | 494 | |||
| Energy metabolism | 323 | 281 | |||
| Lipid metabolism | 272 | 231 | |||
| Nucleotide metabolism | 162 | 147 | |||
| Amino acid metabolism | 393 | 362 | |||
| Metabolism of other amino acids | 156 | 138 | |||
| Glycan biosynthesis and metabolism | 99 | 88 | |||
| Metabolism of cofactors and vitamins | 218 | 193 | |||
| Metabolism of terpenoids and polyketides | 99 | 80 | |||
| Biosynthesis of other secondary metabolites | 86 | 77 | |||
| Xenobiotics biodegradation and metabolism | 85 | 57 | |||
|
| |||||
| Membrane transport | 34 | 30 | |||
| Signal transduction | 597 | 645 | |||
| Signaling molecules and interaction | 0 | 1 | |||
|
| |||||
| Transport and catabolism | 458 | 426 | |||
| Cell growth and death | 329 | 297 | |||
| Cellular community–eukaryotes | 94 | 87 | |||
| Cellular community–prokaryotes | 72 | 67 | |||
| Cell motility | 51 | 44 | |||
|
| |||||
| Transcription | 321 | 301 | |||
| Translation | 739 | 652 | |||
| Folding, sorting and degradation | 551 | 526 | |||
| Replication and repair | 151 | 126 | |||
|
| |||||
| Environmental adaptation | 264 | 253 | |||
Top 10 KEGG pathways mapped in sandalwood (S. album) transcripts.
| SI No | KEGG pathways | ||
|---|---|---|---|
| 1. | Metabolism | 2981 | 2633 |
| 2. | Terpenoid synthesis | 216 | 181 |
| 3. | Amino acid metabolism | 557 | 495 |
| 4. | Purine metabolism | 130 | 119 |
| 5. | Pyrimidine metabolism | 82 | 73 |
| 6. | Transcription | 321 | 301 |
| 7. | Translation | 303 | 234 |
| 8. | Amino acyl tRNA biosynthesis | 46 | 42 |
| 9. | DNA replication | 27 | 26 |
| 10. | Membrane transport | 34 | 30 |
Comparative analysis of DEGs involved in secondary metabolite biosynthesis pathway analysis of Kos in high oil (SaSHc) and low oil (SaSLc) yielding sandalwood (S. album).
| Pathway | Kos | Pathway ID | |
|---|---|---|---|
| Terpenoid backbone biosynthesis | 35 | 33 | Ko00900 |
| Monoterpenoid biosynthesis | 2 | 1 | Ko00902 |
| Sesquiterpenoid and triterpenoid biosynthesis | 4 | 3 | Ko00909 |
| Diterpenoid biosynthesis | 10 | 10 | Ko00904 |
| Polyprenoid biosynthesis | 31 | 30 | Ko00940 |
| Flavone and flavanol biosynthesis | 3 | 2 | Ko00944 |
| Isoquinolene alkaloid biosynthesis | 9 | 6 | Ko00950 |
| Drug metabolism: Cytochrome | 31 | 23 | Ko00982 |
| Metabolism of xenobiotics by Cytochrome P450 | 34 | 23 | Ko00980 |
| Stilbenoid diarylheptanoid and gingerol biosynthesis | 3 | 4 | Ko00945 |
| Tropane piperidine and pyridine alkaloid biosynthesis | 11 | 8 | Ko00960 |
| Carotenoid biosynthesis | 21 | 15 | Ko00906 |
|
| 194 | 158 | |
Fig 3Identification of differentially expressed genes (DEGs) between SaSHc and SaSLc.
Green Bar indicates commonly expressed DEGs. Blue and red bars represent upregulated and downregulated DEGs (significant at P-value threshold of 0.05).
Total biological process associated with differentially expresses genes (DEGs) in high and low oil yielding sandalwood (S. album).
| Up regulated genes | |||
|---|---|---|---|
|
|
| - | Glyoxylate and dicarboxylate metabolism [Pathway ID:ko00630] |
|
|
| Sulfur metabolism [Pathway ID:ko00920] | - |
| Cutin, suberine and wax biosynthesis [Pathway ID:ko00073] | - | ||
| Steroid biosynthesis [Pathway ID:ko00100] | - | ||
| Glycerolipid metabolism [Pathway ID:ko00561] | - | ||
| Glycerophospholipid metabolism [Pathway ID:ko00564] | - | ||
| - | Carbon fixation in photosynthetic organisms [Pathway ID:ko00710] | ||
|
|
| - | Fatty acid biosynthesis [Pathway ID:ko00061] |
| - | |||
| - | Steroid biosynthesis [Pathway ID:ko00100] | ||
| Sulfur metabolism [Pathway ID:ko00920] [Input number-1] | - | ||
|
|
| - | Purine metabolism [Pathway ID:ko00230] |
|
|
| - | Cysteine and methionine metabolism [Pathway ID:ko00270] |
| - | Arginine and proline metabolism [Pathway ID:ko00330] | ||
| - | Tyrosine metabolism [Pathway ID:ko00350] | ||
| - | Phenylalanine metabolism [Pathway ID:ko00360] | ||
| - | Phenylalanine, tyrosine and tryptophan biosynthesis [Pathway ID:ko00400] | ||
|
|
| Thiamine metabolism [Pathway ID:ko00730] | - |
| Folate biosynthesis [Pathway ID:ko00790] | - | ||
|
|
| Flavonoid biosynthesis [Pathway ID:ko00941] | - |
| Flavone and flavonol biosynthesis [Pathway ID:ko00944] | - | ||
| Isoquinoline alkaloid biosynthesis [Pathway ID:ko00950] | - | ||
|
|
| Biosynthesis of siderophore group nonribosomal peptides [Pathway ID:ko01053] | - |
|
|
| - | Protein export [Pathway ID:ko03060] |
| - | Protein processing in endoplasmic reticulum [Pathway ID:ko04141] | ||
| - | SNARE interactions in vesicular transport [Pathway ID:ko04130] | ||
| - | RNA degradation [Pathway ID:ko03018] | ||
|
| |||
|
|
| Photosynthesis [Pathway ID:ko00195] | - |
|
|
| - | Glycerophospholipid metabolism [PATH:ko00564] |
|
|
| Arginine and Proline metabolism [Pathway ID:ko00330] | - |
|
|
| - | Vitamin B6 metabolism [Pathway ID:ko00750] |
|
|
| - | Protein processing in endoplasmic reticulum [Pathway ID:ko04141] |
|
|
| - | ECM-receptor interaction [Pathway ID:ko04512] |
|
|
| Phagosome [Pathway ID:ko04145] | - |
List of DEGS commonly expressed in sandalwood (S. album).
| Sl No. | CDS_Unigenes_Transcript | DEGs of | log2Fold Change | p-val | Significance | Regulation |
|---|---|---|---|---|---|---|
|
| CDS_9540_Uni_11613_Trans_66422 | ARM20318.1ICE1 | -1.60 | 0.04 | No | Down |
|
| CDS_13993_Uni_17451_Trans_85781 | ARM20326.1RAP2-4-like protein [ | -0.46 | 0.75 | No | Down |
|
| CDS_16279_Uni_20501_Trans_95760 | ANQ46483.1,cytochrome P450 reductase [ | -0.87 | 0.32 | No | Down |
|
| CDS_1770_Uni_2262_Trans_32293 | ADO87007.1E, E-farnesyl diphosphate synthase/ AGV01244.1 farnesyl diphosphate synthase [ | 1.86 | 0.02 | Yes | Up |
|
| CDS_17842_Uni_22680_Trans_103059 | ARM20329.1C3H29 [ | 0.28 | 0.76 | No | Up |
| 0.31 | ||||||
| CDS_17843_Uni_22681_Trans_103060 | ||||||
|
| CDS_18136_Uni_23090_Trans_104382 | ANQ46482.1cytochrome P450 reductase [ | 0.47 | 0.68 | No | Up |
| 0.45 | ||||||
| CDS_18138_Uni_23092_Trans_104384 | -0.56 | |||||
| CDS_18143_Uni_23098_Trans_104395 | ||||||
|
| CDS_19033_Uni_24363_Trans_108464 | ANQ46485.1cytochrome b5 [ | 0.79 | 0.21 | No | Up |
|
| CDS_19930_Uni_25598_Trans_112532 | AHB33939.1bergamotene oxidase [ | 0.43 | 0.57 | No | Up |
| 1.05 | ||||||
| CDS_19932_Uni_25600_Trans_112537 | 1.00 | |||||
| CDS_19933_Uni_25601_Trans_112538 | 1.03 | |||||
| CDS_19934_Uni_25602_Trans_112543 | ||||||
|
| CDS_19935_Uni_25603_Trans_112546 | AHB33943.1CYP76F43 [ | 0.46 | 0.56 | No | Up |
|
| CDS_2184_Uni_2749_Trans_35028 | ARM20319.1COR413-TM1 [ | 0.37 | 0.62 | No | Up |
| 0.33 | ||||||
| CDS_2185_Uni_2750_Trans_35029 | ||||||
|
| CDS_22250_Uni_28800_Trans_123276 | ADK89203.1cinnamyl alcohol dehydrogenase, partial [ | 1.18 | 0.08 | No | Up |
|
| CDS_23330_Uni_30495_Trans_128410 | ANQ46486.1cytochrome b5 [ | 0.85 | 0.32 | No | Up |
|
| CDS_5419_Uni_6545_Trans_48603 | ADO87008.1isopentyl diphosphate isomerase [ | 1.82 | 0.03 | Yes | Up |
|
| CDS_13993_Uni_17451_Trans_85781 | ARM20326.1RAP2-4-like protein [ | -0.46 | 0.74 | No | Down |
Fig 4Visualization of differentially expressed gene transcription by (A) Scatter plot of differentially expressed genes between SaSHc and SaSLc (significant at P value <0.05); green dots represent the downregulated (significant) and red dot represents the upregulated (significant) genes for DGE combination (B) Volcano plot of differentially expressed genes; green dots represent the downregulated (significant) and red dot represents the upregulated (significant)genes for DGE combination (significant at P value <0.05).
Fig 5Heat map depicting the top 50 differentially expressed genes (significant); base mean SaSHc represents the normalized expression values for SaSHc sample and base mean and SaSLc represents the normalized expression values for DGE combination.
Relative expression of high oil yielding (SaSHc) genes, coding sequence, unigenes, transcripts, log2 fold change and regulation.
| SI No. | Genic SSR primers | CDS | Unigenes | Transcripts | Log2fold change | Regulation |
|---|---|---|---|---|---|---|
| 1. | Geranyl pyrophosphate synthase ( | 2915 | 3614 | 38599 | 2.61 | Upregulation |
| 2. | Geranyl geranyl pyrophosphate synthase ( | 9544 | 11617 | 66429 | 3.54 | Upregulation |
| 3. | 3-Hydroxy-3-methylglutaryl-CoA reductase ( | 11763 | 14481 | 75932 | 1.32 | Upregulation |
| 4. | 1-Deoxy-D-xylulose5-phosphate synthase ( | 21435 | 27658 | 119568 | 0.67 | Upregulation |
| 5. | E, E, Farnesyl diphosphate synthase ( | 7514 | 29031 | 123929 | 2.65 | Upregulation |
| 6. | Cytochrome P450 synthase ( | 6012 | 7205 | 5126 | 2.43 | Upregulation |
| 7. | Farnesyl pyrophosphate synthase ( | 1770 | 2262 | 32293 | 1.86 | Upregulation |
| 8. | Phenylalanine ammonia lyase ( | 21850 | 28225 | 121398 | 3.55 | Upregulation |
| 9. | Monoterpene synthase ( | 1948 | 2474 | 33744 | 2.98 | Upregulation |
| 10. | 5-enolpyruvylshikimate 3-phosphate synthase ( | 11286 | 13874 | 74066 | 2.17 | Upregulation |
List of transcription factors and genes encoding key enzymes for sandalwood oil biosynthesis whose expressions were altered in high oil (SaSHc) and low oil yielding (SaSLc) sandalwood (S. album).
| SI No. | Transcription factors (ID) ( | Annotations |
|---|---|---|
| 1. | TFIIA1, GTF2A1, TOA1 (K03122) (3&3) | Transcription initiation factor (TIF) TFIIA large subunit |
| 2. | TFIIA2, GTF2A2, TOA2 (K03123) (1&1) | (TIF) TFIIA small subunit |
| 3. | TFIIB, GTF2B, SUA7, tfb (K03124) (4&3) | (TIF) TFIIB |
| 4. | TBP, tbp (K03120) (2&1) | (TIF) TFIID TATA-box-binding protein |
| 5. | TAF1 (K03125) (1&1) | (TIF) TFIID subunit 1 |
| 6. | TAF2 (K03128) (1&1) | (TIF) TFIID subunit 2 |
| 7. | TAF8 (K14649) (2&1) | (TIF) TFIID subunit 8 |
| 8. | TAF5 (K03130) (2&2) | (TIF) TFIID subunit 5 |
| 9. | TAF4 (K03129) (2&2) | (TIF) TFIID subunit 4 |
| 10. | TAF12 (K03126) (1&1) | (TIF) TFIID subunit 12 |
| 11. | TAF6 (K03131) (5&4) | (TIF) TFIID subunit 6 |
| 12. | TAF9B, TAF9 (K03133) (1&1) | (TIF) TFIID subunit 9B |
| 13. | TAF11 (K03135) (1&1) | (TIF) TFIID subunit 11 |
| 14. | TFIIE1, GTF2E1, TFA1, tfe (K03136) (1&1) | (TIF) TFIIE subunit alpha |
| 15. | TFIIE2, GTF2E2, TFA2 (K03137) (1&1) | (TIF) TFIIE subunit beta |
| 16. | TFIIF1, GTF2F1, TFG1 (K03138) (2&2) | (TIF) TFIIF subunit alpha |
| 17. | TFIIH2, GTF2H2, SSL1 (K03142) (1&1) | TFIIH subunit 2 |
| 18. | TFIIF2, GTF2F2, TFG2 (K03139) (1&1) | (TIF) TFIIH subunit 2 |
| 19. | TFIIH3, GTF2H3, TFB4 (K03143) (1&1) | (TIF) TFIIH subunit 3 |
| 20. | TFIIH4, GTF2H4, TFB2 (K03144) (1&1) | (TIF)TFIIH subunit 4 |
| 21. | ERCC3, XPB (K10843) (1&1) | DNA excision repair protein ERCC-3 |
| 22. | ERCC2, XPD (K10844) (2&2) | DNA excision repair protein ERCC-2 |
| 23. | CDK7 (K02202) (4&3) | Cyclin-dependent kinase 7 |
| 24. | MNAT1 (K10842) (2&2) | CDK-activating kinase assembly factor MAT1 |
| 25. | CCNH (K06634) (3&2) | Cyclin H |
Fig 6Top blast hit species distribution of coding sequence (CDS); Majority of the hits were found to be against Vitis vinifera.
Fig 7Validation of relative gene expression levels of differentially expressed genes by qRT-PCR.
Purple and blue lines represent the RNA-Seq results, while red and green bars represent the qRT-PCR results. The error bars indicate the standard deviation.