| Literature DB >> 29321980 |
Ramaswamy Manimekalai1, Smita Nair2, A Naganeeswaran2, Anitha Karun2, Suresh Malhotra3, V Hubbali4.
Abstract
Areca catechu L. belongs to the Arecaceae family which comprises many economically important palms. The palm is a source of alkaloids and carotenoids. The lack of ample genetic information in public databases has been a constraint for the genetic improvement of arecanut. To gain molecular insight into the palm, high throughput RNA sequencing and de novo assembly of arecanut leaf transcriptome was undertaken in the present study. A total 56,321,907 paired end reads of 101 bp length consisting of 11.343 Gb nucleotides were generated. De novo assembly resulted in 48,783 good quality transcripts, of which 67% of transcripts could be annotated against NCBI non - redundant database. The Gene Ontology (GO) analysis with UniProt database identified 9222 biological process, 11268 molecular function and 7574 cellular components GO terms. Large scale expression profiling through Fragments per Kilobase per Million mapped reads (FPKM) showed major genes involved in different metabolic pathways of the plant. Metabolic pathway analysis of the assembled transcripts identified 124 plant related pathways. The transcripts related to carotenoid and alkaloid biosynthetic pathways had more number of reads and FPKM values suggesting higher expression of these genes. The arecanut transcript sequences generated in the study showed high similarity with coconut, oil palm and date palm sequences retrieved from public domains. We also identified 6853 genic SSR regions in the arecanut. The possible primers were designed for SSR detection and this would simplify the future efforts in genetic characterization of arecanut.Entities:
Keywords: Areca genome; Carotenoids; De novo assembly; Flavonoids
Year: 2018 PMID: 29321980 PMCID: PMC5755930 DOI: 10.1016/j.btre.2017.12.005
Source DB: PubMed Journal: Biotechnol Rep (Amst) ISSN: 2215-017X
Summary of raw and trimmed reads from sequencing results.
| Parameters | Raw read | Trimmed read |
|---|---|---|
| Number of paired end reads | 56,321,907 | 51,175,926 |
| Number of bases (Gb) | 5.69 | 4.20 |
| GC% | 49.01 | 46 |
| Read length (bp) | 101*2 | 82*2 |
Fig. 1Length distributions of transcripts in the arecanut leaf transcriptome.
Fig. 2Percentage of transcripts in arecanut leaf transcriptome based on expression values (Fragments per kilobase million, FPKM).
Highly expressed transcripts related to flavonoid and terpenoid biosynthesis based on FPKM values obtained from RNA- seq data of arecanut.
| Transcript Id | Gene name | Pathway | Read Count | FPKM |
|---|---|---|---|---|
| 436630 | Leucoanthocyanidin reductase [EC:1.17.1.3] | Flavonoid biosynthesis | 376,121 | 3572.9 |
| 434102 | Enyl diphosphate reductase [EC:1.17.1.2] | Terpenoid backbone biosynthesis | 134,099 | 1483.7 |
| 426667 | Flavanone 4-reductase [EC:1.1.1.219 1.1.1.234] | Flavonoid biosynthesis | 33,462 | 513.2 |
| 439086 | 1-deoxy- | Terpenoid backbone biosynthesis | 16,153 | 123.4 |
| 414549 | chalcone isomerase [EC:5.5.1.6] | Flavonoid biosynthesis | 5299 | 115.3 |
| 423583 | Cinnamyl-alcohol dehydrogenase [EC:1.1.1.195] | Phenylpropanoid biosynthesis | 6463 | 109.8 |
| 432856 | flavonoid 3′-monooxygenase [EC:1.14.13.21] | Flavonoid biosynthesis | 8953 | 105.8 |
Fig. 3Species wise distribution of blast hits of arecanut transcripts.
The highest percent identity was observed with Vitis vinifera sequences.
Fig. 4Gene ontology (GO) classification of assembled transcripts using UniProt database.
a. GO terms in the biological process, molecular function and cellular component.
b. Number of transcripts annotated with GO terms
Summary of the simple sequence repeat (SSR) types in the arecanut transcriptome.
| Description | Number |
|---|---|
| Total number of identified SSRs | 6853 |
| Number of SSR containing sequence | 6091 |
| Number of sequences containing more than 1 SSR | 673 |
| Number of SSRs present in simple form | 6435 |
| Number of SSRs present in compound form | 418 |
| Distribution to different repeat type classes − | |
| Di repeats | 3963 |
| Tri repeats | 2602 |
| Tetra repeats | 194 |
| Penta repeats | 45 |
| Hexa repeats | 49 |
Definement of microsatellites (unit size/minimum number of repeats) (2/6) (3/5) (4/5) (5/5) (6/5).
Fig. 5Simple sequence repeat (SSR) types in the arecanut transcriptome.
Comparative study of arecanut transcriptome with coconut, oil palm and date palm sequences.
| Name | No. of EST/mRNA/Transcriptome sequence | No. of Arecanut sequence alignment | Reference representation |
|---|---|---|---|
| Coconut | 57175 | 64781(∼54.5%) | 28761 (∼50.3%) |
| Oil palm | 37492 | 30473 (∼25.6%) | 14631 (∼39%) |
| Date palm | 28889 | 52351 (∼44%) | 17414 (60.2%) |