| Literature DB >> 26943165 |
Shivanjali Kotwal1, Sanjana Kaul1, Pooja Sharma1, Mehak Gupta1, Rama Shankar2, Mukesh Jain2, Manoj K Dhar1.
Abstract
Plantago ovata is an economically and medicinally important plant of the family Plantaginaceae. It is used extensively for the production of seed husk for its application in pharmaceutical, food and cosmetic industries. In the present study, the transcriptome of P. ovata ovary was sequenced using Illumina Genome Analyzer platform to characterize the mucilage biosynthesis pathway in the plant. De novo assembly was carried out using Oases followed by velvet. A total of 46,955 non-redundant transcripts (≥100 bp) using ~29 million high-quality paired end reads were generated. Functional categorization of these transcripts revealed the presence of several genes involved in various biological processes like metabolic pathways, mucilage biosynthesis, biosynthesis of secondary metabolites and antioxidants. In addition, simple sequence-repeat motifs, non-coding RNAs and transcription factors were also identified. Expression profiling of some genes involved in mucilage biosynthetic pathway was performed in different tissues of P. ovata using Real time PCR analysis. The study has resulted in a valuable resource for further studies on gene expression, genomics and functional genomics in P. ovata.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26943165 PMCID: PMC4778938 DOI: 10.1371/journal.pone.0150273
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Plantago ovata plant at (a) vegetative and (b) reproductive phase, (c) P. ovata ovaries.
QC summary of the paired end sequencing data.
| File name | Paired end library |
|---|---|
| Total number of reads | 31,280,458 |
| Total number of bases | 2,252,192,976 |
| Number of primer/ adapter contaminated HQ reads | 270 |
| Total number of HQ filtered reads | 29,861,418 |
| Percentage of HQ filtered reads | 95.46% |
Fig 2Comparison of de novo assembly of the data obtained through Velvet, Abyss, CLC genomics workbench, Oases and merged assembly of Oases programs.
The merged assembly produces less number of transcripts but the N50 transcript length and Average transcript length is much higher as compare to others.
Assembled transcripts obtained after merging of Oases assembly with k-mer lengths 31–41.
| Parameters | Longest isoforms |
|---|---|
| Number of transcripts | 46,955 |
| Total bases | 19,251,438 |
| Minimum transcript length | 100 |
| Maximum transcript length | 3,544 |
| Average transcript length | 410 |
| N50 length | 570 |
Fig 3Percentage GC content of P. ovata, A. thaliana, rice, tomato and Eucalyptus transcripts.
The percentage GC content of P. ovata and Eucalyptus falls into the range of monocots.
Number of unigenes showing sequence homology with proteomes of twenty fiveplants with an e-value ≤1e -5 and ≥80% coverage.
| Plant reference | Family | 1E-5 cut off | > = 80% coverage |
|---|---|---|---|
| Brassicaceae | 24,866 (52.95%) | 2,184 (4.65%) | |
| Brassicaceae | 24,561 (52.30%) | 2,109 (4.49%) | |
| Brassicaceae | 24,659 (52.51%) | 2,194 (4.67%) | |
| Solanaceae | 26,626 (56.70%) | 2,351 (5.0%) | |
| Solanaceae | 24,080 (51.28%) | 2,656 (5.65%) | |
| Fabaceae | 25,718 (54.77%) | 2,293 (4.88%) | |
| Fabaceae | 25,366 (54.02%) | 2,189 (4.66%) | |
| Fabaceae | 25,633 (54.59%) | 2,251 (4.79%) | |
| Rosaceae | 26,010 (55.39%) | 2,222 (4.73%) | |
| Rosaceae | 25,060 (53.37%) | 1,659 (3.53%) | |
| Rosaceae | 25,294 (53.86%) | 1,530 (3.25%) | |
| Euphorbiaceae | 25,789 (54.92%) | 2,237 (4.76%) | |
| Caricaceae | 24,609 (52.40%) | 2,287 (4.87%) | |
| Vitaceae | 25,566 (54.44%) | 2,113 (4.50%) | |
| Cucurbitaceae | 25,141 (53.54%) | 2,277 (4.84%) | |
| Malvaceae | 26,354 (56.12%) | 2,165 (4.61%) | |
| Linaceae | 24,928 (53.08%) | 2,007 (4.27%) | |
| Rutaceae | 25,388 (54.06%) | 2,495 (5.31%) | |
| Saliaceae | 25,941 (55.24%) | 2,370 (5.04%) | |
| Myrtaceae | 25,380 (54.05%) | 2,084 (4.43%) | |
| Scrophulariaceae | 27,604 (58.76%) | 2,482 (5.28%) | |
| Poaceae | 23,454 (49.94%) | 1,957 (4.16%) | |
| Poaceae | 23,088 (49.17%) | 1,914 (4.07%) | |
| Poaceae | 22,875 (48.71%) | 2,115 (4.50%) | |
| Poaceae | 23,396 (49.82%) | 1,848 (3.93%) |
Fig 4Gene Ontology classification of the assembled Plantago ovatatranscripts in different categories of biological process, molecular function and cellular component.
Fig 5KOG functional classification of unigenes clustered into four major categories: poorly characterized, information storage and processing, cellular process and signaling and metabolism.
Transcripts showing homology to several biosynthesis pathways: Mucilage, Carotenoid, Flavonoid and Phenylproponoid biosynthesis pathways.
| S. No. | Name of the gene | Biosynthesis pathway | Transcript Information |
|---|---|---|---|
| 1 | Galacturonosyltransferase- like 3 (GATL3) | Mucilage biosynthesis pathway | Locus_10033_transcript_5/5 |
| 2 | Galacturonosyltransferase 11 (GAUT11) | Locus_13594_transcript_1/3 | |
| 3 | Polygalacturonate 4- alpha- galacturonosyltransferase (PARVUS) | Locus_31653_transcript_1/1 | |
| 4 | Polygalacturonate 4-alpha-galacturonosyltransferase activity (LGT9) | Locus_10033_transcript_5/5 | |
| 5 | Galacturonosyltransferase 10 (GAUT 10) | Locus_9890_transcript_12/14 | |
| 6 | Galacturonosyltransferase 9 (GAUT 9) | Locus_3331_transcript_16/16 | |
| 7 | Galacturonosyltransferase1 (GAUT 1) | Locus_5760_transcript_1/5 | |
| 8 | Polygalacturonate 4-alpha-galacturonosyltransferase activity (GATL6) | Locus_15519_transcript_4/5 | |
| 9 | Galacturonosyltransferase4 (GAUT4) | Locus_18562_transcript_1/3 | |
| 10 | Glucuronoxylanglucuronosyltransferase (GUT1) | Locus_4011_transcript_6/19 | |
| 11 | Locus_104366_transcript_3/8 | ||
| 12 | Transparent Testa Glabra 1 (TTG1) | Locus_5224_transcript_1/6 | |
| 13 | Mucilage-Modified 4 (MUM4) | Locus_395_transcript_38/47 | |
| 14 | Prairie (PRA) | Locus_16030_transcript_2/6 | |
| 15 | Radial swelling3 (RSW3) | Locus_2050_transcript_7/12 | |
| 16 | Mucilage-Modified 2 (MUM2) | Locus_41504_transcript_1/1 | |
| 17 | Glabrous1 (GL1) | Locus_485_transcript_6/8 | |
| 18 | Arabinose 4-epimerase (MUR4) | Locus_28719_transcript_1/1 | |
| 19 | Zeta-carotene desaturase (ZDS) [EC:1.3.5.6] | Carotenoid biosynthesis pathway | Locus_27865_Transcript_2/8 |
| 20 | Phytoene synthase (PSY) [EC:2.5.1.32] | Locus_5318_Transcript_5/5 | |
| 21 | Lycopene β-cyclase (lcyB) [EC:5.5.1.19] | Locus_15899_Transcript_3/3 | |
| 22 | ProlycopeneIsomerase (crtISO) [EC:5.2.1.13] | Locus_23335_Transcript_3/4 | |
| 23 | Carotene epsilon-monooxygenase (LUT1) [EC:1.14.99.45] | Locus_11363_Transcript_1/1 | |
| 24 | Zeaxanthinepoxidase (ZEP) [EC:1.14.13.90] | Locus_3049_Transcript_7/8 | |
| 25 | Violaxanthin de-epoxidase (VDE) [EC:1.10.99.3] | Locus_13468_Transcript_1/1 | |
| 26 | 9-cis-epoxycarotenoid dioxygenase (NCED) [EC:1.13.11.51] | Locus_15430_Transcript_2/2 | |
| 27 | Abscisic-aldehyde oxidase (AAO3) [EC:1.2.3.14] | Locus_19045_Transcript_1/4 | |
| 28 | 15-cis-zeta-carotene isomerase (Z-ISO) [EC:5.2.1.12] | Locus_20366_Transcript_2/2 | |
| 29 | Beta-carotene hydroxylase (chyb/crtZ) [EC:1.14.13.129] | Locus_5008_Transcript_2/2 | |
| 30 | Beta-ring hydroxylase (LUT5) [EC:1.14.-.-] | Locus_11330_Transcript_3/9 | |
| 31 | Phenylalanine ammonia lyase (PAL) [EC:4.3.1.24] | Phenylproponoid and flavonoid biosynthesis pathway | Locus_234_Transcript_30/67 |
| 32 | Cinnamate 4-hydroxylase (C4H) [EC:1.14.13.11] | Locus_518_Transcript_27/37 | |
| 33 | 4-coumarate CoA ligase (4CL 1) [EC:6.2.1.12] | Locus_7647_Transcript_1/5 | |
| 34 | 4-coumarate CoA ligase (4CL 2) [EC:6.2.1.12] | Locus_18824_Transcript_1/1 | |
| 35 | 4-coumarate CoA ligase (4CL 5) [EC:6.2.1.12] | Locus_107_Transcript_6/8 | |
| 36 | Chalcone synthase (CHS) [EC:2.3.1.74] | Locus_598_Transcript_48/50 | |
| 37 | ChalconeIsomerase (CHI) [EC:5.5.1.6] | Locus_7400_Transcript_15/20 | |
| 38 | Flavanone 3-hydroxylase (F3H) [EC:1.14.11.9] | Locus_2097_Transcript_9/16 | |
| 39 | Flavonoid 3’-hydroxylase (F3’H) [EC:1.14.13.21] | Locus_1209_Transcript_8/11 | |
| 40 | Flavonol synthase (FLS 1) [EC:1.14.11.23] | Locus_1678_Transcript_5/8 | |
| 41 | Flavonol synthase (FLS 2) [EC:1.14.11.23] | Locus_2087_Transcript_12/13 | |
| 42 | Dihydroflavonol 4-reductase (DFR) [EC:1.1.1.219] | Locus_6271_Transcript_8/10 | |
| 43 | Anthocyanidin synthase (ANS) [EC:1.14.11.19] | Locus_8842_Transcript_4/4 | |
| 44 | Anthocyanidin reductase (ANR) [EC:1.3.1.77] | Locus_14453_Transcript_1/4 | |
| 45 | Flavone synthase (FNS 1) [EC:1.14.11.22] | Locus_38242_Transcript_1/1 | |
| 46 | Flavone synthase (FNS 2) [EC:1.14.11.22] | Locus_438_Transcript_39/47 | |
| 47 | Flavonoid 3’-monooxygenase [EC:1.14.13.21] | Locus_1209_Transcript_8/11 | |
| 48 | Flavonol 3-O-methyltransferase [EC:2.1.1.76] | Locus_5520_Transcript_7/8 |
Fig 6Distribution of Plantago ovata transcripts in different transcription factor families.
C3H, PHD and MADS transcription factor families were enriched in number however CCAAT and Tify were least in number.
Statistics of SSRs identified in Plantago transcripts.
| SSR Mining | |
|---|---|
| Total number of sequences examined | 46,955 |
| Total size of examined sequences (bp) | 19,251,438 |
| Total number of identified SSRs | 1,224 |
| Number of SSRs containing sequences | 1,119 |
| Number of sequences containing more than one SSR | 95 |
| Number of SSRs present in compound formation | 70 |
| Maximal number of bases interrupting two SSRs in a compound microsatellite | 100 |
Fig 7Distribution of SSR repeat type and repeat motifs.
Fig (a) different repeat type SSRs and Fig (b) different repeat motifs and percentage of transcripts corresponding to them.
Fig 8Different classes of non-coding RNAs (ncRNAs) as identified by the Repeat Masker software in P. ovata.