| Literature DB >> 26017011 |
Shubhra Rastogi1, Alok Kalra2, Vikrant Gupta3, Feroz Khan4, Raj Kishori Lal5, Anil Kumar Tripathi6, Sriram Parameswaran7, Chellappa Gopalakrishnan8, Gopalakrishna Ramaswamy9, Ajit Kumar Shasany10.
Abstract
BACKGROUND: Ocimum sanctum L. (O. tenuiflorum) family-Lamiaceae is an important component of Indian tradition of medicine as well as culture around the world, and hence is known as "Holy basil" in India. This plant is mentioned in the ancient texts of Ayurveda as an "elixir of life" (life saving) herb and worshipped for over 3000 years due to its healing properties. Although used in various ailments, validation of molecules for differential activities is yet to be fully analyzed, as about 80 % of the patents on this plant are on extracts or the plant parts, and mainly focussed on essential oil components. With a view to understand the full metabolic potential of this plant whole nuclear and chloroplast genomes were sequenced for the first time combining the sequence data from 4 libraries and three NGS platforms.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26017011 PMCID: PMC4445982 DOI: 10.1186/s12864-015-1640-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Process workflow of Ocimum sanctum whole genome sequencing and assembly
Assembly statistics of contigs and scaffolds generated using the three sequencing platforms Illumina HiSeq2000, 454 GS FLX and SOLiD 5500XL
| Description | Contigs | Scaffolds | GapClosed scaffolds | Super-scaffolds |
|---|---|---|---|---|
| Contigs generated | 107785 | 22776 | 22776 | 9059 |
| Maximum Contig Length | 115044 | 414711 | 411690 | 2211552 |
| Minimum Contig Length | 147 | 200 | 200 | 200 |
| Average Contig Length | 3454 | 16984 | 16629 | 44354 |
| Total Contigs Length | 372395755 | 386828951 | 378759759 | 401803260 |
| Total Number of non-ATGC characters | 0 | 17898452 | 2665702 | 26110056 |
| Percentage of non-ATGC characters | 0 | 4.627 | 0.704 | 6.498 |
| Contigs > = 1 Kb | 43174 | 14791 | 14769 | 4159 |
| Contigs > = 10 Kb | 11594 | 7544 | 7407 | 2357 |
| N50 value | 12769 | 61854 | 61242 | 303233 |
| N90 value | 2071 | 12742 | 12534 | 73672 |
Fig. 2Gene map of the Ocimum sanctum chloroplast genome Genes drawn inside the circle are transcribed clockwise, and those outside are counterclockwise. Genes belonging to different functional groups are color-coded. The darker gray in the inner circle corresponds to GC content, while the lighter gray corresponds to AT content
Fig. 3Frequency distribution of SSRs based on motif types. p1: Mono-nucleotide repeats; p2: Di-nucleotide repeats; p3: Tri-nucleotide repeats, p4: Tetra-nucleotide repeats; p5: Penta-nucleotide repeats; p6: Hexa-nucleotide repeats; Complex: no. of SSRs involved in compound formation
Fig. 4Pie-chart showing top 10 functional classes in each of the 3 categories of gene ontology classification. The three main categories are: biological process, cellular component and molecular function representing the assignment O. sanctum predicted proteins with BLAST matches in NR (non-redundant) protein sequences of Arabidopsis to each GO term
Fig. 5The phylogenetic tree of the asterid clade based on 63 protein-coding genes of chloroplast genome. Numbers above each node are bootstrap support values. Spinacea oleracea and Arabidopsis thaliana were set as outgroups
Fig. 6Abundance of phenylpropanoid, mevalonate and non- mevalonate pathway genes as per the annotation of predicted genes against all Viridiplantae clade genes in Uniprot. [Abbreviations used- Alcohol dehydrogenase (ADH); Polyphenol oxidase (PPO); Flavonoid O-methyltransferase (FOMT); Cinnamate 4-hydroxylase (C4H); Cinnamyl alcohol dehydrogenase (CAD); Cinnamoyl-CoA reductase (CCR); Hydroxyphenylpyruvate reductase (HPPR); 4-Coumarate:coenzyme A ligase (4CL); Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase (PF3R4); Chalcone synthase (CHS); Tyrosine aminotransferase (TAT); Hydroxycinnamoyl transferase (HSHCT); Rosmarinic acid synthase (RAS); Phenylalanine ammonia-lyase (PAL); Dihydroflavonol 4-reductase (DFR); Flavonoid 3’ 5’-hydroxylase (F3’5’H); UDP-glucose: flavonoid 7-O-glucosyltransferase (UFGT); Eugenol synthase 1 (EGS); p-Coumaroyl shikimate 3’-hydroxylase (CS3’H); p-Coumarate 3-hydroxylase (C3H); Alcohol acyltransferase (AAT2); Caffeoyl CoA O-methyltransferase (CCOMT); Chalcone isomerase (CHI); Ferulate 5-hydroxylase (F5H); Arogenate dehydrogenase (ADH); Chavicol O-methyltransferase (CVOMT); Prephenate aminotransferase (PAT); Chorismate mutase (CM); Dehydroquinate dehydratase/ shikimate dehydrogenase (DHQ-SDH); Anthocyanidin synthase (ANS); Cinnamate/p-coumarate carboxyl methyltransferase (CCMT); Caffeic acid 3-O-methyltransferase (COMT); 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase (DAHPS); Flavanone 3-hydroxylase (F3H); Flavonoid 3’-hydroxylase (F3’H); Glutathione S-transferase (GST); 4-Hydroxyphenylpyruvate dioxygenase (HPPD); Chorismate synthase (CS); Eugenol O-methyltransferase (EOMT); Benzoate carboxyl methyltransferase (BAMT); Arogenate dehydratase (ADT); 3-dehydroquinate synthase (DHQS); Copalyl diphosphate synthase(CPS); Bicyclogermacrene synthase (Ov-TPS4); Sesquiterpene synthase (SesquiTPS); 3-hydroxy-3-methylglutaryl-coenzyme A reductase (HMGR); (R)-limonene synthase (LS); Terpene synthase (TPS); Geraniol synthase (GES); Gamma-cadinene synthase (CDS); Secologanin synthase (SLS); Selinene synthase (SES); Multifunctional triterpene synthase (Multifunc triTPS); (+)-menthofuran synthase (MFS); Ent-kaurene synthase (KS); Lupeol synthase(LUP); Nerolidol/linalool synthase (NES/LIS-1); Alpha-zingiberene synthase (ZIS); Germacrene D synthase (GDS); 3-hydroxy-3-methylglutaryl coenzyme A synthase(HMGS); Mevalonate kinase (MVK); Mevalonate diphosphate decarboxylase (MDC); Farnesyl diphosphate synthase(FPPS); Squalene synthase (SQS); Beta-amyrin synthase (bAS); Mixed amyrin synthase (AS); 5-epi-aristolochene synthase (EAS); Beta-myrcene synthase (MYS); Cis-muuroladiene synthase (MxpSS1); Monoterpene synthases (MTPS); Cineole synthase (CinS2); Terpinolene synthase (TES); Valencene synthase (ValCS); 5-phosphomevalonate kinase (PMK); (-)-endo-fenchol synthase (FES); (+)-epi-alpha-bisabolol synthase (LdTPS8); Tricyclene synthase 0e23 /(E)-beta-ocimene synthase 0e23/ Terpenoid synthase 0e23) (Terpenoid synthase); Geranyl diphosphate synthase (GPPS); 1-deoxy-D-xylulose 5-phosphate synthase (DXS); 4-hydroxy-3-methylbut-2-enyl diphosphate reductase (HDR); Geranylgeranyl diphosphate synthase (GGPPS); Isopentenyl diphosphate isomerise (IDI); 1-deoxy-D-xylulose-5-phosphate reductoisomerase (DXR); 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase (MCT); 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase (CMK); 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase (MECPP); 4-hydroxy-3-methylbut-2-enyl diphosphate synthase (HDS)]