Literature DB >> 33613076

Plastome of Saraca asoca (Detarioideae, Fabaceae): Annotation, comparison among subfamily and molecular typing.

Mohammad Ajmal Ali1, Tapan Kumar Pan2, Arun Bahadur Gurung3, Mohammad Abul Farah4, Fahad Al-Hemaid1, Khalid Mashay Alanazi4, Meena Elangbam5, Joongku Lee6, Shankar Kumar Pandey7, M Oliur Rahman8, Soo-Yong Kim9.   

Abstract

Saraca asoca (Roxb.) Willd. (subfamily Detarioideae, family Fabaceae) is a perennial evergreen sacred medicinal tree classified under 'vulnerable' by the IUCN. The chloroplast (cp) genome/plastome which follows uniparental inheritance contains many useful genetic information because of its conservative rate of evolution. The assembled cp genome of S. asoca which maps as a conserved circular structure revealed extensive rearrangement in gene organization, comprising total length 160,003 bp including LSC, SSC, IRa, and IRb, and GC content was 35.26%. Herein a set of rbcL and matK gene were established using molecular phylogenetic analyses for molecular typing of S. asoca.
© 2020 The Author(s).

Entities:  

Keywords:  Detarioideae; Fabaceae; Genomic rearrangement; Molecular authentication; Plastome; Saraca asoca; Simple sequence repeat

Year:  2020        PMID: 33613076      PMCID: PMC7878682          DOI: 10.1016/j.sjbs.2020.12.008

Source DB:  PubMed          Journal:  Saudi J Biol Sci        ISSN: 2213-7106            Impact factor:   4.219


Introduction

Saraca asoca (Roxb.) de Wilde [family Fabaceae, subfamily Detarioideae (APG IV, 2016, LPWG, 2017)], commonly known as ‘asoca’ (Fig. 1A-B), indigenous to Assam, E. Pakistan, Upper Burma, Malaya, Ceylon and South India (Singh et al., 2015), is one of the most sacred tree of the Indian subcontinent (Murthy et al., 2008, Mollik et al., 2010). Apart from its various pharmacological significance e.g. antimicrobial (Seetharam et al., 2003, Shirolkar et al., 2013), anticancer (Cibin et al., 2012), anti-inflammatory (Cibin et al., 2012, Saha et al., 2012), antiarthritic (Preethi and Krishnakumar, 2011) activities, the barks, leaves, flowers, and seeds of ‘asoca’ have extensively been used against uterine infections and as astringent in the cases of the internal haemorrhoids in modern as well as in the Indian traditional systems of medicine (Nudrat et al., 2005, Singh et al., 2015).
Fig. 1

Saraca asoca. A. The tree in the flowering stage. B. An enlarged view of inflorescence.

Saraca asoca. A. The tree in the flowering stage. B. An enlarged view of inflorescence. The continuous development in the next-generation sequencing (NGS) platforms (Shendure et al., 2017), and bioinformatics tools (Yang and Rannala, 2012) including cloud computing for genomic data analysis (Kwon et al., 2015, Langmead and Nellore, 2018) during last two decades have (a) greatly propelled to sequencing of the organellar genome e.g. mitochondria (Kozik et al., 2019), chloroplast (Daniell et al., 2016) and whole genome (Chen et al., 2018), (b) revolutionized the understanding of various biological disciplines (Ali et al., 2020) e.g. tree of life (Philippe et al., 2005, Rokas, 2006), evolution of plant genomes (Wendel et al., 2016), gene families and gene function (Leebens-Mack et al., 2019), conservation biology (Johnson and Koepfli, 2014, Wambugu et al., 2018), and (c) alleviate the enhancement of the agronomic traits (Rogalski et al., 2015, Daniell et al., 2016, Lima et al., 2016). The over-exploitation of S. asoca from the wild habitat due to increasing commercial demand of the bark of ‘asoca’ as crude drug material leads it to vulnerable (IUCN, 2019); hence, the characterization of plastome/whole chloroplast (cp) genome of ‘asoca’ and its genetic comparison will facilitate the development of DNA markers for diversity assessment, conservation, and in unraveling function of genes and gene families to produce its enhanced agronomic traits through genetic engineering.

Materials and methods

Leaf sampling and DNA sequencing

The green young leaves material of S. asoca was collected [voucher information: ‘MAA & TKPAN-116′ (BHAG, KSUH)] from the tree growing at conservatory of the botanical garden, Tilka Manjhi Bhagalpur University (TMBU), Bhagalpur, India, without harming the plant, were used for the extraction of DNA using # DNeasy Plant Mini Kit (QIAGEN). The de novo sequencing (as a single end run of 51 bp) was performed at Illumina platform, Illumina Pipeline 1.3.2 (Nie et al., 2012) was used for base calling.

Cp genome assembly and annotation

The raw reads were first filtered using fastqc. The high-quality reads were then assembled using spades (Bankevich et al., 2012), and annotated using the online tool GeSeq (https://chlorobox.mpimp-golm.mpg.de/geseq.html) at Tamarindus indica L. (GenBank NC_026685.1) as reference (Hansen et al., 2007, Tillich et al., 2017). The repeat structure and small inversion (Timme et al., 2007, Yang et al., 2010, Maia et al., 1991, Doorduin et al., 2011, Castro et al., 2013, Beier et al., 2017) in cp genome were analyzed.

Comparison of cp genome and phylogenetic analysis

The cp genome of S. asoca was compared with the five other complete Detarioideae (Fabaceae) cp genomes including Crudia harmsiana Wild., (NC_036743.1), Daniellia pilosa (J. Léonard) Estrella, (NC_036744.1), Guibourtia leonensis J. Leonard, (NC_036742.1) and Tamarindus indica L. (NC_026685.1) by aligning using Kalign (Lassmann and Sonnhammer, 2005) and UPGMA analysis (Sneath and Sokal, 1973) employing MEGA X (Kumar et al., 2018) followed by the verification of the taxon proximity under UPGMA tree with MAUVE alignment. The plant DNA barcoding genes i.e. rbcL and matK of adulterant species (a) Bauhinia variegata L. (GU135196, GU135033), (b) Mesua ferrea L. (KY654490, JN114759), (c) Polyalthia longifolia (Sonn.) Thwaites (JX856748, AY518786), (d) Shorea robusta Gaertn. (KY654498, KY973059) and (e) Trema orientalis (L.) Blume (KY654502, AB924756) were retrieved from the GenBank, and analyzed together with the sequences of the S. asoca (KY678341, KC592386). The sequences were aligned (Thompson et al., 1994), and the molecular phylogenetic analyses by Maximum Evolution method (Rzhetsky and Nei, 1992) rooted using outgroup Sarcandra glabra [Clade: Angiosperms, Order: Chloranthales, Family: Chloranthaceae (KP208901, JN407112) were performed using MEGA X (Kumar et al., 2018).

Results and discussion

The assembled cp genome maps as a conserved circular structure (Fig. 2A), comprising total length 160,003 bp including LSC, SSC, IRa, and IRb, and GC content was 35.26% (NCBI GenBank accession number: MN866115) as similar to those of other angiosperms (Daniell et al., 2016). The cp genome possessed 111 genes (97 CDS, 29 tRNA, 4 rRNA genes) (Fig. 2B). Twelve of the CDS and eight of the tRNAs contain introns; 18 of these contain single intron, and two genes (ycf3 and clpP) possess 2 introns each (Fig. 2A).
Fig. 2

A. The cp geneome map Saraca asoca, B. the genes of different groups are color-coded.

A. The cp geneome map Saraca asoca, B. the genes of different groups are color-coded. The tandem and dispersed repeats were analyzed for S. asoca cp genome. It is evident that the number of tandem and dispersed repeats were more in 15–20 bp and 31–40 bp category, respectively (Fig. 3A). The repeat structures of S. asoca and other five species of Fabaceae were analyzed by REPuter and were compared. The forward and palindrome repeats were common in these species (Fig. 3B-C). A total of 70 different SSR loci repeated more than 1 time (Table 1), contribute to the A–T richness of cp genome. The repeat regions play very significant roles in genome recombination (Yang et al., 2010). The SSRs are highly polymorphic due to higher mutation rate that affects the number of repeat units (Tsai et al., 2008).
Fig. 3

The repeat structure analysis of the cp genome S. asoca. [A. The frequency of repeat by length; B. The repeat type; C. Comparison among six sequenced Fabaceae cp genomes (F: forward, P: palindrome, R: reverse, C: complement orientations)].

Table 1

The SSR loci of S. asoca cp genome.

S.TypeSSRSizeStartsEnd
1P1(T)101029933002
2p2(TA)61239333944
3p2(CT)61294779488
4p2(TA)61297999810
5p1(A)111111,18511,195
6p1(A)101011,51811,527
7p1(T)101014,44414,453
8p1(A)101015,74615,755
9c(T)11seq (T)1410516,11116,215
10c(T)15 seq (A)1010917,23617,344
11p1(T)121218,45418,465
12p1(T)131318,90618,918
13p1(A)111119,42119,431
14p1(A)101048,18748,196
15p1(A)141450,36950,382
16p1(T)141451,45551,468
17p2(TA)81651,74651,761
18p110(A)1053,12953,138
19p1(T)101053,45453,463
20p1(T)141454,01954,032
21c(AT)7 seq (T)1111959,27159,389
22p1(T)101059,79559,804
23c(AT)6 seq (AT)6 seq (AT)716360,24960,411
24p3(TAT)51560,63460,648
25p1(A)101061,43461,443
26c(T)11 g(A)102263,23663,257
27p1(T)121265,35165,362
28p1(T)101065,95865,967
29p4(TTAA)62469,69669,719
30p1(T)121273,24073,251
31p1(T)101074,76174,770
32c(A)10 seq (A)98976,39276,480
33c(A)10 seq (AT)65877,23277,289
34p1(T)111177,94077,950
35p1(A)111179,22479,234
36p1(T)101080,73180,740
37p1(G)101082,97782,986
38p1(A)151584,34884,362
39p1(C)111184,76284,772
40p2(AT)61285,25485,265
41p1(A)101091,24591,254
42p1(T)101091,78191,790
43p1(T)131392,48192,493
44p1(A)141493,45293,465
45p1(A)121293,79993,810
46p2(AT)102094,89494,913
47p2(TA)61296,51596,526
48p2(AT)61296,64896,659
49p1(A)1212101,516101,527
50p1(T)1010103,439103,448
51p2(TA)714103,785103,798
52p1(T)1515105,822105,836
53p1(T)1010106,166106,175
54p2(AT)612106,328106,339
55p1(T)1212108,073108,084
56p1(T)1010109,129109,138
57c(A)11 seq (T)1038109,557109,594
58p1(A)1313112,044112,056
59p2(AT)612112,200112,211
60c(AT)7 t(TA)729112,852112,880
61p1(A)1010114,163114,172
62p1(A)1313115,814115,826
63p1(T)1010116,265116,274
64p1(T)1010117,561117,570
65p1(T)1010120,635120,644
66p1(A)1010121,197121,206
67p1(T)1010126,128126,137
68p1(T)1313130,202130,214
69p1(A)1111131,334131,344
70p1(T)1010133,319133,328
The repeat structure analysis of the cp genome S. asoca. [A. The frequency of repeat by length; B. The repeat type; C. Comparison among six sequenced Fabaceae cp genomes (F: forward, P: palindrome, R: reverse, C: complement orientations)]. The SSR loci of S. asoca cp genome. The comparison of cp genome of S. asoca with the five other complete Detarioideae (Fabaceae) cp genomes e.g. C. harmsiana, D. pilosa, G. leonensis, and T. indica revealed extensive rearrangement in gene organization (Fig. 4). Further, the ME tree from the set of the GenBank accession number [Bauhinia variegata [(Clade: Rosids, Order: Fabales, Family: Fabaceae; GenBank accession number: GU135196, GU135033)], Mesua ferrea [Clade: Rosids, Order: Malpighiales, Family: Calophyllaceae; GenBank accession number: KY654490, JN114759)], Polyalthia longifolia [Clade: Magnoliids, Order: Magnoliales, Family: Annonaceae; GenBank accession number: JX856748, AY518786)], Shorea robusta [Clade: Rosids, Order: Malvales, Family: Dipterocarpaceae; GenBank accession number: KY654498, KY973059] and Trema orientalis [Clade: Rosids, Order: Rosales, Family: Cannabaceae; GenBank accession number: KY654502, AB924756)] of rbcL and matK [- the cp genes used in the plant DNA barcoding (CBOL, 2009)] with the sequence of S. asoca (KY678341, KC592386/MN866115) revealed the optimal tree with the sum of branch length 0.57133802 (Fig. 5), and have potential to be used as molecular typing of S. acoca from its adulterants (Hegde et al., 2018) as NMR spectroscopy (Urumarudappa et al., 2016) and rbcL-ISSR based DNA barcodes (Hegde et al., 2018) are least user-friendly.
Fig. 4

The MAUVE alignment of cp genomes of five different Detarioideae, showing genomic rearrangement.

Fig. 5

The minimum evolution tree based on combined sequence of rbcL and matK gene representative species of Rosids. (R, F, F: Clade: Rosids, Order: Fabales, Family: Fabaceae; R, M, C: Clade: Rosids, Order: Malpighiales, Family: Calophyllaceae; M, M, A: Clade: Magnoliids, Order: Magnoliales, Family: Annonaceae; R, M, C: Clade: Rosids, Order: Malvales, Family: Dipterocarpaceae; R, R, C: Clade: Rosids, Order: Rosales, Family: Cannabaceae).

The MAUVE alignment of cp genomes of five different Detarioideae, showing genomic rearrangement. The minimum evolution tree based on combined sequence of rbcL and matK gene representative species of Rosids. (R, F, F: Clade: Rosids, Order: Fabales, Family: Fabaceae; R, M, C: Clade: Rosids, Order: Malpighiales, Family: Calophyllaceae; M, M, A: Clade: Magnoliids, Order: Magnoliales, Family: Annonaceae; R, M, C: Clade: Rosids, Order: Malvales, Family: Dipterocarpaceae; R, R, C: Clade: Rosids, Order: Rosales, Family: Cannabaceae).

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  2 in total

1.  Comparative plastome data analysis of Dendrosicyos socotranus and Corallocarpus boehmii.

Authors:  Mohammad Ajmal Ali; Khalid Mashay Al-Anazi; Fahad Al-Hemaid; Waquar Akhtar Ansari; Joongku Lee
Journal:  Bioinformation       Date:  2021-07-31

2.  Comparison Analysis Based on Complete Chloroplast Genomes and Insights into Plastid Phylogenomic of Four Iris Species.

Authors:  Jing-Lu Feng; Li-Wei Wu; Qing Wang; Yun-Jia Pan; Bao-Li Li; Yu-Lin Lin; Hui Yao
Journal:  Biomed Res Int       Date:  2022-07-27       Impact factor: 3.246

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.