Literature DB >> 34220230

The cp genome characterization of Adenium obesum: Gene content, repeat organization and phylogeny.

Khalid Mashay Alanazi1, Mohammad Ajmal Ali2, Soo-Yong Kim3, M Oliur Rahman4, Mohammad Abul Farah1, Fahad Alhemaid2, Meena Elangbam5, Arun Bahadur Gurung6, Joongku Lee7.   

Abstract

Adenium obesum (Forssk.) Roem. & Schult. belonging to the family Apocynaceae, is remarkable for its horticultural and ornamental values, poisonous nature, and medicinal uses. In order to have understanding of cp genome characterization of highly valued medicinal plant, and the evolutionary and systematic relationships, the complete plastome / chloroplast (cp) genome of A. obesum was sequenced. The assembled cp genome of A. obesum was found to be 154,437 bp, with an overall GC content of 38.1%. A total of 127 unique coding genes were annotated including 96 protein-coding genes, 28 tRNA genes, and 3 rRNA genes. The repeat structures were found to comprise of only mononucleotide repeats. The SSR loci are compososed of only A/T bases. The phylogenetic analysis of cp genomes revealed its proximity with Nerium oleander.
© 2021 The Author(s).

Entities:  

Keywords:  Adenium obesum; Apocynaceae; Chloroplast genome; Medicinal plant; Plastome; Poisonous plant

Year:  2021        PMID: 34220230      PMCID: PMC8241589          DOI: 10.1016/j.sjbs.2021.03.048

Source DB:  PubMed          Journal:  Saudi J Biol Sci        ISSN: 2213-7106            Impact factor:   4.219


Introduction

Adenium obesum (Forssk.) Roem. & Schult. (family Apocynaceae), the ‘Desert Rose’ is a poisonous, medicinal plant, distributed from Africa to Arabia, and is used traditoinally in the treatment of various ailments e.g. skin diseases, wounds, muscle pain, joint pain, venereal diseases, tooth decay, septic wounds, and nasal infections (Dimmitt and Hanson, 2002, Mouza and Hossain, 2015, Hossain et al., 2017). It is also used as a pesticide (Versiani et al., 2014), arrow poison for hunting in Africa (Oyen, 2008) and fish toxin (Wiseman, 2009). The A. obesum plant extract reported to possess cytotoxic (Almehdar et al., 2012), antimicrobial (Hossain et al., 2017) and anti-influenza (Kiyohara et al., 2012) activities. The phytochemical compounds identified from A. obesum include cardiac glycosides (cardenolides), pregnanes, triterpenes, flavonoids, and acetyldigitoxigenin (Versiani et al., 2014). The molecular docking of acetyldigitoxigenin elucidates the plausible mechanisms underlying the anticancer properties (Gurung et al., 2020). The recent development in plastome or chloroplast (cp) genomics due to massive progress in the next-generation sequencing (NGS) platforms (Eid et al., 2009, Rothberg et al., 2011, Pattnaik et al., 2014, Jain et al., 2016, Shendure et al., 2017) and bioinformatics tools (Mavromatis et al., 2007, Knudsen et al., 2010, Huang et al., 2012, McElroy et al., 2012, Shendure and Aiden, 2012, Yang and Rannala, 2012, Caboche et al., 2014, Shcherbina, 2014, Kwon et al., 2015, Langmead and Nellore, 2018) have greatly impact on biotechnology application (Spök et al., 2008, Zhang et al., 2015, Daniell et al., 2016). We herein for the first time report the cp genome characterization of highly valued medicinal plant A. obesum, and discuss its structure including gene content, repeat organization, and phylogeny.

Materials and methods

DNA sequencing, assembly and annotation

The fresh leaves of A. obesum were collected from the wild condition of desert habitat near to Riyadh, Saudi Arabia. The total genomic DNA was isolated using QIAGEN DNeasy DNA extraction kit. The de novo sequencing base calling was performed using the Illumina Pipeline 1.3.2 (Nie et al., 2012). The raw reads were filtered using FastQC to obtain the high-quality clean data by removing adaptor sequences using trimmomatic and low-quality reads with Q-value ≤ 20. The filtered reads were assembled using Spades (Bankevich et al., 2012), and annotated using GeSeq (https://chlorobox.mpimp-golm.mpg.de/geseq.html) (Tillich et al., 2017, Hansen et al., 2007). Further downstream analysis from the assembled cp genome included the repeat structure (Benson, 1999, Timme et al., 2007) and small inversion (Nagano et al., 1991, Yang et al., 2010, Doorduin et al., 2011, Castro et al., 2013, Beier et al., 2017).

Comparison of cp genome and phylogenetic analysis

The cp genome of A. obesum were plotted using the mVISTA (http://genome.lbl.gov/vista/mvista/submit.shtml) program with a total number of nine complete cp genomes of Apocynaceae [i.e. (1) Asclepias nivea Forssk., (2) Carissa macrocarpa (Eckl.) A. DC., (3) Catharanthus roseus (L.) G. Don, (4) Cynanchum auriculatum Buch.-Ham. ex Wight, (5) Echites umbellatus Jacq., (6) Nerium oleander L., (7) Oncinotis tenuiloba Stapf, (8) Pentalinon luteum (L.) B.F. Hansen & Wunderlin, and (9) Rhazya stricta Decne.] (Table 1).
Table 1

The ingroup and outgroup taxon with their classification and GenBank accession number included in the phylogenetic analyses. The GenBank accession number marked with * were included in the mVISTA alignment.

Sl. No.TaxonOrderFamilySubfamilyTribeSubtribeGenBank
Ingroup
1.Adenium obesum (Forssk.) Roem. & Schult.GentianalesApocynaceaeApocynoideaeNerieaeNeriinaeMN765097*
2.Asclepias nivea Forssk.GentianalesApocynaceaeAsclepiadoideaeAsclepiadeaeAsclepiadinaeNC_022431.1*
3.Cynanchum auriculatum Buch.-Ham. ex WightGentianalesApocynaceaeAsclepiadoideaeAsclepiadeaeCynanchinaeNC_029460.1*
4.Carissa macrocarpa (Eckl.) A. DC.GentianalesApocynaceaeRauvolfioideaeCarisseaeNC_033354.1*
5.Catharanthus roseus (L.) G. DonGentianalesApocynaceaeRauvolfioideaeVinceaeCatharanthinaeNC_021423.1*
6.Rhazya stricta Decne.GentianalesApocynaceaeRauvolfioideaeAmsonieaeNC_024292.1*
7.Echites umbellatus Jacq.GentianalesApocynaceaeApocynoideaeEchiteaeEchitinaeNC_025655.1*
8.Pentalinon luteum (L.) B.F. Hansen & WunderlinGentianalesApocynaceaeApocynoideaeEchiteaePentalinoninaeNC_025658.1*
9.Nerium oleander L.GentianalesApocynaceaeApocynoideaeNerieaeNeriinaeNC_025656.1*
10.Oncinotis tenuiloba StapfGentianalesApocynaceaeApocynoideaeBaisseeaeNC_025657.1*
11.Anethum graveolens L.ApialesApiaceaeNC_029470.1
12.Ilex delavayi Franch.AquifolialesAquifoliaceaeKX426470.1
13.Helianthus annuus L.AsteralesApocynaceaeNC_007977.1
14.Viburnum betulifolium BatalinDipsacalesAdoxaceaeNC_037951.1
15.Eucommia ulmoides Oliv.GarryalesEucommiaceaeNC_037948.1
16.Gentiana tibetica King ex Hook. f.GentianalesGentianaceaeNC_025319.1
17.Iodes cirrhosa Turcz.IcacinalesIcacinaceaeNC_036254.1
18.Premna microphylla Turcz.LamialesLamiaceaeNC_026291.1
19.Iochroma australe Griseb.SolanalesSolanaceaeNC_029833.1
Outgroup
20.Cornus controversa Hemsl.CornalesCornaceaeMG525004.1
The ingroup and outgroup taxon with their classification and GenBank accession number included in the phylogenetic analyses. The GenBank accession number marked with * were included in the mVISTA alignment. The cp sequences of 48 genes [e.g. ATP synthase genes (atpA, atpB, atpE, atpF, atpH, and atpI), c-type cytochrome synthesis gene (ccsA), envelope membrane protein gene (cemA), Maturase gene (matK), cytochrome b6/f genes (petA, petB, petD, petG, and petN), Photosystem I genes (psaA, psaB, psaC, and psaJ), Photosystem II genes (psbA, psbC, psbE, psbH, psbI, psbJ, psbK, psbN, and psbT), Rubisco gene (rbcL), Large-subunit ribosomal protein genes (rpl14, rpl2, rpl20, rpl32, rpl33, and rpl36), RNA polymerase subunit genes (rpoB, rpoC1, and rpoC2), Small-subunit ribosomal protein genes (rps14, rps15, rps18, rps19, rps2, rps3, rps4, rps7, and rps8), Genes of unknown function (ycf3, and ycf4)] were retrieved from 19 ingroup taxon comprising 10 species of the family Apocynaceae, the representative of the family Apiaceae, Aquifoliaceae, Apocynaceae, Adoxaceae, Eucommiaceae, Gentianaceae, Icacinaceae, Lamiaceae, Solanaceae, and the outgroup from the family Cornaceae (Table 1), and aligned using Clustal X (Thompson et al., 1994), and the molecular phylogenetic analysis was performed by Maximum Evolution method (Rzhetsky and Nei, 1992) using in MEGA X (Kumar et al., 2018).

Results and discussion

The present study reports assembly of the complete cp genome map as a conserved circular structure comprising a total length of 154,437 bp (including LSC, SSC, IRa, and IRb), with an overall GC content of 38.1% (Fig. 1). The results revealed the gene contents, orientation, and the conservation as well as polymorphisms were found in the chloroplast genome as similar to those of other cp genome of angiosperms (Daniell et al., 2016). A total numner of 127 genes were annotated including 96 protein-coding genes, 28 tRNA genes, and 3 rRNA genes (NCBI GenBank accession number: MN765097).
Fig. 1

The gene map and genes contained in the cp genome of A. obesum.

The gene map and genes contained in the cp genome of A. obesum. The sequence identity of A. obesum plotted with the nine different complete cp genomes from the family Apocynaceae e.g. A. nivea, C. macrocarpa, C. roseus, C. auriculatum, E. umbellatus, N. oleander, O. tenuiloba, P. luteum and R. stricta using the mVISTA revealed high similarities amongst them with few regions where the identities was below 90% (Fig. 2).
Fig. 2

The percent identity plot for comparison of cp genome of A. obesum with the other Apocynaceae genomes. Lane from up to down: A. nivea, C. macrocarpa, C. roseus, C. auriculatum, E. umbellatus, N. oleander, O. tenuiloba, P. luteum, and R. stricta.

The percent identity plot for comparison of cp genome of A. obesum with the other Apocynaceae genomes. Lane from up to down: A. nivea, C. macrocarpa, C. roseus, C. auriculatum, E. umbellatus, N. oleander, O. tenuiloba, P. luteum, and R. stricta. Moreover, the present study depicted the distribution and location of repeated structures and microsatellites in the cp genome. The microsatellites or simple sequence repeats (SSRs) are tandem repeats which ranges from 1 to 6 bp and are present commonly in cp genomes (Meng et al., 2018). SSRs have been served as an important marker for molecular characterization of plant species. A total of 40 SSRs were predicted in A. obesum (Table 2) which were composed of a length of at least 10 bp, all of which were found to be homopolymers containing multiple A or T nucleotides at each locus. These reveal that SSR loci are rich in A–T content in the A. obesum cp genome which supports previous chloroplast SSRs reports (Li et al., 2017). Among these SSRs, four SSRs were situated in coding regions and 31 were located in the intergenic regions (Table 2). A total number of 19 genes including 11 protein-coding genes and 8 tRNA genes contained one or two introns (Table 3). Furthermore, five SSRs were found in intronic regions. Thus, most of the repeats were situated in the intergenic region. Tandem and dispersed repeats were analyzed for A. obesum cp genomes and a total of 25 tandem and 19 dispersed repeats were observed (Fig. 3).
Table 2

The SSR loci in the cp genome of Adenium obesum.

StartEndRepeatRepeat length of consensusLocusRegion
21092189(A)10(A)1281ycf1CDS
29142925(A)1212ycf1CDS
95579566(T)1010ndhI-ndhGintergenic
13,83113,840(A)1010ccsA-trnL-UAGintergenic
15,37815,388(T)1111rpl32-ndhFintergenic
15,61415,624(A)1111rpl32-ndhFintergenic
23,93323,950(T)1818rrn23-trnA-UGCintergenic
43,87843,887(A)1010rpl2-rps19-fragmentintergenic
49,25449,266(T)1313rps16intron
52,13252,141(A)1010psbI-trnS-GCUintergenic
53,34753,356(T)1010trnG-GCCintron
53,60753,620(T)1414trnG-GCC-trnR-UCUintergenic
53,76353,775(T)1313trnR-UCU-atpAintergenic
55,42655,435(A)1010atpA-atpFintergenic
56,12956,139(T)1111atpFintron
57,86557,874(T)1010atpH-atpIintergenic
58,08258,093(T)1212atpH-atpIintergenic
60,13860,147(A)1010rps2-rpoC2intergenic
62,36762,377(T)1111rpoC2CDS
72,57672,585(T)1010trnC-GCA-petNintergenic
79,73679,747(T)1212psbC-trnS-UGAintergenic
88,26188,272(T)1212ycf3intron
95,45695,465(T)1010ndhC-trnV-UACintergenic
96,13496,143(A)1010ndhC-trnV-UACintergenic
97,20697,257(T)12, (T)1352trnM-CAU-atpEintergenic
104,329104,341(T)1313psaI-ycf4intergenic
105,304105,315(A)1212ycf4-cemAintergenic
105,629105,642(T)1414ycf4-cemAintergenic
109,707109,720(T)1414psbE-petLintergenic
109,940109,949(A)1010psbE-petLintergenic
111,353111,366(T)1414trnP-UGG-psaJintergenic
113,000113,009(A)1010rps18-rpl20intergenic
115,212115,222(T)1111clpPintron
120,988120,997(A)1010petB-petDintergenic
122,446122,455(A)1010petD-rpoAintergenic
125,125125,136(T)1212rps8-rpl14intergenic
125,645125,710(A)10, (T)1066rpl14-rpl16intergenic
128,414128,424(T)1111rpl22CDS
128,795128,804(T)1010rps19-rpl2intergenic
148,732148,749(A)1818trnA-UGC-rrn23intergenic
Table 3

The intron containing genes in the cp genome of Adenium obesum.

GeneLocationExon I bpIntron I bpExon II bpIntron II bpExon III bp
trnA-UGCIR3581838
trnI-GAUIR3594342
rps12*LSC-IR23453625114
ndhBIR777684756
rpl2IR391649434
trnK-UUULSC35247637
rps16LSC22684341
trnG-GCCLSC2369137
atpFLSC411706144
rpoC1LSC1613749451
ycf3LSC155773228738124
trnL-UAALSC3749150
trnV-UACLSC3758638
clpPLSC22965729174671
rpl2IR434649391
ndhBIR756684777
rps12IR25536234
trnI-GAUIR4294335
trnA-UGCIR3881835

*rps12 is trans-spliced gene with 5′ end exon located in the LSC region and the duplicated 3′ end exon located in IR regions.

Fig. 3

(A-C). The repeat structure analysis in the cp genome of A. obesum. The cutoff value for tandem repeat is 15 bp and 30 bp for dispersed repeat. A. Frequency of repeats by length; B. Repeat type; C. The location distribution of all the repeats.

The SSR loci in the cp genome of Adenium obesum. The intron containing genes in the cp genome of Adenium obesum. *rps12 is trans-spliced gene with 5′ end exon located in the LSC region and the duplicated 3′ end exon located in IR regions. (A-C). The repeat structure analysis in the cp genome of A. obesum. The cutoff value for tandem repeat is 15 bp and 30 bp for dispersed repeat. A. Frequency of repeats by length; B. Repeat type; C. The location distribution of all the repeats. The phylogenetic relationships of a total number of 48 cp genes from the 19 cp genomes including the family Apocynaceae and the representative members of the family Apiaceae (Apiales), Aquifoliaceae (Aquifoliales), Adoxaceae (Dipsacales), Eucommiaceae (Garryales), Gentianaceae (Gentianales), Icacinaceae (Icacinales), Lamiaceae (Lamiales), Solanaceae (Solanales), and the outgroup at the family Cornaceae (Cornales) revealed the proximity of A. obesum (Subfamily Apocynoideae, Tribe Nerieae, Subtribe Neriinae) with Nerium oleander (Subfamily Apocynoideae, Tribe Nerieae, Subtribe Neriinae) (Fig. 4). The family Apocynaceae is one of the 10 largest angiosperm families with c. 4,500 species under c. 370 genera globally with the greatest diversity in the tropics and subtropics (Stevens, 2001, Endress et al., 2014, APG, 2016). Apart from the large number of molecular phylogenetic studies on the family Apocynaceae (e.g. Liede and Täuber, 2000, Liede and Täuber, 2002, Liede, 2001, Liede and Meve, 2001, Liede and Meve, 2002, Meve and Liede, 2001, Meve and Liede, 2002, Meve and Liede, 2004a, Meve and Liede, 2004b, Potgieter and Albert, 2001, Liede and Kunze, 2002, Liede et al., 2002a, Liede et al., 2002b, Verhoeven et al., 2003, Rapini et al., 2003, Rapini et al., 2004, Rapini et al., 2006, Rapini et al., 2007, Simões et al., 2004, Simões et al., 2006, Simões et al., 2007, Liede-Schumann et al., 2005, Venter et al., 2006, Endress et al., 2007, Goyder et al., 2007, Ionta and Judd, 2007, Lahaye et al., 2007, Livshultz et al., 2007, Meve and Liede-Schumann, 2007, Wanntorp and Forster, 2007), the family has also been intensely studied for their pollination biology, plant–herbivore interactions, and secondary chemistry (Wyatt and Broyles, 1994, Góngora Castillo et al., 2012, Courdavault et al., 2014, Agrawal et al., 2015). The phylogenetic nesting of the family Asclepiadaceae in Apocynaceae s.s. has been demonstrated repeatedly (Wanntorp, 1988, Judd et al., 1994, Sennblad and Bremer, 1996, Potgieter and Albert, 2001). The most recent classification of Apocynaceae (Endress et al., 2014) segregated the family into five subfamilies, two paraphyletic which correspond to the former Apocynaceae s.s. (Rauvolfioideae and Apocynoideae) and three monophyletic that relates to the former Asclepiadaceae (Periplocoideae, Secamonoideae, and Asclepiadoideae).
Fig. 4

The maximum likelihood tree inferred from the cp genome of A. obesum analyzed together with the members of the family Apocynaceae and Aquifoliaceae, Adoxaceae, Eucommiaceae, Gentianaceae, Icacinaceae, Lamiaceae, and Solanaceae.

The maximum likelihood tree inferred from the cp genome of A. obesum analyzed together with the members of the family Apocynaceae and Aquifoliaceae, Adoxaceae, Eucommiaceae, Gentianaceae, Icacinaceae, Lamiaceae, and Solanaceae.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  39 in total

1.  In vitro cytotoxic screening of selected Saudi medicinal plants.

Authors:  Hussein Almehdar; Hossam M Abdallah; Abdel-Moneim M Osman; Essam A Abdel-Sattar
Journal:  J Nat Med       Date:  2011-09-28       Impact factor: 2.343

2.  Phylogenetic and evolutionary implications of complete chloroplast genome sequences of four early-diverging angiosperms: Buxus (Buxaceae), Chloranthus (Chloranthaceae), Dioscorea (Dioscoreaceae), and Illicium (Schisandraceae).

Authors:  Debra R Hansen; Sayantani G Dastidar; Zhengqiu Cai; Cynthia Penaflor; Jennifer V Kuehl; Jeffrey L Boore; Robert K Jansen
Journal:  Mol Phylogenet Evol       Date:  2007-06-16       Impact factor: 4.286

3.  The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.).

Authors:  Meng Yang; Xiaowei Zhang; Guiming Liu; Yuxin Yin; Kaifu Chen; Quanzheng Yun; Duojun Zhao; Ibrahim S Al-Mssallem; Jun Yu
Journal:  PLoS One       Date:  2010-09-15       Impact factor: 3.240

4.  Generic delimitations in tuberous Periplocoideae (Apocynaceae) from Africa and Madagascar.

Authors:  Ulrich Meve; Sigrid Liede
Journal:  Ann Bot       Date:  2004-02-23       Impact factor: 4.357

5.  SInC: an accurate and fast error-model based simulator for SNPs, Indels and CNVs coupled with a read generator for short-read sequence data.

Authors:  Swetansu Pattnaik; Saurabh Gupta; Arjun A Rao; Binay Panda
Journal:  BMC Bioinformatics       Date:  2014-02-05       Impact factor: 3.169

6.  A computer simulator for assessing different challenges and strategies of de novo sequence assembly.

Authors:  Bjarne Knudsen; Roald Forsberg; Michael M Miyamoto
Journal:  Genes (Basel)       Date:  2010-09-13       Impact factor: 4.096

Review 7.  Chloroplast genomes: diversity, evolution, and applications in genetic engineering.

Authors:  Henry Daniell; Choun-Sea Lin; Ming Yu; Wan-Jung Chang
Journal:  Genome Biol       Date:  2016-06-23       Impact factor: 13.583

8.  Comparative Analysis of the Complete Chloroplast Genomes of Four Aconitum Medicinal Species.

Authors:  Jing Meng; Xuepei Li; Hongtao Li; Junbo Yang; Hong Wang; Jun He
Journal:  Molecules       Date:  2018-04-26       Impact factor: 4.411

9.  Molecular docking elucidates the plausible mechanisms underlying the anticancer properties of acetyldigitoxigenin from Adenium obesum.

Authors:  Arun Bahadur Gurung; Mohammad Ajmal Ali; Joongku Lee; Fahad Al-Hemaid; Mohammad Abul Farah; Khalid Mashay Al-Anazi
Journal:  Saudi J Biol Sci       Date:  2020-04-20       Impact factor: 4.219

10.  Comparison of mapping algorithms used in high-throughput sequencing: application to Ion Torrent data.

Authors:  Ségolène Caboche; Christophe Audebert; Yves Lemoine; David Hot
Journal:  BMC Genomics       Date:  2014-04-05       Impact factor: 3.969

View more
  2 in total

1.  Comparison Analysis Based on Complete Chloroplast Genomes and Insights into Plastid Phylogenomic of Four Iris Species.

Authors:  Jing-Lu Feng; Li-Wei Wu; Qing Wang; Yun-Jia Pan; Bao-Li Li; Yu-Lin Lin; Hui Yao
Journal:  Biomed Res Int       Date:  2022-07-27       Impact factor: 3.246

2.  Complete Chloroplast Genome Sequence of Fagus longipetiolata Seemen (Fagaceae): Genome Structure, Adaptive Evolution, and Phylogenetic Relationships.

Authors:  Daqu Liang; Haoyun Wang; Jun Zhang; Yuanxiang Zhao; Feng Wu
Journal:  Life (Basel)       Date:  2022-01-09
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.