Literature DB >> 32462024

Complete Chloroplast Genomes of 14 Mangroves: Phylogenetic and Comparative Genomic Analyses.

Chengcheng Shi1,2, Kai Han2, Liangwei Li2, Inge Seim3,4, Simon Ming-Yuen Lee5, Xun Xu2,6,7, Huanming Yang1,6, Guangyi Fan2,5,6, Xin Liu2,6,7,8.   

Abstract

Mangroves are a group of plant species that occupy the coastal intertidal zone and are major components of this ecologically important ecosystem. Mangroves belong to about twenty diverse families. Here, we sequenced and assembled chloroplast genomes of 14 mangrove species from eight families spanning five rosid orders and one asterid order: Fabales (Pongamia pinnata), Lamiales (Avicennia marina), Malpighiales (Excoecaria agallocha, Bruguiera sexangula, Kandelia obovata, Rhizophora stylosa, and Ceriops tagal), Malvales (Hibiscus tiliaceus, Heritiera littoralis, and Thespesia populnea), Myrtales (Laguncularia racemosa, Sonneratia ovata, and Pemphis acidula), and Sapindales (Xylocarpus moluccensis). These chloroplast genomes range from 149 kb to 168 kb in length. A conserved structure of two inverted repeats (IRa and IRb, ~25.8 kb), one large single-copy region (LSC, ~89.0 kb), and one short single-copy region (SSC, ~18.9 kb) as well as ~130 genes (85 protein-coding, 37 tRNAs, and 8 rRNAs) was observed. We found the lowest divergence in the IR regions among the four regions. We also identified simple sequence repeats (SSRs), which were found to be variable in numbers. Most chloroplast genes are highly conserved, with only four genes under positive selection or relaxed pressure. Combined with publicly available chloroplast genomes, we carried out phylogenetic analysis and confirmed the previously reported phylogeny within rosids, including the positioning of obscure families in Malpighiales. Our study reports 14 mangrove chloroplast genomes and illustrates their genome features and evolution.
Copyright © 2020 Chengcheng Shi et al.

Entities:  

Mesh:

Year:  2020        PMID: 32462024      PMCID: PMC7225854          DOI: 10.1155/2020/8731857

Source DB:  PubMed          Journal:  Biomed Res Int            Impact factor:   3.411


1. Introduction

Mangroves grow on the intertidal zone of the ocean, the transition zone connecting the land and ocean. Mangrove ecosystems provide essential habitats for marine creatures and benthic organisms and play important roles in regulating energy cycle and maintaining biodiversity [1, 2]. According to their habitats, root morphology, and salt metabolism patterns, mangroves are generally categorized into true mangroves and mangrove associates (or semi-mangroves) [3]. The true mangroves exclusively live in mangrove ecosystems and usually have distinct marine environment adaptations, including the ability to grow in seawater, complex root structures (allowing enhanced nutrient absorption and respiratory metabolism), and viviparous reproduction (seeds germinating on trees) [4]. Semi-mangroves are amphibious, and many can inhabit both terrestrial and aquatic environments (for instance, Pongamia pinnata (L.) Pierre). In mangrove ecosystems, they may grow at the edge of the true mangroves and are often dominant species on degraded beaches. There are more than 80 mangrove species, covering approximately twenty families [5, 6]. Due to their ecological importance, wide distribution, and unique biological features for adaptation, the genome features and genome evolution of these species would be of considerable interest yet remain largely unexplored. As an essential organelle of plants, the chloroplast has an independent genome with stable sequence structure and a relatively conserved number of genes associated with energy production and metabolism. Chloroplast genes such as rbcL and psbA were once evidenced to be resultful in inferring the evolutionary origins and phylogenetic relationship of mangroves species from different clades or geographical regions [7-9]. DNA barcodes of rbcL, matK, and trnH-psbA genes have also been used to identify unknown mangrove species [6]. However, whole chloroplast genomes of mangrove species were limited until now [10]. Detailed whole chloroplast genome comparison and phylogenetic analysis has to date been lacking. In order to acquire more mangrove genetic resources and determine the evolutionary location of mangroves in rosids, we sequenced and assembled the complete chloroplast genomes of 14 mangrove species, including Pongamia pinnata (L.) Pierre, Avicennia marina (Forssk.) Vierh., Excoecaria agallocha L., Bruguiera sexangula (Lour.) Poir., Kandelia obovata Sheue, Liu & Yong, Rhizophora stylosa Griff., Ceriops tagal (Perr.) C.B.Rob., Hibiscus tiliaceus L., Heritiera littoralis Dryand., Thespesia populnea (L.) Sol. ex Correa, Laguncularia racemosa (L.) C.F. Gaertn., Sonneratia ovata Backer, Pemphis acidula Forst., and Xylocarpus moluccensis (Lamk.) Roem.. They represent mangroves of eight families, five rosid orders, and one asterid order (as an outgroup for the phylogenetic analysis). We examined their genome structures and gene contents. Comparative genomics and molecular evolution analyses were performed to illustrate mangrove chloroplast genome features further and reveal relationships among mangrove species.

2. Materials and Methods

2.1. Sequencing, Chloroplast Genome Assembly, and Annotation

Fresh leaves of mangroves were provided by collaborators in Guangzhou, China. DNA were extracted according to a CTAB method and then sequenced on a BGISEQ-500 platform. After sequencing, we randomly extracted five million pair-end reads. We used the MITObim v1.9 [11] for the initial assembly, following a closest reference-based strategy. The size of each chloroplast genome was estimated by SPAdes v3.13.0 [12]. With the initial assembly and the estimated genome size, we applied NOVOPlasty v2.7.2 [13] to assemble the complete chloroplast genome. Finally, we carried out manual curation to obtain circular sequences. Chloroplast genes (including protein coding genes, rRNA genes, and tRNA genes) were predicted and annotated by GeSeq [14] with the MPI-MP chloroplast reference option. The identity cutoffs for protein and rRNA searching were set as 60 and 85, respectively. ARAGORN v1.2.38 [15] was used to annotate tRNAs. Genes were visualized using OGDRAW [16]. IR (inverted repeat) boundaries were identified by chloroplast genome self-alignment using BLAST v2.2.6 [17] (-p blastn -m 8 -F F -e 1). Regions aligned reversely and of the same length were manually curated as inverted repeat regions. Simple sequence repeats (SSRs) with 1-6 bp repeat units were detected using MISA v2.0 [18]. The minimum repeat times were set to be 10 for mononucleotides, 5 for dinucleotide, 4 for trinucleotide, 3 for tetranucleotide, 3 for pentanucleotide, and 3 for hexanucleotide.

2.2. Phylogenetic Analysis

A total of 44 conserved genes (atpA, atpB, atpE, atpH, atpI, ccsA, cemA, matK, ndhA, ndhC, ndhG, ndhI, ndhJ, petA, petN, psaA, psaB, psaC, psaJ, psbA, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbT, rbcL, rpoC1, rpoC2, rpl14, rpl2, rpoA, rpoB, rps11, rps14, rps15, rps19, rps2, rps3, rps4, rps8, and ycf3) found in all 71 plant chloroplast genomes were used to construct robust phylogenetic trees (species listed in Table S2). The coding sequences were aligned by MAFFT v7.407 [19] with the “--auto --adjustdirection” setting. Based on the global alignments, the phylogenetic trees of the 71 representative species were constructed using several methods. Both partitioned and nonpartitioned strategies were implemented with different phylogenetic inference tools based on the concatenated aligned sequences from the 44 conserved genes, including (1) a BI tree constructed using MrBayes v3.2.7 [20] with a single priori GTR+GAMMA model, (2) a ML tree constructed using RAxML v8.2.12 [21] with a GTRGAMMA model, (3) a BI tree constructed using MrBayes with the best partition schemed models estimated by PartitionFinder v2.1.1 [22], and (4) a ML tree constructed using RAxML with the best partition schemed models estimated by PartitionFinder, as well as two trees using methods in (1) and (2) with four commonly used genes (ndhF, matK, rbcL, and atpB). For the Bayesian inference, MCMC analysis was run for 1,000,000 generations with four chains and sampling every 1,000 generations. The first 25% trees were discarded, and the final consensus tree was summarized using the remaining trees. For ML trees, the bootstrap number was set to 1,000. The trees were assessed using CONSEL v1.20 [23].

2.3. Ka/Ks Calculation

The nonsynonymous (Ka) and synonymous (Ks) substitution ratios (Ka/Ks) of genes in 14 mangroves, as well as in species from Lamiales, Fabales, Malpighiales, Malvales, Myrtales, Sapindales, Oxalidales, Celastrales, Fagales, Cucurbitales, Rosales, Brassicales, Huerteales, Geraniales, and Saxifragales (Table S2), were calculated. For all the species in an order, we selected one species outside the order to be used for comparison and Ka/Ks calculation. As these orders cover Asterid I, Rosid I, and Rosid II of core eudicots, we further chose one species from orders Asterales (Helianthus divaricatus: NC_023109.1), Zygophyllales (Larrea tridentata: NC_028023.1), Geraniales (Hypseocharis bilobata: NC_023260.1), and Vitales (Vitis rotundifolia: NC_023790.1) as outgroups of Asterid I, Rosid I, Rosid II, and the remaining rosid species. The two Lamiales species from asterids were compared to Helianthus divaricatus (NC_023109.1). Pairwise alignments were processed using MAFFT v7.407 [19], and Ka/Ks values were calculated using the KaKs Calculator [24]. Genes with low Ks values (cutoff 0.1, determined by considering the Ks distribution) were excluded as genes with unreliable omega values.

2.4. Synteny and Divergence Analyses

Genomic comparison and similarity calculations were performed using mVISTA [25]. The most closely related species used for synteny and divergence comparison for 14 mangroves were Euphorbia tirucalli (NC_042193.1), Erythroxylum novogranatense (NC_030601.1), Wisteria floribunda (NC_027677.1), Gossypium lobatum (NC_039569.1), Hibiscus rosa-sinensis (NC_042239.1), Heritiera angustata (NC_037784.1), Xylocarpus granatum (NC_039925.1), Trapa natans (NC_042895.1), Punica granatum (NC_035240.1), Lumnitzera littorea (NC_039752.1), and Aphelandra knappiae (NC_041424.1). For comparison within orders, the whole chloroplast genome sequences of Lamiales (Sesamum indicum: NC_016433.2, Lindenbergia philippensis: NC_022859.1, Ajuga reptans: NC_023102.1, Hesperelaea palmeri: NC_025787.1, Scrophularia takesimensis: NC_026202.1, Tanaecium tetragonolobum: NC_027955.1, Erythranthe lutea: NC_030212.1, Paulownia coreana: NC_031435.1, Haberlea rhodopensis: NC_031852.1, Aloysia citrodora: NC_034695.1, Echinacanthus lofouensis: NC_035876.1, and one mangrove Avicennia marina), Malpighiales (Byrsonima coccolobifolia: NC_037191.1, Erythroxylum novogranatense: NC_030601.1, Garcinia mangostana: NC_036341.1, Hirtella racemosa: NC_024060.1, Ricinus communis: NC_016736.1, Salix interior: NC_024681.1, Viola seoulensis: NC_026986.1, and mangroves Bruguiera sexangula, Ceriops tagal, Excoecaria agallocha, Kandelia obovata, and Rhizophora stylosa), Myrtales (Allomaieta villosa: NC_031875.1, Eucalyptus obliqua: NC_022378.1, Lagerstroemia fauriei: NC_029808.1, and mangroves Laguncularia racemosa, Pemphis acidula, and Sonneratia ovata), Sapindales (Azadirachta indica: NC_023792.1, Boswellia sacra: NC_029420.1, Citrus aurantiifolia: NC_024929.1, Leitneria floridana: NC_030482.1, Sapindus mukorossi: NC_025554.1, Spondias bahiensis: NC_030526.1, and mangrove Xylocarpus moluccensis), and Malvales species (Gossypium arboreum: NC_016712.1, Daphne kiusiana: NC_035896.1, and mangroves Hibiscus tiliaceus and Thespesia populnea) were aligned with MAFFT (v7.407) [19], and single nucleotide polymorphisms (SNPs) and insertion and deletions (InDels) were identified and counted in 200 bp windows with an in-house python script.

3. Results and Discussions

3.1. Chloroplast Genome Features

Using a reference genome-based strategy (see Materials and Methods), a total of 483 Mb chloroplast data were generated for 14 mangrove species in six orders: Fabales (Pongamia pinnata), Lamiales (Avicennia marina), Malpighiales (Excoecaria agallocha, Bruguiera sexangula, Kandelia obovata, Rhizophora stylosa, and Ceriops tagal), Malvales (Hibiscus tiliaceus, Heritiera littoralis, and Thespesia populnea), Myrtales (Laguncularia racemosa, Sonneratia ovata, and Pemphis acidula), and Sapindales (Xylocarpus moluccensis). The coverage of chloroplasts ranges from 28X to 526X (Table 1), which might be related with different chloroplast DNA content in the total DNA. The chloroplast genomes were assembled into single circular sequences, ranging from 149 kb (Pongamia pinnata) to 168 kb (Kandelia obovata) (Table 1 and Figure S1). We observed typical quadripartite structures in these chloroplast genomes, with two inverted repeat regions (IRa and IRb), a short single-copy (SSC) region, and a long single-copy (LSC) region. The lengths of these four regions are similar among species from the same order but slightly different between orders. The average lengths of IRs of mangrove species in Fabales, Lamiales, Malpighiales, Malvales, Myrtales, and Sapindales are approximately 23.6 kb, 25.6 kb, 26.3 kb, 26.0 kb, 25.3 kb, and 27.0 kb, respectively. The sizes of the SSC range from 17.9 kb to 20.0 kb and the LSC range from 83 kb to 91 kb (Table 1). We found a GC content between 35% and 39% in the 14 chloroplast genomes. The GC content of IR regions (~43%) is higher than those of the SSC (30%) and LSC (34%) regions (Table S1).
Table 1

Chloroplast genome features of the 14 mangroves species.

SpeciesFamilyExtracted chloroplast data (Mb)Mean coverage (×)Total length (bp)IR length (bp)SSC length (bp)LSC length (bp)
Fabales
Pongamia pinnataFabaceae4.3028.72149,63523,65318,53483,795
Lamiales
Avicennia marinaAcanthaceae32.54213.66152,28825,63817,92483,088
Malpighiales
Excoecaria agallochaEuphorbiaceae11.8273.10161,66726,52519,33689,282
Bruguiera sexangulaRhizophoraceae32.86202.52162,28226,40318,14491,332
Kandelia obovataRhizophoraceae88.42526.82168,00826,32019,95595,413
Rhizophora stylosaRhizophoraceae71.24436.76163,10126,32519,24391,208
Ceriops tagalRhizophoraceae42.89260.80164,47626,31319,15392,697
Malvales
Hibiscus tiliaceusMalvaceae18.75116.25161,31826,15919,71789,283
Heritiera littoralisMalvaceae16.40102.87159,40126,26119,00287,877
Thespesia populneaMalvaceae6.4240.04160,45125,58220,30688,981
Myrtales
Pemphis acidulaLythraceae35.84223.93160,05125,69518,88689,775
Sonneratia ovataLythraceae68.08444.79153,05723,90618,00787,238
Laguncularia racemosaCombretaceae27.43170.77160,67226,35318,88689,071
Sapindales
Xylocarpus moluccensisMeliaceae25.98163.08159,31727,00017,99887,319
The number of chloroplast genes is usually conserved [26], with subtle differences between different species [27, 28]. The mangrove chloroplasts contain ~85 protein-coding genes, ~37 tRNA genes, and eight rRNA genes (Table 2). The gene components of photosystem I (five genes), cytochrome b/f complex (six genes), ATP synthase (six genes), NADH dehydrogenase (12 genes), Rubisco large subunit (rbcL), RNA polymerase (four genes), assembly/stability of photosystem I (ycf3 and ycf4), RNA processing (matK), chloroplast envelope membrane protein (cemA), cytochrome c synthesis (ccsA), ATP-dependent protease (clpP), fatty acid biosynthetic (accD), and proteasome subunit beta type-1 (pbf1) are the same in all the mangrove chloroplasts. We only found infA, a chloroplast genome translation initiation factor gene, in Avicennia marina and Heritiera littoralis (Table 2). This agrees with the fact that infA is commonly lost in angiosperms, especially in rosid species [29].
Table 2

Gene content of the 14 mangrove chloroplast genomes.

Pongamia pinnata Avicennia marina Excoecaria agallocha Bruguiera sexangula Kandelia obovata Rhizophora stylosa Hibiscus tiliaceus Heritiera littoralis Thespesia populnea Ceriops tagal Laguncularia racemosa Sonneratia ovata Pemphis acidula Xylocarpus moluccensis
Total gene number127133132128129129130129129130130130131131
Transfer RNAs (tRNA)3537373737373736373737373737
Ribosomal RNAs (rRNAs)88888888888888
Coding genes8488878384848585848585858686
 Photosystems I55555555555555
 Photosystems II1514141414141414141414141414
 Cytochrome b/f complex66666666666666
 ATP synthase66666666666666
 NADH dehydrogenase1212121212121212121212121212
 Rubisco large subunit11111111111111
 RNA polymerase44444444444444
 Ribosomal proteins (small subunit)1414141313131414141414141515
 Ribosomal proteins (large subunit)1011111011111111111111111111
 Assembly/stability of photosystem I22222222222222
 RNA processing11111111111111
 Chloroplast envelope membrane protein11111111111111
 Cytochrome c synthesis11111111111111
 Proteins of unknown function34444443344444
 ATP-dependent protease11111111111111
 Fatty acid biosynthetic11111111111111
 Translation initiation factor01000001000000
 Putative uncharacterized protein02200000000000
 Proteasome subunit beta type-111111111111111

3.2. Simple Sequence Repeat Content

Simple sequence repeats (SSRs) are tandem repeats (1~6 bp units repeated multiple times) in the genome which have been widely applied as markers for population studies and crop improvements [30-33]. In this study, we detected and compared SSRs in the mangrove and 57 terrestrial plant chloroplast genomes (Table S2). The SSR contents are highly variable among species (Figure S2). Of the 14 mangroves, Kandelia obovate has the highest number of SSRs (194), while Avicennia marina has the fewest (61) (Table 3). Comparing between orders, the Malpighiales (number of SSRs ranges from 133 to 194; Excoecaria agallocha, Bruguiera sexangula, Kandelia obovata, Ceriops tagal, and Rhizophora stylosa) has more SSRs than species of orders Malvales (number of SSRs ranges from 80 to 110; Hibiscus tiliaceus, Heritiera littoralis, and Thespesia populnea) and Myrtales (number of SSRs ranges from 88 to 118; Laguncularia racemosa, Sonneratia ovata, and Pemphis acidula) (Table 3). Assessing SSR categories in the 14 mangrove chloroplast genomes, we found that the mononucleotide type accounted for at least half of the total SSRs (in Laguncularia racemosa and Avicennia marina up to 80%). A/T tandem repeats are most frequent, followed by dinucleotide, tetranucleotide, trinucleotide, pentanucleotide, and hexanucleotide repeats. Similar patterns of SSR variability and constitution were also observed in the 57 terrestrial plant chloroplasts (Figure S2). We propose that the SSRs identified here can serve as useful genetic resources for future population and evolution studies.
Table 3

Number of SSRs in the 14 mangrove chloroplast genomes. Mononucleotide, dinucleotide, trinucleotide, tetranucleotide, pentanucleotide, and hexanucleotide repeat units are abbreviated as u1 to u6.

SpeciesTotalu1u1 : A/Tu1 : C/Gu2u3u4u5u6
Pongamia pinnata 13084822316711
Avicennia marina 614945414700
Excoecaria agallocha 133928751971311
Bruguiera sexangula 175106103331141851
Kandelia obovata 1941061060362020102
Rhizophora stylosa 16911511142213892
Ceriops tagal 1427875316202341
Hibiscus tiliaceus 86555321541020
Heritiera littoralis 11076742761362
Thespesia populnea 8047443173751
Sonneratia ovata 112777251551041
Pemphis acidula 886464077910
Laguncularia racemosa 1189896265810
Xylocarpus moluccensis 9872711661130

3.3. Phylogenetic Relationships of Mangroves

Similar to mitochondrial genomes used in vertebrate genetics, chloroplast genomes are widely used to settle phylogenetic and evolutionary disputes [28]. In order to reveal the phylogenetic relationships of mangroves, we constructed phylogenetic trees from chloroplast genomes from the 14 mangroves species and 57 terrestrial plant families in 16 rosid orders and one asterid order (a data set of 71 plant species) (Table S2). Based on these complete chloroplast genomes, we identified 44 highly conserved genes in the 71 species and constructed three Bayesian inference (BI) trees and three maximum-likelihood (ML) trees (see Materials and Methods; Figure 1 and Figures S3-S7). Three trees constructed using BI and ML strategies exhibited the same topology (Figure 1, Figure S3, Figure S5, and Table S3). Thus, we have produced a well-supported phylogenetic tree of mangrove and terrestrial plants.
Figure 1

The Bayesian-inferred phylogenetic tree based on all chloroplast genes of 14 mangroves and 57 land plant species. Mangroves are indicated in red, and the background colors highlight the orders of mangroves and their sister species. Posterior probabilities are indicated near the branches.

Based on our phylogenetic tree, we found species from the same order to be in one group, and Rosid I (Malpighiales, Oxalidales, Celastrales, Fagales, Cucurbitales, Fabales, Rosales, and Zygophyllales), Rosid II (Malvales, Brassicales, Huerteales, Sapindales, Myrtales, and Geraniales), and a clade of rosids (Saxifragales and Vitales) were classified. For mangrove species, Avicennia marina is close to the other asterid terrestrial plant Echinacanthus lofouensis. Using these two species as outgroups, we obtained a clear phylogenetic relationship of the rest 13 rosid mangrove species. Myrtales is close to Geraniales in this tree and contains five families, of which Myrtaceae and Melastomataceae are in one clade, while Onagraceae and Lythraceae (including two mangroves Sonneratia ovata and Pemphis acidula) are in another clade. The relationships indicated here are consistent with a reported ML tree of Myrtales species [34]. Furthermore, we found that Combretaceae (Laguncularia racemosa) is a separate node close to Onagraceae and Lythraceae, supported by 1.00 posterior probability. For Sapindales, there is one mangrove Xylocarpus moluccensis, a member of Meliaceae whose position coincides with a previous study [35]. For Malvales, three mangroves (Hibiscus tiliaceus, Heritiera littoralis, and Thespesia populnea) together with Huerteales and Brassicales species are clustered as neighboring orders. The relationship of genera within the family of Malvaceae is in agreement with trees in Aquilaria sinensis [36] and Heritiera angustata [37] chloroplast studies, and we further confirmed that the semi-mangrove Thespesia populnea is close to Gossypium species. However, for Malpighiales, an order of high morphological and ecological diversities, the phylogenetic relationship of different families especially Linaceae was less resolved. Other than the grouping of families Linaceae and Euphorbiaceae, our phylogenetic tree is consistent with a previous study which employed 82 plastid genes of 58 species from Malpighiales [38]. We found that Euphorbiaceae constitutes a single branch, while Rhizophoraceae (including four mangroves Bruguiera sexangula, Kandelia obovata, Rhizophora stylosa, and Ceriops tagal) is a neighboring branch to Erythroxylaceae and Clusiaceae. Also, Linaceae forms a sister lineage with Chrysobalanaceae and Malpighiaceae. Again, these relationships are largely accordant with a study of Linum plastome [39]. Finally, our phylogenetic tree supports a sister relationship between the mangrove Pongamia pinnata with the other orders of Rosales, Cucurbitales, and Fagales.

3.4. Synteny and Divergence of the Chloroplast Genomes

We next analyzed the synteny and divergence between the mangrove and related chloroplast genomes. For mangrove species, the genomes have a conserved gene order similar to sister clades, except Heritiera littoralis which probably had been subjected to segmental rearrangements (Figures 2 and 3). Compared to its closely related species Hibiscus tiliaceus and Thespesia populnea, we found a notable rearrangement at position 8,109 bp to 33,498 bp in the Heritiera littoralis chloroplast genome. This region encodes 16 genes, including trnC-GCA, petN, psbM, trnD-GUC, trnY-GUA, trnE-UUC, rpoB, rpoC1, rpoC2, rps2, atpI, atpH, atpF, atpA, trnR-UCU, and trnS-CGA. Assessing the genetic divergence of mangrove chloroplast genomes by the most closely related species and within the orders (see Materials and Methods), we found the lowest divergence in the genera Heritiera and Xylocarpus (Figure 3). Compared to Heritiera and Xylocarpus, there is a relatively higher divergence between Hibiscus rosa-sinensis and Hibiscus tiliaceus, reflecting a higher level of genetic polymorphism of chloroplast genomes within genus Hibiscus. We also observed a more distinct divergence between species from one family/order in most other comparisons (Euphorbiaceae: Euphorbia tirucalli vs. Excoecaria agallocha, Malpighiales: Erythroxylum novogranatense vs. Kandelia obovata/Ceriops tagal/Rhizophora stylosa/Bruguiera sexangula, Fabaceae: Wisteria floribunda vs. Pongamia pinnata, Malvaceae: Gossypium lobatum vs. Thespesia populnea, Lythraceae: Trapa natans vs. Sonneratia ovata, Lythraceae: Punica granatum vs. Pemphis acidula, Combretaceae: Lumnitzera littorea vs. Laguncularia racemosa, Acanthaceae: Aphelandra knappiae vs. Avicennia marina) (Figure 3). Furthermore, according to the whole genomic comparison among multiple chloroplasts within orders, we found that the similarity of coding regions is generally higher than that of the intergenic regions, and the tRNA and rRNA genes are almost identical in all species (Figure S8). For the four regions, variations in SSC and LSC regions are more frequent comparing to the IR regions, indicating the IRs to be more conserved than the single-copy sequences (Figure 4). This is consistent with reports on other plant chloroplasts [40-42].
Figure 2

Synteny between the chloroplast genomes. The species are shown in the phylogenetic tree with red fonts to indicate the mangrove species, and the synteny is shown by linking the homologous genes. Colors at the bottom indicate genes with different functions.

Figure 3

Pairwise comparisons of mangrove species and their most closely related species. Whole chloroplast genome comparisons are shown on the left, and the corresponding divergences are shown on the right.

Figure 4

Divergence within orders of Malpighiales, Myrtales, Malvales, Lamiales, and Sapindales. The IR regions are shown in grey background, and the average divergences of each region are indicated above the lines in the corresponding regions. Two bars at the bottom show the SNPs and InDels in both the gene regions (red) and the intergenic regions (blue).

3.5. Genes under Selective Pressures

Genes in the chloroplasts are functionally important and might have been under selection during evolution. To analyze the selective pressures in mangrove chloroplast genomes, we calculated the nonsynonymous substitutions and synonymous substitution ratio (Ka/Ks) of coding genes in 14 mangrove species and 57 terrestrial plants (see Materials and Methods). Genes with Ka/Ks values above 1.0 should be under positive selection and might be candidate genes responsible for functional adaptations, while genes with values lower than 1.0 should be under negative (purifying) selection [43]. We found the Ka/Ks values of most gene pairs (within and between orders) to be lower than 1.0 (Figure 5), reflecting selection pressures to maintain the gene functions. For instance, genes involved in photosystems (psaA, psaB, psaC, psbA, psbC, psbD, psbE, psbI, and psbM), the cytochrome b/f complex (petB, petD, and petG), and some ATP synthases (atpB and atpH) in all species have Ka/Ks values close to 0. Genes encoding other ATP synthases (atpA, atpE, and atpF), NADH dehydrogenases (ndhA, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, and ndhK), ribosomal proteins (rps2, rps3, rps4, rps8, rps11, rps14, rps15, and rps19), and RNA polymerases (rpoA, rpoB, rpoC1, and rpoC2) also have low Ka/Ks values (mostly between 0 and 0.5). Similar to other plants [39, 44], chloroplast genes involved in photosynthesis and energy metabolism are conserved (very low Ka/Ks ratios) in mangroves.
Figure 5

Ka/Ks values of protein-coding genes in chloroplasts. Ka/Ks values (horizontal axis) of genes in the mangrove species and their related species in six orders (Lamiales, Fabales, Malpighiales, Malvales, Myrtales, and Sapindales) are indicated. Red dots indicate Ka/Ks values of genes in mangrove species, while blue dots indicate Ka/Ks values in terrestrial species.

We further investigated the genes under positive selection. We found that the Ka/Ks values of four genes (petL, psaI, rpl36, and ycf1) were greater than 1.0 in the mangrove species Excoecaria agallocha, Laguncularia racemosa, Pemphis acidula, and Sonneratia ovata. These genes are from four different functional groups, including subunits of cytochrome (petL), subunits of photosystems (psaI), subunits of ribosomes (rpl36), and unclassified genes (ycf1). The gene petL is a component of the cytochrome b6/f complex required for photosynthesis. The Ka/Ks of petL is ~1.5 in mangrove species Sonneratia ovata and Pemphis acidula, and positive selection on this gene was found only in these two species (Figure 5 and Figures S9 and S10). For psaI, a member of photosystem I (PSI), we observed a Ka/Ks value of 1.22 in Laguncularia racemosa (family Combretaceae, order Myrtales), suggesting potential positive selection on this gene in this mangrove. Among all species, Ka/Ks values of psaI range widely (from 0.2 to 1.5, especially in Malpighiales, Myrtales, and Sapindales), which may reflect the different selection pressures and adaptations in the diverse clades examined. Although a report on tobacco showed a role for psaI in stabilizing PSI during leaf senescence [45], the function of the protein remains unknown in most plant species. For rpl36 (LSU), the Ka/Ks values are greater than 1.0 in Pemphis acidula (1.41) and Sonneratia ovata (1.04). The loss of rpl36 might result in severe morphological aberrations, low translational efficiency, and poor photoautotrophic growth [46]. We speculate that relaxed selection on this gene might be associated with adaptations of plants to highly diverse environments. Gene ycf1 is one of the largest genes in the chloroplast genome, and there are usually two [39, 41] or one functional gene copies (i.e., one copy has become a pseudogene) [36, 44, 47, 48]. In this study, one functional copy and one fragment of the ycf1 gene were annotated in 11 mangroves (Avicennia marina, Xylocarpus moluccensis, Hibiscus tiliaceus, Excoecaria agallocha, Bruguiera sexangula, Kandelia obovata, Rhizophora stylosa, Ceriops tagal, Sonneratia ovata, Pemphis acidula, and Laguncularia racemosa). Furthermore, we found that ycf1 genes in many species from Rosid I (such as Malpighiales including mangrove Excoecaria agallocha) have Ka/Ks values around or higher than 1.0, while ycf1 genes in species from Rosid II plants have lower Ka/Ks values (~0.4 in Malvales, Myrtales, Sapindales, Huerteales, and Brassicales). Together with the fact that the ycf1 gene showed relatively lower sequence similarity among different species (Figure S8), we found different selection pressures on this gene in Rosid I and Rosid II here (Figure 5 and Figures S9 and S10). The observed potential positive selection on petL, psaI, and rpl36 in mangroves (especially in the three mangroves Laguncularia racemosa, Pemphis acidula, and Sonneratia ovata of order Myrtales) and a relaxed selection pressure on ycf1 possibly reflect the functional importance of those genes during adaptation.

4. Conclusions

Our study reports 14 complete mangrove chloroplast genomes, as well as a comprehensive comparative chloroplast genome analysis of mangrove and related plant species. The sequenced mangroves span six orders (five rosids and one asterid), making it the first large-scale study on mangrove chloroplast genomes. We found that mangrove chloroplast genomes are similar in structure and gene content. Notable exceptions include the retainment of the translation initiation factor gene infA in two mangrove species (the asterid Avicennia marina and the rosid Heritiera littoralis) and an inversion in the LSC region of mangrove Heritiera littoralis. We used our new mangrove genomes to create a well-supported phylogeny. Protein-coding genes of mangroves were found to be under pressure to maintain gene function, with only a small number of genes in a handful of species showing evidence of positive or relaxed selection. In conclusion, we report 14 complete chloroplast genomes from diverse mangrove species and analyzed their phylogeny and genome features. This study provides a useful resource for future studies on the evolution of mangroves and environmental adaptation.
  44 in total

1.  Mangroves enhance the biomass of coral reef fish communities in the Caribbean.

Authors:  Peter J Mumby; Alasdair J Edwards; J Ernesto Arias-González; Kenyon C Lindeman; Paul G Blackwell; Angela Gall; Malgosia I Gorczynska; Alastair R Harborne; Claire L Pescod; Henk Renken; Colette C C Wabnitz; Ghislane Llewellyn
Journal:  Nature       Date:  2004-02-05       Impact factor: 49.962

Review 2.  The neutral theory of molecular evolution and the world view of the neutralists.

Authors:  M Kimura
Journal:  Genome       Date:  1989       Impact factor: 2.166

3.  New chloroplast microsatellite markers suitable for assessing genetic diversity of Lolium perenne and other related grass species.

Authors:  Kerstin Diekmann; Trevor R Hodkinson; Susanne Barth
Journal:  Ann Bot       Date:  2012-03-14       Impact factor: 4.357

4.  Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns.

Authors:  Robert K Jansen; Zhengqiu Cai; Linda A Raubeson; Henry Daniell; Claude W Depamphilis; James Leebens-Mack; Kai F Müller; Mary Guisinger-Bellian; Rosemarie C Haberle; Anne K Hansen; Timothy W Chumley; Seung-Bum Lee; Rhiannon Peery; Joel R McNeal; Jennifer V Kuehl; Jeffrey L Boore
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-28       Impact factor: 11.205

5.  The Linum usitatissimum L. plastome reveals atypical structural evolution, new editing sites, and the phylogenetic position of Linaceae within Malpighiales.

Authors:  Amanda de Santana Lopes; Túlio Gomes Pacheco; Karla Gasparini Dos Santos; Leila do Nascimento Vieira; Miguel Pedro Guerra; Rubens Onofre Nodari; Emanuel Maltempi de Souza; Fábio de Oliveira Pedrosa; Marcelo Rogalski
Journal:  Plant Cell Rep       Date:  2017-10-30       Impact factor: 4.570

6.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

7.  The Complete Chloroplast Genome Sequences of the Medicinal Plant Forsythia suspensa (Oleaceae).

Authors:  Wenbin Wang; Huan Yu; Jiahui Wang; Wanjun Lei; Jianhua Gao; Xiangpo Qiu; Jinsheng Wang
Journal:  Int J Mol Sci       Date:  2017-10-31       Impact factor: 5.923

8.  NOVOPlasty: de novo assembly of organelle genomes from whole genome data.

Authors:  Nicolas Dierckxsens; Patrick Mardulyn; Guillaume Smits
Journal:  Nucleic Acids Res       Date:  2017-02-28       Impact factor: 16.971

9.  Phylogeographic pattern of Rhizophora (Rhizophoraceae) reveals the importance of both vicariance and long-distance oceanic dispersal to modern mangrove distribution.

Authors:  Eugenia Y Y Lo; Norman C Duke; Mei Sun
Journal:  BMC Evol Biol       Date:  2014-04-17       Impact factor: 3.260

10.  Complete chloroplast genome sequences of Lilium: insights into evolutionary dynamics and phylogenetic analyses.

Authors:  Yun-Peng Du; Yu Bi; Feng-Ping Yang; Ming-Fang Zhang; Xu-Qing Chen; Jing Xue; Xiu-Hai Zhang
Journal:  Sci Rep       Date:  2017-07-18       Impact factor: 4.379

View more
  7 in total

1.  Comparative genomics and phylogenetic relationships of two endemic and endangered species (Handeliodendron bodinieri and Eurycorymbus cavaleriei) of two monotypic genera within Sapindales.

Authors:  Jiaxin Yang; Guoxiong Hu; Guangwan Hu
Journal:  BMC Genomics       Date:  2022-01-06       Impact factor: 3.969

Review 2.  Genetic and molecular mechanisms underlying mangrove adaptations to intertidal environments.

Authors:  Ashifa Nizam; Suraj Prasannakumari Meera; Ajay Kumar
Journal:  iScience       Date:  2021-11-30

3.  Comparative chloroplast genome and phylogenetic analyses of Chinese Polyspora.

Authors:  Zhi-Feng Fan; Chang-Le Ma
Journal:  Sci Rep       Date:  2022-09-26       Impact factor: 4.996

4.  Characterization of the complete chloroplast genome of mangrove Bruguiera gymnorrhiza (L.) Lam. ex Savigny.

Authors:  Su-Yuan Li; Yi-Chan Li; Tian-Hao Zhang; Lan-Li Qin; Yi-Dong An; Yu-Kun Pang; Guo-Feng Jiang
Journal:  Mitochondrial DNA B Resour       Date:  2021-06-23       Impact factor: 0.658

5.  Complete chloroplast genome of a semi-mangrove plant Hibiscus tiliaceus (Malvaceae).

Authors:  Gang Hu; Cong Hu; Xingbing Peng; Shaonuan Lu; Zhonghua Zhang
Journal:  Mitochondrial DNA B Resour       Date:  2021-06-14       Impact factor: 0.658

6.  Characterization of the complete chloroplast genome of mangrove Rhizophora apiculata Blume (Rhizophoraceae).

Authors:  Yi-Chan Li; Su-Yuan Li; Tian-Hao Zhang; Lan-Li Qin; Yi-Dong An; Yu-Kun Pang; Guo-Feng Jiang
Journal:  Mitochondrial DNA B Resour       Date:  2021-06-23       Impact factor: 0.658

7.  Comparative Analysis and Phylogenetic Relationships of Ceriops Species (Rhizophoraceae) and Avicennia lanata (Acanthaceae): Insight into the Chloroplast Genome Evolution between Middle and Seaward Zones of Mangrove Forests.

Authors:  Panthita Ruang-Areerate; Thippawan Yoocha; Wasitthee Kongkachana; Phakamas Phetchawang; Chatree Maknual; Wijarn Meepol; Darunee Jiumjamrassil; Wirulda Pootakham; Sithichoke Tangphatsornruang
Journal:  Biology (Basel)       Date:  2022-02-28
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.