Literature DB >> 31653891

Structural variation of the complete chloroplast genome and plastid phylogenomics of the genus Asteropyrum (Ranunculaceae).

Jian He1, Min Yao1, Ru-Dan Lyu1, Le-Le Lin1, Hui-Jie Liu2, Lin-Ying Pei3, Shuang-Xi Yan4, Lei Xie5, Jin Cheng1.   

Abstract

Two complete chloroplast genome sequences of Asteropyrum, as well as those of 25 other species from Ranunculaceae, were assembled using both Illumina and Sanger sequencing methods to address the structural variation of the cp genome and the controversial systematic position of the genus. Synteny and plastome structure were compared across the family. The cp genomes of the only two subspecies of Asteropyrum were found to be differentiated with marked sequence variation and different inverted repeat-single copy (IR-SC) borders. The plastomes of both subspecies contains 112 genes. However, the IR region of subspecies peltatum carries 27 genes, whereas that of subspecies cavaleriei has only 25 genes. Gene inversions, transpositions, and IR expansion-contraction were very commonly detected in Ranunculaceae. The plastome of Asteropyrum has the longest IR regions in the family, but has no gene inversions or transpositions. Non-coding regions of the cp genome were not ideal markers for inferring the generic relationships of the family, but they may be applied to interpret species relationship within the genus. Plastid phylogenomic analysis using complete cp genome with Bayesian method and partitioned modeling obtained a fully resolved phylogenetic framework for Ranunculaceae. Asteropyrum was detected to be sister to Caltha, and diverged early from subfamily Ranunculoideae.

Entities:  

Mesh:

Year:  2019        PMID: 31653891      PMCID: PMC6814708          DOI: 10.1038/s41598-019-51601-2

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

In recent years, the use of whole chloroplast (cp) genome data for plant phylogenetic reconstruction has been greatly improved our understanding of evolutionary relationships of angiosperms at a wide range of taxonomic levels[1-4]. Chloroplast genome shows uniparental inheritance in most angiosperm species and has a size ranging from 115 to 165 kb[5]. It usually has a conserved circular structure containing a large single copy (LSC) and a small single copy (SSC), which are separated by two copies of inverted repeat (IR) regions. Rates of nucleotide substitution in the cp genome are relatively slow and therefore can provide resolution of plant phylogeny at generic and familial levels[6-13]. The gene number and arrangement of the cp genome are often well conserved in angiosperms[14]. However, variations in the structure of the cp genome, including inversions, transpositions of certain regions, and expansion of IRs, are not uncommon in many families, including Ranunculaceae, Fabaceae, and Asteraceae[11,12,15-20]. Recent studies of the cp genomes of Ranunculales showed that inversions, transpositions, and IR expansions may provide strong phylogenetic information, and plastid phylogenomic analysis can yield infra-familial phylogeny with high resolution[3,11,12,15,20]. Asteropyrum Drumm. et Hutch. is a small but very distinctive genus in Ranunculaceae distributed predominantly in China. It is famous for its great pharmaceutical value and can be used as substitutes of goldthread (Coptis Salisb.) for curing icterus, hydroncus, and diarrhea in rural areas of southern China[21]. The plants of Asteropyrum are small perennial herbs with simple peltate leaves, white sepals and small golden yellow petals (Fig. 1). Traditionally, the genus was considered to have two species, A. peltatum (Franch.) Drumm. et Hutch. and A. cavaleriei (Lévl. et Vant.) Drumm. et Hutch. They have different sizes and leaf shapes[21-23]. Asteropyrum peltatum is often smaller (usually less than 10 cm tall) than A. cavaleriei with suborbicular to inconspicuously 5-sided peltate leaves (Fig. 1A,B). Whereas, scape of A. cavaleriei is often 12–20 cm tall and the leaf is also much larger with five-angled blade (Fig. 1C,D). However, intermediate forms are common in the overlap zones of these two taxa (Fig. 1E,F). Thus, A. cavaleriei has often been treated as a subspecies of A. peltatum[24].
Figure 1

Field photographs of Asteropyrum taxa. (A,B) Asteropyrum peltatum ssp. peltatum (photos taken by L. Xie from Emei Mountain, Sichuan, China). (C,D) Asteropyrum peltatum ssp. cavaleriei (photos taken by L. Xie from Eryanping, Yibin, Sichuan, China). (E,F) An intermedium form of the two subspecies (photos taken by L. Xie from Laojunshan, Yibin, Sichuan, China).

Field photographs of Asteropyrum taxa. (A,B) Asteropyrum peltatum ssp. peltatum (photos taken by L. Xie from Emei Mountain, Sichuan, China). (C,D) Asteropyrum peltatum ssp. cavaleriei (photos taken by L. Xie from Eryanping, Yibin, Sichuan, China). (E,F) An intermedium form of the two subspecies (photos taken by L. Xie from Laojunshan, Yibin, Sichuan, China). The systematic position of Asteropyrum within Ranunculaceae has long been an interesting issue and disputed for almost a century. Drummond & Hutchinson[25] separated this genus from Isopyrum L. and reported that Asteropyrum may be closely related to Caltha L. and Trollius L. However, later authors argued that this genus may be closely related to Isopyrum[26], Copis[21,27], or Calathodes Hook. f. & Thoms.[23], and its chromosome type was also interpreted differently[24,28-31]. Recent molecular phylogenetic studies also did not clearly resolve the position of Asteropyrum. It was found to be closely related to the (Beesia, (Eranthis, Cimicifuga)) clade by a study using rbcL and 26s rDNA[32], or sister to Caltha by a study using three plastid regions, one nuclear region, and morphological data[33], or sister to tribe Cimicifugeae by a study using five plastid and one nuclear regions[34], and sister to Callianthemum C. A. Mey. in a study using four plastid, three nuclear, and one mitochondrial regions[35]. However, these studies did not obtain a robust phylogenetic framework of Ranunculaceae, and the systematic position of Asteropyrum in the family still remains to be clarified. Plastid phylogenomic studies may be particularly suitable for resolving the generic relationship within Ranunculaceae, and structural variations of the cp genome, e.g., gene inversion, gene transposition, and IR expansion-contraction, may provide important systematic information about the family[3,11,12,15,20]. Zhai et al. carried out a plastid phylogenomic study focusing on Ranunculaceae based on a well sampling scheme[15]. They also discussed structural variation of plastid genome sequences of the family. However, their study focused on tribal relationship of Ranunculaceae and only included one sample of Asteropyrum. In their study, they detected and reported five types of plastome sequences of the family based on gene inversions and transpositions but left IR expansion-contraction (widely distributed and highly diverged in the family) issue unaddressed. For phylogenomic analyses, they used two matrices (coding region and complete plastome sequences) and three phylogenetic reconstruction methods (Parsimony, Maximum Likelihood, and Bayes) with no partitioned modeling applied the Bayesian analysis. The results gave a better resolution of the family. However, position of Asteropyrum was still not well supported. In this study, we reported the complete cp genome sequences of both subspecies of Asteropyrum, as well as 25 plastome sequences from other genera in Ranunculaceae using genome skimming data. We then compared the synteny and plastome structure across the family and conducted a plastid phylogenomic study using partitioned modeling method to resolve deep-level relationships and explore plastome structural evolution across Ranunculaceae. The aims of these analyses were to clarify variation in the cp genome across the genus Asteropyrum, to detect structural variation in the Asteropyrum cp genome in comparison with other genera in the family, to infer the phylogenetic position of Asteropyrum within the family, and to try to compare and reconstruct the deep-level relationships of Ranunculaceae using different plastome partitions.

Results

Plastome organization and features of Asteropyrum and its relatives

We obtained 3.0 Gb and 3.2 Gb Next-Generation Sequencing (NGS) clean datasets for Asteropyrum peltatum ssp. peltatum (Franch.) Drumm. & Hutch. (16,552,494 reads) and A. peltatum ssp. cavaleriei (Lévl. & Vant.) Q. Yuan & Q. E. Yang (17,655,994 reads), respectively. Blat analysis was used to select 133,486 putative plastid reads for ssp. peltatum and 144,722 reads for ssp. cavaleriei. The reads from ssp. peltatum were used to obtain two large contigs from the de novo assembly (51,204 bp, and 81,421 bp). One gap was bridged using Sanger sequencing (with primers: LSCF: TGCGATGCTCTAACCTCTGAG; LSCR: AGAGCAATCCTAACCAGAATCATCT). For the ssp. cavaleriei data, only one large contig (131,836 bp), which included complete LSC, IR, and SSC regions was derived. Information regarding the cp genome assembly for all of the other newly sequenced samples from Ranunculaceae is presented in Supplementary Table S1. The complete cp genome sequences of ssp. peltatum and ssp. cavaleriei are 164,455 bp and 164,274 bp, respectively (Fig. 2, Table 1), and the rate of identical sites between the two plastome sequences was 98.1%. The two complete cp genome sequences had similar GC content (38.0% for ssp. cavaleriei and 37.9% for ssp. peltatum). Within the circular plastid genome of Asteropyrum, the IR region had the richest GC content, followed by the LSC region, while the SSC region had the lowest GC content (Table 1).
Figure 2

The complete plastid genomes of the two subspecies of Asteropyrum. Upper: Schematic representation of the two plastome sequences of Asteropyrum showing the different IR-SC boundaries of the two subspecies; Lower: Chloroplast genome maps of the two subspecies. The thick lines on the outer complete circle identify the inverted repeat regions (IRa and IRb). The innermost track of the plastome shows the G + C content. Genes on the outside of the map are transcribed in a clockwise direction, whereas genes on the inside of the map are transcribed in a counter clockwise direction.

Table 1

Summary of the chloroplast genomes of Asteropyrum.

CategoryAsteropyrum peltatum ssp. peltatumAsteropyrum peltatum ssp. cavaleriei
Total cp genome size (bp)164455164274
Length of large single copy region (bp)8181984284
Length of inverted repeat region (bp)3265931429
Length of small single copy region (bp)1731817132
Coding size (bp)106585105105
Intron size (bp)1566515730
Spacer size (bp)4220543439
Total GC content (%)38.037.9
GC content of LSC (%)36.136.0
GC content of IR (%)41.742.1
GC content of SSC (%)32.532.6
Total number of genes112112
Number of genes in LSC7374
Number of genes in SSC1212
Number of genes duplicated in IR2725
Number of protein encoding genes78 (rpl32 absent*)78 (rpl32 absent*)
Number of tRNA genes3030
Number of rRNA genes44

*Comparing to Amborella.

The complete plastid genomes of the two subspecies of Asteropyrum. Upper: Schematic representation of the two plastome sequences of Asteropyrum showing the different IR-SC boundaries of the two subspecies; Lower: Chloroplast genome maps of the two subspecies. The thick lines on the outer complete circle identify the inverted repeat regions (IRa and IRb). The innermost track of the plastome shows the G + C content. Genes on the outside of the map are transcribed in a clockwise direction, whereas genes on the inside of the map are transcribed in a counter clockwise direction. Summary of the chloroplast genomes of Asteropyrum. *Comparing to Amborella. The LSC region of ssp. cavaleriei was found to be longer than that of ssp. peltatum (84,284 bp vs. 81,819 bp), whereas the IR region of ssp. cavaleriei was significantly shorter than that of ssp. peltatum (31,429 bp vs. 32,659 bp), because the LSC region of ssp. cavaleriei carried two more genes (rpoA and rps11) at the border of the LSC and IR than that of ssp. peltatum. At the same time, the gene number of the IR region of ssp. cavaleriei was two less than that of ssp. peltatum (Fig. 2). Because the gene numbers in the IR regions of the two subspecies differed, we designed two pairs of primers to assess the stability of the IR-SC boundary of both subspecies using additional samples. We also tested the IR-SC boundary of an intermediate form as well. The results showed that the IR-SC boundaries within each subspecies are stable. The intermediate individual collected from Yibin, Sichuan had an IR-SC boundary that was identical with that of ssp. cavaleriei. The features of the other 25 newly sequenced cp genomes and their gene maps are presented in Supplementary Table S2 and Supplementary Fig. S1.

Structural variation of the chloroplast genome in Ranunculaceae

When the LAGAN alignment program was applied, the mVISTA results showed a large proportion of un-matched area in tribe Anemoneae and the genus Adonis (Fig. 3), which indicated inversions and gene transpositions in the cp genomes of these plants. The shuffle-LAGAN method was used to obtain a well-matched global alignment for all plastome sequences of Ranunculales (Supplementary Fig. S2). Inversions and transpositions in cp genomes of tribe Anemoneae have been reported previously[11,12,15,20], and the cp genome of Adonis was found to have a large inversion in the LSC region between rps16 and trnT-UGU, which contains 35 genes (43,411 bp in total length), including 21 protein-coding genes and 14 tRNA genes (Supplementary Fig. S1-Adonis). The inversion within Adonis showed no phylogenetic relationship with that in tribe Anemoneae.
Figure 3

Sequence alignment of the complete plastome sequences of Ranunculaceae and outgroup samples compared in this study using the mVISTA program and LAGAN method. A cut-off of 70% similarity was used for the plot, and the Y-scale represents the percent similarity ranging from 50–100%. Blue represents coding regions, and pink represents non-coding regions.

Sequence alignment of the complete plastome sequences of Ranunculaceae and outgroup samples compared in this study using the mVISTA program and LAGAN method. A cut-off of 70% similarity was used for the plot, and the Y-scale represents the percent similarity ranging from 50–100%. Blue represents coding regions, and pink represents non-coding regions. IR expansion-contraction was found to be very common in Ranunculaceae and its relatives (Table 2, Supplementary Fig. S3). After comparing a wide range of angiosperm cp genomes, we identified the cp genome of Amborella as the standard IR composition (with 17 genes) that is commonly shared by most of angiosperms. In Ranunculaceae, the cp genomes of Asteropyrum, Dichocarpum W. T. Wang & P. K. Hsiao, Hydrastis L., and tribe Anemoneae (Anemone L., Pulsatilla Mill., Hepatica Mill., Anemoclema (Franch.) W. T. Wang, Clematis L., Naravelia DC., etc.) showed IR expansion, whereas IR contraction was detected in Helleborus L. and Ceratocephala Moen. (detailed IR information for all tested Ranunculaceae samples is presented in Supplementary Table S3). The longest IR in Ranunculales was found to be that of Berberis amurensis Rupr., which carries 32 genes. Asteropyrum showed the longest IR regions within the family Ranunculaceae (Fig. 2, Table 2).
Table 2

Information regarding IR expansion-contraction for all analyzed Ranunculales species using the Amborella plastome as the standard.

SpeciesGene number in IRlength of IRAdditional (or absent) genes when IR expansion-contraction occurs (Amborella as the standard)
Amborella trichopoda 1726651
Aconitum barbatum 1726090
Aconitum kuznezoffii 1726282
Aconitum reclinatum 1726061
Adonis coerulea 1726087
Anemoclema glaucifolium 2431256 infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Anemone tomentosa 2431490 infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Anemone trullifolia 2431022 infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Asteropyrum peltatum spp. cavaleriei 2531429 rpl36, infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Asteropyrum peltatum spp. peltatum 2732659 rpoA, rps11, rpl36, infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Batrachium bungei 1725352
Beesia calthifolia 1726500
Berberis amurensis 3237152 psbB, psbT, psbN, psbH, petB, petD, rps11, rpl36, infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Callianthemum alatavicum 1725978
Caltha palustris 1726421
Ceratocephala falcata 1624165rpl2 (absent)
Cimicifuga dahurica 1726572
Circaeaster agrestis 2128023 rps19, trnQ-UUG, trnL-UAG, rpl32
Clematis aethusifolia 2331041 rps8, rpl14, rpl16, rps3, rpl22, rps19
Archiclematis alternata 2331037 rps8, rpl14, rpl16, rps3, rpl22, rps19
Clematis fusca 2431039 infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Clematis terniflora 2431045 infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Coptis quinquesecta 1726442
Delphinium anthriscifolium 1725977
Delphinium ceratophorum 1726560
Dichocarpum dalzielii 1726535
Dichocarpum sutchuenense 1927622 rpl22, rps19
Gymnaconitum gymnandrum 1726140
Halerpestes sarmentosa 1725057
Helleborus thibetanus 1624999rpl2 (absent)
Hepatica henryi 2431039 infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Hydrastis canadensis 1827032 rps19
Kingdonia uniflora 1931109ndhB (loss); rps19, ycf1, rps15
Megaleranthis saniculifolia 1726608
Naravelia pilulifera 2331054 rps8, rpl14, rpl16, rps3, rpl22, rps19
Nigella_damascena 1725167
Oxygraphis glacialis 1725094
Pulsatilla chinensis 2431115 infA, rps8, rpl14, rpl16, rps3, rpl22, rps19
Ranunculus macranthus 1725791
Ranunculus sceleratus 1725302
Sinopodophyllum hexandrum 1725950
Souliea vaginata 1726533
Stephania japonica 1724340
Thalictrum coreanum 1726403
Thalictrum minus 1726482
Thalictrum petaloideum 1726480
Thalictrum tenue 1726504
Trollius chinensis 1726627
Trollius ranunculoides 1726500
Information regarding IR expansion-contraction for all analyzed Ranunculales species using the Amborella plastome as the standard.

Plastid phylogenomic analyses

Seven datasets and three methods were used to obtain 21 phylogenetic frameworks for the family Ranunculaceae. All of these frameworks were largely similar to each other (presented in Supplementary Fig. S4). Topology conflicts among different datasets or methods were usually not well supported statistically. The parameters of the parsimony analyses are presented in Table 3. This analysis yielded less resolved phylogenies in comparison with the ML and Bayesian methods. The phylogenetic inferences using the parsimony method showed largely congruent results among the seven datasets. The complete cp genome dataset obtained the best resolved and supported phylogenetic tree, in which Asteropyrum was sister to Nigella with weak bootstrap support (BS = 62, Supplementary Fig. S4). This result was different from ML and Bayesian analyses (Asteropyrum sister to Caltha).
Table 3

Characteristics of the seven datasets used for parsimony analysis.

DatasetaNo. of taxaAligned lengthbVariable sites (percentage of aligned length)Informative sites (percentage of aligned length)No. of MP treesTree lengthConsistency indexRetention index
Large single copy4877968 bp36926 bp (47.36%)24041 bp (30.83%)1932690.58500.6891
Small single copy4818551 bp11030 bp (60.93%)7645 bp (41.21%)2329220.54090.6496
Inverted repeats4824318 bp5713 bp (23.49%)2066 bp (8.50%)382450.80180.7851
Coding regions4874772 bp24741 bp (33.09%)15204 bp (20.33%)2584540.58030.6834
Intergenic spacers4831036 bp21476 bp (69.20%)14081 bp (45.37%)1587310.58410.6781
Introns4814373 bp6755 bp (47.00%)3979 bp (27.68%)1156480.61820.7007
Complete cp genome48120181 bp52972 bp (44.08%)33264 bp (27.68%)11201810.58600.6825

aOnly one IR region was used for analyses.

bWith removal of ambiguous alignments.

Characteristics of the seven datasets used for parsimony analysis. aOnly one IR region was used for analyses. bWith removal of ambiguous alignments. The topologies from the Bayesian and ML analyses were almost the same for each dataset, but the ML analyses also yielded less resolved and supported trees than the Bayesian method. Among all the 21 phylogenetic trees, the one obtained from the complete plastome sequence dataset with Bayesian method showed a fully resolved phylogeny with all the branches supported by PP value of 1 (Fig. 4). Therefore, the phylogenetic relationships discussed below were mainly based on this result.
Figure 4

Phylogeny of Ranunculaceae species inferred from complete plastome sequences using Bayesian methods with partitioned modeling. The Bayesian phylograms show the posterior probability (PP) values on each node.

Phylogeny of Ranunculaceae species inferred from complete plastome sequences using Bayesian methods with partitioned modeling. The Bayesian phylograms show the posterior probability (PP) values on each node.

Discussions

After comparing the first diverged Amborella[36] and other angiosperm species, we determined that a total number of 113 genes (with 17 genes in IR region) in the cp genome can be set as a primitive cp genome structure based on its prevalence in angiosperms. Then, we summarized the structural variation of the cp genomes (including gene composition, gene inversion and transposition, and IR expansion-contraction) across Ranunculaceae based on all the available data. Structural variations of the cp genome within Ranunculaceae have been reported[11,12,15,20,37,38]. Using restriction site mapping, Johansson[38] reported large inversions and gene loss in plastomes of Adonis species. In this study, one inversion in A. coerulea, which was identical with the A. vernalis inversion described by Johansson[38], was confirmed and located in detail. Zhai et al. reported the same inversion in A. sutchuenensis[15]. We also detected one gene loss (rpl32) in A. coerulea (Supplementary Table S3). The inversion within Adonis plastomes could be a synapomorphy within the genus. Structural variations of cp genome have been reported within tribe Anemoneae by previous studies[11,12,15,20,37]. In comparison with Adonis, those cp genome structural variations of tribe Anemoneae differed in size and number. No evidence indicates that these inversions in the cp genomes of Adonis and tribe Anemoneae are phylogenetically related. IR expansion-contraction is also an important process involved in cp genome variation within Ranunculaceae. Based on a broad comparison with other angiosperm species, ten genera (Asteropyrum, Clematis, Archiclematis, Naravelia, Pulsatilla, Anemone, Anemoclema, Hepatica, Dichocarpum, and Hydrastis) were found to exhibit IR expansion, whereas two genera (Helleborus and Ceratocephala) showed IR contraction. Some of these IR expansion events may be phylogenetically informative. For example, within the well supported tribe Anemoneae, a single IR-expansion, which may be a synapomorphy of the plastome variation in the tribe, was detected[12]. However, in other case, IR expansion-contraction may occur independently in some genera. For example, the cp genomes from two species of Dichocarpum showed different IR regions. One species, D. dalzielii, had a normal 17-gene IR, whereas the other, D. sutchuenense, carried expanded IR regions (19 genes) (Table 2). On the other hand, IR contraction and gene loss in Helleborus and Ceratocephala (Table 2) also evolved independently within Ranunculaceae because these two genera were separated phylogenetically (Fig. 4). The genus Asteropyrum did not have gene inversions or transpositions in their cp genomes, but they showed significant IR expansion and carried the longest IR regions in the family (Table 2). Different gene compositions in IR regions of the two subspecies were detected (Fig. 2, Table 2). Two more genes (rpoA and rps11) were present in the IR region of ssp. peltatum, and this variation is stable within the subspecies. These phenomena suggested that plastome structural variation can occur not only among genera but also within a single species. In the present study, an intermediate sample showed the same IR-SC boundary with A. peltatum ssp. cavaleriei. Morphologically, ssp. peltatum and ssp. cavaleriei can be easily distinguishable by their size and shape of leaves. Intermediate forms between the two subspecies in morphology and palynology are sometimes present at their overlapping zones[24] and can be either more similar to ssp. peltatum or to ssp. cavaleriei. The intermediate individual sampled in this study was small in its leaf and scape size but with slightly five-angled leaves (Fig. 1E,F). According to Yuan & Yang[24], it can be recognized as an intermediate form but more similar to ssp. cavaleriei. The IR-SC boundary indicated that this intermediate individual could be a hybrid one between the two subspecies with ssp. cavaleriei as its maternal parent. A large number of molecular phylogenetic as well as several phylogenomic analyses have been conducted for Ranunculaceae[33-35,39-45]. However, these studies have suffered from poor resolution largely due to an insufficient phylogenetic signal or insufficient sampling. Zhai et al. carried out a well sampled plastid phylogenomic study on Ranunculaceae[15], and they obtained a better phylogenetic framework of the family. However, this study did not separate the plastid sequence in detail and the substitution model was tested without partitioning. Thus, we still do not know the diversification and phylogenetic resolving ability of different partition of the plastid genome sequences. In this study, phylogenetic relationships within Ranunculaceae were inferred from six separate datasets, as well as a complete plastome sequence data. Among all the six separate datasets, intergenic spacer data showed the highest rate of informative sites (45.37%). Whereas, the IR region showed the lowest rate of informative sites (8.50%), indicating that it is the most conserved region in the entire plastid genome (Table 3). The phylogenetic resolution of IR region was also shown to be the worst among the seven datasets (Supplementary Fig. S4) because of its small number of phylogenetic signals. All the phylogenetic trees inferred by different datasets were largely congruent (Supplementary Fig. S4). Hydrastis and Coptis consistently appeared as the earliest diverged lineages in the family. Major clades, including subfamily Thalictroideae, tribe Adonideae, tribe Ranunculeae, tribe Anemoneae, tribe Cimicifugeae, and tribe Delphinieae were resolved in all the analyses. However, the systematic positions of Asteropyrum, Caltha, Helleborus, Callianthemum and Nigella were found to be unstable and often had low statistical support values using the six separate datasets. These genera may have undergone unusual evolutionary processes, such as ancient hybridization or rapid radiation, coupled with their origin and evolution process. Because of the different rates and patterns of nucleotide substitutions among the cp genome sequences[46], data partitioning methods are required for phylogenetic reconstruction to ensure the accuracy of the analysis[2,47-50]. Our results showed that Bayesian analyses with partitioned models always obtained the best resolved Ranunculaceae phylogeny for each dataset. Using Bayesian analysis, the complete plastome dataset obtained a fully resolved phylogeny for Ranunculaceae (Fig. 4), which was better resolved than all the previous molecular and phylogenomic studies[15,33-35,39-45]. In general, the Bayesian phylogeny inferred from the complete cp genome sequences is largely congruent with the phylogeny obtained by Cossard et al.[35] Using eight DNA fragments from the chloroplast, mitochondrial, and nuclear genomes, Cossard et al. identified the sister relationship of subfamilies Thalictroideae and Adonideae with insignificant statistical support (PP = 0.80)[35]. This relationship was not resolved by most previous molecular phylogenetic studies[32,33,40-42,44,51,52]. In the present study, our plastid phylogenomic analysis resolved the sister relationship between subfamilies Thalictroideae and Adonideae (Fig. 4), and supported the hypothesis of Cossard et al. From the Bayesian analysis, we detected the sister relationship of Callianthemum and Helleborus (Fig. 4) which was not resolved by all the previous studies of Ranunculaceae phylogeny[32,33,40-42,44,51,52]. The clade of Callianthemum and Helleborus was found to be related to a well-supported clade of tribe Ranunculeae + tribe Anemoneae. The sister relationship of tribe Cimicifugaea and the tribe Delphinieae + tribe Nigellaea clade (Fig. 4) was resolved by our analysis. This relationship was previously reported by Hoot but without sufficient statistical support[41]. The phylogenetic position of Asteropyrum has been disputed for a century and still remains to be resolved by now[20,24,32,35,53]. In this study, the sister relationship of Asteropyrum and Caltha was discovered using the complete cp genome datasets with ML and Bayesian methods. This clade was further found to be first diverged from subfamily Ranunuloideae (Fig. 4). Wang et al. also proposed sister relationship of Asteropyrum and Caltha by combining molecular and morphological data, but their results was not statistically supported[32]. It is noteworthy that Cossard et al. resolved a sister relationship of Asteropyrum and Callianthemum with a strong support value using a combination of chloroplast, mitochondrial, and nuclear genes[35]. This unexpected result was solely contributed by the nuclear gene RanaCYL1 dataset. The contradictory results by our cp genome analysis and nuclear RanaCYL1 data may be caused by either an ancient hybridization event or incomplete lineage sorting of nuclear RanaCYL1. Further studies focused on this issue may be conducted using additional markers from nuclear genome. In the Bayesian phylogram, many major clades of Ranunculaceae had a very short branch length, including the Asteropyrum + Caltha clade, Cimicifugeae + Delphinieae + Nigelleae clade, and Ranunculeae to Nigelleae clade (Fig. 4). This indicated ancient lineage radiation of the subfamily Ranunculoideae, which was also proposed by Zhai et al.[15] Thus, it is not surprising that previous molecular phylogenetic analyses using limited molecular markers failed to resolve the phylogenetic framework of Ranunculaceae. Further phylogenomic studies using additional evidence, such as mitochondrial and nuclear genomic data, may help to deeper our understanding of the evolution of Ranunculaceae.

Conclusion

The two subspecies of Asteropyrum carried quite different plastid genomes with different IR-SC borders and much sequence variation. The plastome sequence of Asteropyrum has the longest IR regions in the family. Unlike Adonis and species from tribe Anemoneae, no gene inversions and transpositions were detected in the Asteropyrum cp genome. The complete cp genome showed excellent suitability for drawing phylogenetic inferences within Ranunculaceae. The complete cp genome sequence, as well as its structural variation (gene inversions-transpositions and IR expansion-contraction), can provide abundant phylogenetic information for the family. In contrast, non-coding regions have excessive variations and a high level of noise, so they are not suitable for resolving the generic relationships within the family. As mentioned by Ma et al., analyses in plastid genomic studies should always be conducted using partitioned datasets, and parsimony analysis often obtains unsatisfactory results[2]. Our plastid phylogenomic inferences, which were obtained using the complete cp genome sequence and Bayesian analysis, provided a better resolved phylogenetic framework for Ranunculaceae in comparison with all the previous studies. Asteropyrum was detected to be closely related to Caltha, and unusual ancient evolutionary processes for Asteropyrum was also suggested by our findings.

Methods

Taxon sampling and sequencing

Fresh leaves were collected from plants from field by the authors (Table 4) and the samples of Nigella and Helleborus were from cultivated plants (seeds were obtained from Mr. Fothergill’s Seeds Limited Company). The leaf samples were dried with tell-tale silica-gel. Samples from both subspecies of Asteropyrum were collected from different populations. We also sequenced 25 other species from Ranunculaceae and mined 21 accessions of complete cp genomes (including five outgroups from Ranunculales) from GenBank for comparative analysis. The samples covered 31 genera and represented most tribes of Ranunculaceae, but did not include the basal clades of Trib. Glaucidieae and Trib. Xanthorhizeae[33] (Table 4). For all leaf tissue samples, total genomic DNA was isolated using the CTAB method[54] and assessed by agarose gel electrophoresis.
Table 4

Information regarding the sequenced Asteropyrum materials and other samples from Ranunculaceae*.

SpeciesSample localityVoucher (Herbarium)Genbank accessionReference
Aconitum barbatum Songshan, Beijing, ChinaL. Xie 20140820 (BJFC)MK253470This study
Aconitum kuznezoffii Donglingshan, Beijing, ChinaL. Xie 20150709 (BJFC)MK253471This study
Aconitum reclinatum NANAMF186593Kong et al.[69]
Adonis coerulea Xiaojin, Sichuan, ChinaH. J. Liu I-1109 (BJFC)MK253469This study
Anemoclema glaucifolium Shangri-la, Yunnan, ChinaB. Xu-M417-090 (SWFC)MH205609Liu et al.[12]
Anemone tomentosa Barkam, Sichuan, ChinaH. J. Liu I”-1080 (BJFC)MG001339Liu et al.[12]
Anemone trullifolia Dinggye, Xizang, ChinaPE2013 Tibet 2588 (PE)MH205608Liu et al.[12]
Archiclematis alternata Nyalam, Xizang, ChinaPE2010 Tibet 963 (PE)MG675221Liu et al.[11]
Asteropyrum peltatum ssp. peltatumEmei, Sichuan, ChinaL. Xie 20150094 (BJFC)MG734862This study
A. peltatum ssp. cavalerieiYiBin, Sichuan, ChinaL. Xie 2014-YB013 (BJFC)MG734861This study
Batrachium bungei Ali, Xizang, ChinaTibet2013 4048 (PE)MK253468This study
Beesia calthifolia Emei, Sichuan, ChinaL. Xie 2015-EM19(BJFC)MK253467This study
Berberis amurensis NANAKM057374Unpublished
Callianthemum alatavicum Urumqi, Xinjiang, ChinaZ. Z. Yang 0524 (BJFC)MK253466This study
Caltha palustris Emei, Sichuan, ChinaL. Xie 2015-EM25 (BJFC)MK253465This study
Ceratocephala falcata Altay, Xinjiang, ChinaL. Xie 2016003 (BJFC)MK253464This study
Cimicifuga dahurica Xiaowutai, Hebei, China XWT2011033 MK253463This study
Circaeaster agrestis Shennongjia, Hubei, ChinaY. X. Sun 1510 (HIB)KY908400Sun et al.[3]
Clematis aethusifolia Donglingshan, Beijing, ChinaL. Xie 2015014 (BJFC)MK253462This study
Clematis fusca var. coreanaNANAKM652489Park & Park[70]
Clematis terniflora Huzhou, Zhejiang, ChinaUnknown number (HZU)KJ956785Li et al.[71]
Coptis quinquesecta Jinping, Yunnan, ChinaLP174738 (HZU)MG585353Zhang et al.[72]
Delphinium ceratophorum Binchuan, Yunnan, ChinaQ. He 2017091301 (BJFC)MK253460This study
Delphinium anthriscifolium Huixian, Henan, ChinaL. Xie 20160402 (BJFC)MK253461This study
Dichocarpum dalzielii Yibin, Sichuan, ChinaL. Xie sc2014008 (BJFC)MK253459This study
Dichocarpum sutchuenense Taibai, Shaanxi, ChinaH. J. Liu BE05 (BJFC)MK253458This study
Gymnaconitum gymnandrum NANAKT964697Unpublished
Halerpestes sarmentosa Shidu, Beijing, ChinaFS2015001MK253457This study
Helleborus thibetanus Cult. in Beijing Forest. Univ.J. He C2018001 (BJFC)MK253456This study
Hepatica_henryi Emei, Sichuan, ChinaL. Xie 2015EM039 (BJFC)MG001340Liu et al.[12]
Hydrastis canadensis NANAKY085918Unpublished
Kingdonia uniflora Meixian, Shaanxi, ChinaY. X. Sun 1606 (HIB)KY908401Sun et al.[3]
Megaleranthis saniculifolia Mt. Sobaek, KoreaUnknown number (Korea University Herbarium)FJ597983Kim et al.[73]
Naravelia pilulifera Longzhou, Guangxi, ChinaL. Xie 201511 (BJFC)MK253455This study
Nigella_damascena Cult. in Beijing Forest. Univ.J. He C2018002 (BJFC)MK253454This study
Oxygraphis glacialis Urumqi, Xinjiang, ChinaZ. Z. Yang 0422 (BJFC)MK253453This study
Pulsatilla chinensis Songshan, Beijing, ChinaL. Xie 2015YQ002 (BJFC)NC_039452Liu et al.[12]
Ranunculus macranthus NANADQ359689Raubeson et al.[74]
Ranunculus sceleratus Mentougou, Beijing, ChinaL. Xie 2014098 (BJFC)MK253452This study
Sinopodophyllum hexandrum NANAMG593048Ye et al.[75]
Souliea vaginata Shangri-la, Yunnan, ChinaL. Xie 2012-X110 (BJFC)MK253451This study
Stephania japonica Wuhan, Hubei, ChinaY. X. Sun 1405 (HIB)KU204903Sun et al.[76]
Thalictrum coreanum Gangwon-do, KoreaNAKM206568Park et al.[77]
Thalictrum minus Wulingshan, Hebei, ChinaL. Xie 20171102 (BJFC)MK253450This study
Thalictrum petaloideum Donglingshan, Beijing, ChinaL. Xie 20150705 (BJFC)MK253449This study
Thalictrum tenue Mentougou, Beijing, ChinaL. Xie 20160502 (BJFC)MK253448This study
Trollius chinensis NANAKX752098Unpublished
Trollius ranunculoides Shangri-la, Yunnan, ChinaL. Xie 2012-X137 (BJFC)MK253447This study

*All the newly sequenced samples used in this study are not endangered species and were not obtained in conserved area.

Information regarding the sequenced Asteropyrum materials and other samples from Ranunculaceae*. *All the newly sequenced samples used in this study are not endangered species and were not obtained in conserved area. The total DNA samples of the 27 newly sequenced species were sent to Novogene (http://www.novogene.com, China) for library construction and next-generation sequencing. Short-insert (350 bp) paired-end read library preparation and 2 × 150 bp sequencing were performed on an Illumina (HiSeq4000) genome analyzer platform. Approximately 2–4 Gb of raw data for each species were first filtered using the FASTX-Toolkit to obtain high-quality clean data by removing adaptors and low-quality reads (http://hannonlab.cshl.edu/fastx_toolkit/download.html). The remaining clean reads (high-quality reads) were sent to the authors for further analysis.

Chloroplast genome assembly and annotation

For the clean reads, BLAT analysis was used to exclude nuclear and mitochondrial reads using published plastid genome sequences from Ranunculaceae as references[55]. Next, de novo assembly was performed using Geneious R11 with a medium-low sensitivity setting[56] to assemble plastid genome sequences. For most of the tested samples, only one contig (approximately 130 kb) was obtained by de novo assembly. If more than one smaller contig was obtained, the whole-genome reads were mapped to those contigs using the Fine Tuning program in Geneious R11 (iterating up to 100 times) to fill gaps. Contigs were connected by overlapping their terminal sequences using the Repeat Finder program implemented in Geneious R11. Sanger sequencing was also used to bridge gaps when necessary. When a 130 kb contig (including SSC, IR, and LSC) was built for each sample, the IR region was determined using the Repeat Finder program, after which the IR region was inverted and copied manually to construct the complete cp genome sequence. Because great variation was found at the IR-SC boundaries in the two Asteropyrum plastomes, we subjected these regions to Sanger sequencing. Broader population sampling of the two Asteropyrum subspecies, as well as an intermediate individual, was applied to assess the stability of the variation at IR-SC boundaries using Sanger sequencing (Supplementary Tables S4 and S5). Complete plastid genomes were annotated using the Unix program Plann 1.1.2[57] and manually verified using Geneious Annotate R11 and the online program Blast[58]. The cp genome sequences and annotations were uploaded to GenBank using Bankit (https://www.ncbi.nlm.nih.gov/books/NBK63590/). Accession numbers are shown in Table 4. Illustrations of all the newly sequenced plastomes were obtained using the Organellar Genome DRAW tool[59].

Comparative chloroplast genomic analyses for two subspecies of Asteropyrum and their relatives

The IR/SC boundaries of Asteropyrum and the other tested species were illustrated and compared with other outgroups to address IR expansion-contraction. MAFFT v7.309[60] was used to align the plastome sequences, whereas mVISTA[61] was used to export visual results to allow evaluation of the structural similarity of plastomes. The alignment programs applied in mVISTA were LAGAN, which produces true multiple alignments regardless of whether they contain inversions or not, and Shuffle-LAGAN, which can detect rearrangements and inversions[62,63]. The sequence diversification of the two subspecies of Asteropyrum was assessed by comparing the two aligned sequences. The rate of identical sites was calculated using Geneious R11.

Plastid phylogenomic analysis

In this study, 48 cp genome sequences, including those of five outgroups from Ranunculales, were aligned for phylogenomic analysis. The gene orders for the cp genome sequences of Tribe Anemoneae and Adonis (in which inversions and/or transposition regions were present) were shuffled in the same order with other Ranunculaceae and outgroup species. The following data sets were applied for phylogenetic reconstruction: complete cp genome sequence (with only one IR region), LSC, SSC, IR, coding regions (CDs), intron, and intergenic spacer regions. All datasets were aligned using MAFFT v7.309[60]. Ambiguous alignments and sites with more than 80% missing data[4] were deleted automatically using a Python script (https://github.com/HeJian151004/get_homology). Maximum Parsimony (MP) analysis was conducted for all the seven datasets using PAUP v4.0b10[64]. Characters were treated as unordered and equally weighted, whereas gaps were treated as missing data. Branch-and-Bound or a 1000-replicate heuristic search was applied with simple addition, and tree bisection reconnection branch swapping with MUL-trees was utilized to search the MP tree(s). Statistical support was assessed by 1000 bootstrap replicates with 1000 random taxon addition replicates and 10 trees held at each step. Maximum likelihood (ML) analyses were carried out with RAxML-HPC2 v8.2.10[65] performed on the online server (https://www.phylo.org/). The GTR + G model was applied for all datasets as suggested by the software instructions (see RAxML manual). The statistical value was inferred using the combined rapid bootstrap method (1000 replicates). Bayesian inference (BI) was performed with MrBayes v3.2.3[66] using partitioned substitution models tested by PartitionFinder v2.1.1[67] with a minimum subset size of 5000, because excessively parameter-rich models often cause calculation problems in Bayesian analysis and fail to converge[2,50]. The best substitution models and data partition schemes were selected by Akaike information criterion (AIC)[68]. Two parallel independent Markov chain Monte Carlo (MCMC) chains were run, each of which consisted of three hot chains and one cold chain for 5,000,000 generations. The trees were sampled and saved every 100 generations. The MCMC convergence was tested by calculating the standard deviation value of split frequencies (less than 0.01) and by assessing the convergence of the parameter values of the two runs. The first 25% of trees were discarded as burn-in, and the remaining trees were used to generate the consensus tree. Supplementary information Supplementary dataset
  43 in total

1.  Cryptic failure of partitioned Bayesian phylogenetic analyses: lost in the land of long trees.

Authors:  David C Marshall
Journal:  Syst Biol       Date:  2009-11-17       Impact factor: 15.683

2.  Chloroplast genomic data provide new and robust insights into the phylogeny and evolution of the Ranunculaceae.

Authors:  Wei Zhai; Xiaoshan Duan; Rui Zhang; Chunce Guo; Lin Li; Guixia Xu; Hongyan Shan; Hongzhi Kong; Yi Ren
Journal:  Mol Phylogenet Evol       Date:  2019-02-28       Impact factor: 4.286

3.  PartitionFinder 2: New Methods for Selecting Partitioned Models of Evolution for Molecular and Morphological Phylogenetic Analyses.

Authors:  Robert Lanfear; Paul B Frandsen; April M Wright; Tereza Senfeld; Brett Calcott
Journal:  Mol Biol Evol       Date:  2017-03-01       Impact factor: 16.240

4.  Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs.

Authors:  K H Wolfe; W H Li; P M Sharp
Journal:  Proc Natl Acad Sci U S A       Date:  1987-12       Impact factor: 11.205

5.  Structural rearrangements, including parallel inversions, within the chloroplast genome of Anemone and related genera.

Authors:  S B Hoot; J D Palmer
Journal:  J Mol Evol       Date:  1994-03       Impact factor: 2.395

6.  Phylogenomic and structural analyses of 18 complete plastomes across nearly all families of early-diverging eudicots, including an angiosperm-wide analysis of IR gene content evolution.

Authors:  Yanxia Sun; Michael J Moore; Shoujun Zhang; Pamela S Soltis; Douglas E Soltis; Tingting Zhao; Aiping Meng; Xiaodong Li; Jianqiang Li; Hengchang Wang
Journal:  Mol Phylogenet Evol       Date:  2015-12-24       Impact factor: 4.286

7.  Analysis of the Amborella trichopoda chloroplast genome sequence suggests that amborella is not a basal angiosperm.

Authors:  Vadim V Goremykin; Karen I Hirsch-Ernst; Stefan Wolfl; Frank H Hellwig
Journal:  Mol Biol Evol       Date:  2003-06-27       Impact factor: 16.240

8.  Comparative analysis of complete chloroplast genome sequence and inversion variation in Lasthenia burkei (Madieae, Asteraceae).

Authors:  Joseph F Walker; Michael J Zanis; Nancy C Emery
Journal:  Am J Bot       Date:  2014-04-03       Impact factor: 3.844

9.  Plann: A command-line application for annotating plastome sequences.

Authors:  Daisie I Huang; Quentin C B Cronk
Journal:  Appl Plant Sci       Date:  2015-08-10       Impact factor: 1.936

10.  OrganellarGenomeDRAW--a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets.

Authors:  Marc Lohse; Oliver Drechsel; Sabine Kahlau; Ralph Bock
Journal:  Nucleic Acids Res       Date:  2013-04-22       Impact factor: 16.971

View more
  10 in total

1.  Organization, Phylogenetic Marker Exploitation, and Gene Evolution in the Plastome of Thalictrum (Ranunculaceae).

Authors:  Kun-Li Xiang; Wei Mao; Huan-Wen Peng; Andrey S Erst; Ying-Xue Yang; Wen-Chuang He; Zhi-Qiang Wu
Journal:  Front Plant Sci       Date:  2022-05-20       Impact factor: 6.627

2.  Plastome evolution in the Caesalpinia group (Leguminosae) and its application in phylogenomics and populations genetics.

Authors:  Paulo Aecyo; André Marques; Bruno Huettel; Ana Silva; Tiago Esposito; Elâine Ribeiro; Inara R Leal; Edeline Gagnon; Gustavo Souza; Andrea Pedrosa-Harand
Journal:  Planta       Date:  2021-07-08       Impact factor: 4.116

3.  rbcL, a potential candidate DNA barcode loci for aconites: conservation of himalayan aconites.

Authors:  Ranjana K Negi; Pooja Nautiyal; Rajneesh Bhatia; Rakesh Verma
Journal:  Mol Biol Rep       Date:  2021-09-02       Impact factor: 2.316

4.  Characterization of the complete chloroplast genome of the Musk Larkspur Delphinium brunonianum (Ranunculales: Ranunculaceae).

Authors:  Qien Li; Xiao Guo; Farong Yuan; Cairang Nima; Duojie Dongzhi; XianJia Li
Journal:  Mitochondrial DNA B Resour       Date:  2020-06-08       Impact factor: 0.658

5.  Characterization of the complete chloroplast genome of the Tangut monkshood Aconitum tanguticum (Ranunculales: Ranunculaceae).

Authors:  Qien Li; Xianjia Li; Rangzhong Qieyang; Cairang Nima; Duojie Dongzhi; Xiao Guo
Journal:  Mitochondrial DNA B Resour       Date:  2020-06-02       Impact factor: 0.658

6.  Comparative plastid genomics of four Pilea (Urticaceae) species: insight into interspecific plastid genome diversity in Pilea.

Authors:  Jingling Li; Jianmin Tang; Siyuan Zeng; Fang Han; Jing Yuan; Jie Yu
Journal:  BMC Plant Biol       Date:  2021-01-07       Impact factor: 4.215

7.  Comparative Analysis of Complete Chloroplast Genomes of 13 Species in Epilobium, Circaea, and Chamaenerion and Insights Into Phylogenetic Relationships of Onagraceae.

Authors:  Yike Luo; Jian He; Rudan Lyu; Jiamin Xiao; Wenhe Li; Min Yao; Linying Pei; Jin Cheng; Jinyu Li; Lei Xie
Journal:  Front Genet       Date:  2021-11-04       Impact factor: 4.599

8.  The complete chloroplast genome of Ranunculus yunnanensis (Ranunculaceae).

Authors:  Rui Rao; Zhe-Fei Zeng; Yu Du; Chun-Min Mao; Lei Xie; Xue-Lian Guo; Liang-Liang Yue
Journal:  Mitochondrial DNA B Resour       Date:  2021-12-16       Impact factor: 0.658

9.  Phylogenomic Analyses of Hepatica Species and Comparative Analyses Within Tribe Anemoneae (Ranunculaceae).

Authors:  Kyu Tae Park; SeonJoo Park
Journal:  Front Plant Sci       Date:  2021-06-04       Impact factor: 5.753

10.  New Insights Into the Plastome Evolution of the Millettioid/Phaseoloid Clade (Papilionoideae, Leguminosae).

Authors:  Oyetola Oyebanji; Rong Zhang; Si-Yun Chen; Ting-Shuang Yi
Journal:  Front Plant Sci       Date:  2020-03-10       Impact factor: 5.753

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.