Literature DB >> 30149578

Complete Chloroplast Genomes from Sanguisorba: Identity and Variation Among Four Species.

Xiang-Xiao Meng1, Yan-Fang Xian2, Li Xiang3, Dong Zhang4, Yu-Hua Shi5, Ming-Li Wu6, Gang-Qiang Dong7, Siu-Po Ip8, Zhi-Xiu Lin9, Lan Wu10,11, Wei Sun12.   

Abstract

The genus Sanguisorba, which contains about 30 species around the world and seven species in China, is the source of the medicinal plant Sanguisorba officinalis, which is commonly used as a hemostatic agent as well as to treat burns and scalds. Here we report the complete chloroplast (cp) genome sequences of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba). These four Sanguisorba cp genomes exhibit typical quadripartite and circular structures, and are 154,282 to 155,479 bp in length, consisting of large single-copy regions (LSC; 84,405⁻85,557 bp), small single-copy regions (SSC; 18,550⁻18,768 bp), and a pair of inverted repeats (IRs; 25,576⁻25,615 bp). The average GC content was ~37.24%. The four Sanguisorba cp genomes harbored 112 different genes arranged in the same order; these identical sections include 78 protein-coding genes, 30 tRNA genes, and four rRNA genes, if duplicated genes in IR regions are counted only once. A total of 39⁻53 long repeats and 79⁻91 simple sequence repeats (SSRs) were identified in the four Sanguisorba cp genomes, which provides opportunities for future studies of the population genetics of Sanguisorba medicinal plants. A phylogenetic analysis using the maximum parsimony (MP) method strongly supports a close relationship between S. officinalis and S. tenuifolia var. alba, followed by S. stipulata, and finally S. filiformis. The availability of these cp genomes provides valuable genetic information for future studies of Sanguisorba identification and provides insights into the evolution of the genus Sanguisorba.

Entities:  

Keywords:  Sanguisorba; chloroplast genome; molecular structure; phylogenetic analysis

Mesh:

Substances:

Year:  2018        PMID: 30149578      PMCID: PMC6225366          DOI: 10.3390/molecules23092137

Source DB:  PubMed          Journal:  Molecules        ISSN: 1420-3049            Impact factor:   4.411


1. Introduction

The genus Sanguisorba belongs to the Rosaceae; there are about 30 species in the genus Sanguisorba in the world, mainly distributed in Asia, Europe, and North America (eFlora of China: http://www.eflora.cn/). There are seven species and six varieties of Sanguisorba in China [1], distributed in both northern and southern China, especially in the northeast provinces. Sanguisorba officinalis has been recorded as a medicinal plant that is commonly used to treat water and fire burns, hemorrhoidal bleeding, and hematochezia [2]. Diyu Shengbai Tablet, a Chinese patent medicine, is mainly composed of S. officinalis, and contains active chemical components including saponins, flavonoids and tannins [3]. It can protect the hematopoietic system, elevate the peripheral blood white blood cells, neutrophils, and platelets, improve bone marrow micro-circulation, and adjust and improve body immunity and other functions. It is also often clinically used as an adjuvant during chemotherapy [3]. The chloroplast genome is ~100–150 kb in length and contains a wealth of evolutionary information, which can be used to reveal phylogenetic relationships among closely related species and can also be valuable for species identification [4,5]. It has been widely used in species identification, phylogenetic evolution, and genetic engineering-related research [6,7]. With the rapid development of high-throughput sequencing technologies and bioinformatics tools, the cost of sequencing chloroplast genome has been significantly reduced, making the large-scale acquisition of chloroplast genomic sequences possible [8,9]. This has made possible the study of chloroplast genomes in terms of population genetic structure, phylogenetic evolution, and species identification. However, molecular research on the genus Sanguisorba is still very scarce. Currently, there are no reports on the chloroplast genome sequence of the genus Sanguisorba, which seriously hampers molecular identification, phylogenetic, genetic, and breeding research involving the genus. In this study, we report the chloroplast genome assembly, annotation, and structural analysis of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba) as well as the complete chloroplast genome sequences of these species, which are the first four sequenced members of the genus Sanguisorba. In addition, we compared the chloroplast genomes of the four Sanguisorba species in detail (e.g., based on IR expansion/contraction and difference regions). From this we constructed a phylogenetic tree using the maximum parsimony (MP) method based on both the whole cp genome and on common protein-coding genes, respectively. Overall, our results provide useful genetic information on the chloroplast of Sanguisorba species, as well as their relative position in phylogenetic tree.

2. Results and Discussion

2.1. Chloroplast Genome Assembly and Features

Using an Illumina HiSeq X platform, four Sanguisorba species were sequenced to produce 11,554,422–18,828,898 paired-end raw reads. After screening these paired-end reads, 598,166 to 1,080,144 cp genome reads were successfully mapped with 569X to 1032X sequencing depth (Table 1). In this study, the sequencing depth was high enough to satisfy the technical requirements of an organelle genome assembly. In total, the complete cp genomes of the four Sanguisorba species were similar in length, ranging from 155,127 bp (S. stipulata) to 155,479 bp (S. officinalis) (Figure 1 and Figures S1–S3, and Table 1), with the typical quadripartite structure of angiosperms. All four cp genomes contained a large single-copy regions (LSC, 84,405–85,557 bp) and a small single-copy regions (SSC, 18,550–18,768bp), separated by a pair of inverted repeats regions (IRs, 25,576–25,615 bp).
Table 1

Sequence information and Illumina next-generation sequencing (NGS) data of the four Sanguisorba chloroplast genomes.

SpeciesRaw Reads No.Mapped Reads No.Sequencing Depth Cp Genome Length (bp)GC Content (%)LSC a (bp)SSC a (bp)IRs a (bp)
S. officinalis 11,554,422609,666581X155,47937.1985,54718,76825,582
S. filiformis 16,876,554656,271628X154,28237.3384,40518,65925,609
S. stipulata 18,828,8981,080,1441032X155,12737.2385,34718,55025,615
S. tenuifolia var. alba18,366,336598,166569X155,45737.2085,55718,74825,576

a LSC (large single-copy regions), SSC (small single-copy regions), and IRs (inverted repeats regions).

Figure 1

Gene map of Sanguisorba officinalis chloroplast genome. Genes shown inside the circle are transcribed clockwise, and those outside are counterclockwise. Genes in different functional groups are color-coded.

The average GC content of the four Sanguisorba cp genomes was ~37.23%; in this respect they showed only minor differences from one another and resembled the cp genomes of other reported Rosaceae species [10,11,12]. Nevertheless, the GC content is unevenly distributed in the four Sanguisorba cp genomes. The GC content of the IR regions (~42.7%) is significantly higher than in the LSC region (~35.3%) or the SSC regions (~31.3%). We speculate that this may be a reason for the divergence of the conservation between the IR and SC regions [8,13]. All four Sanguisorba cp genomes possessed 112 unique genes including 78 protein-coding genes, 30 tRNA genes, and four rRNA genes (Table 2). Of these, six protein-coding genes, seven tRNA genes, and four rRNA genes are duplicated in the IR regions, making a total of 129 genes shared (Table 2). Our results showed that the four Sanguisorba cp genomes were highly conserved in gene type, order, and content. We classified the 112 genes into different categories according to their function, and the details are shown in Table 2. In addition, two pseudogenes (ycf1 and infA) were found in the four cp genomes. There were 18 genes located in the IR regions as follows: rrn16, rrn23, rrn5, rrn4.5, trnA-UGC, trnI-CAU, trnI-GAU, trnL-CAA, trnN-GUU, trnR-ACG, trnV-GAC, rps7, rps12, rpl2, rpl23, ndhB, ycf1, and ycf2 (Figure 1 and Figures S1–S3). rps12 is a trans-spliced gene, in which two 3’ end residues are located in the IR region and the 5’ end in the LSC region (Figure 1 and Figures S1–S3). This is a common phenomenon in the cp genomes of higher plants [14,15]. Significantly, the ycf15 gene is located in cp genome of most angiosperm while is absent from the Sanguisorba cp genomes. This phenomenon was also found to occur in Cedrela odorata [7], Schisandra chinensis [8], Cremastra appendiculata [16] and Aristolochia debilis [17].
Table 2

List of genes encoded by the four Sanguisorba chloroplast genomes.

CategoryGroupName
Self-replicationrRNA genesrrn4.5a, rrn5a, rrn16a, rrn23a
tRNA genestrnA-UGC *,a, trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnfM-CAU, trnG-GCC, trnG-UCC *, trnH-GUG, trnI-CAU a
trnI-GAU *,a, trnK-UUU *, trnL-CAA a, trnL-UAA *, trnL-UAG, trnM-CAU, trnN-GUU a, trnP-UGG, trnQ-UUG, trnR-ACG a, trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-GAC a, trnV-UAC *, trnW-CCA, trnY-GUA
Small subunit of ribosomerps2, rps3, rps4, rps7a, rps8, rps11, rps12 **,a, rps14 rps15, rps16 *, rps18, rps19
Large subunit of ribosomerpl2 *,a, rpl14, rpl16 *, rpl20, rpl22, rpl23 a, rpl32, rpl33, rpl36
DNA dependent RNA polymeraserpoA, rpoB, rpoC1 *, rpoC2
Genes for phytosynthesisSubunits of NADH-dehydrogenasendhA *, ndhB *,a, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH ndhI, ndhJ, ndhK
Subunits of photosystem IpsaA, psaB, psaC, psaI, psaJ
Subunits of photosystem IIpsbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ
Subunits of cytochrome b/f complexpetA, petB *, petD *, petG, petL, petN
Subunits of ATP synthaseatpA, atpB, atpE, atpF, atpH, atpl
Large subunit of RuBisCO rbcL
Other genesMaturase matK
Envelope membrane protein cemA
Subunit of Acetyl-CoA-carboxylase accD
C-type cytochrome synthesis gene ccsA
ProteaseclpP **
Genes of unknown functionOpen Reading Frames (ORF, ycf)ycf1, ycf2a, ycf3 **, ycf4
Pseudo genesycf1, infA

* Gene with one intron, ** Gene with two introns, a Gene with two copies.

Introns play an important role in the regulation of alternative gene splicing [18,19]. We found that 17 genes contained introns in all four Sanguisorba cp genomes, of which 11 are protein-coding genes and six are tRNA genes. 14 of the 17 contain a single intron, whereas three (clpP, rps12, and ycf3) have two introns. The largest intron, located into the trnK-UUU gene, ranged 2508 bp to 2516 bp in the four species (Table 3 and Tables S1–S3). The matK gene is located in the intron of trnK-UUU gene.
Table 3

The length of exons and introns in genes with introns in the Sanguisorba officinalis chloroplast genome.

No.GeneLocationExon I (bp)Intron I (bp)Exon II (bp)Intron I (bp)Exon III (bp)
1 clpP LSC69938291658228
2 ndhA SSC5631185541
3 ndhB IR777682756
4 petB LSC6761657
5 petD LSC9750474
6 rpl16 LSC81011403
7 rpl2 IR391673434
8 rpoC1 LSC4357491620
9rps12 *LSC114-23254326
10 rps16 LSC39899228
11 trnA-UGC IR3881435
12 trnG-UCC LSC2369848
13 trnI-GAU IR4294935
14 trnK-UUU LSC37251635
15 trnL-UAA LSC3755450
16 trnV-UAC LSC3960137
17 ycf3 LSC126723228766153

* rps12 is a trans-spliced gene, of which two 3′ end residues are located in the IR region and the 5′ end in the LSC region.

2.2. Codon Usage

The total length of the protein coding genes from the four Sanguisorba cp genomes is 78,582~78,612 bp, and these genes are encoded by 22,760~22,768 codons. Protein coding genes thus accounted for 50.6~50.9% of the whole genome sequence. The most frequent amino acid is leucine, with 2387~2400 (10.5%) of the codons, but cysteine is the least frequent in the four Sanguisorba cp genomes, with only 260~262 (1.1%) of all codons. Within the protein-coding sequences (CDS), the AT content of codons at the first to third positions is 54.5%, 61.9~62.0%, and 69.5~69.6%, respectively. The fact is that the AT content of the codons is the highest with the third position, and it’s common in land plants [7,13,20,21]. The same phenomenon was also found in the frequency of codon usage. All preferred synonymous codons (RSCU > 1) ended with A or U except the codons of trnL-CAA; however, most non-preferred synonymous codons (RSCU < 1) ended with G or C (Table 4 and Table S4–S6).
Table 4

Codon usage in the Sanguisorba officinalis chloroplast genomes. RSCU: Relative Synonymous Codon Usage.

Amino AcidCodonCountRSCUtRNAAmino AcidCodonCountRSCUtRNA
PheUUU8991.38 TyrUAU6821.61
PheUUC4010.62 trnF-GAA TyrUAC1650.39 trnY-GUA
LeuUUA8102.03 trnL-UAA StopUAA431.65
LeuUUG4661.17 trnL-CAA StopUAG200.77
LeuCUU5031.26 HisCAU4031.51
LeuCUC1480.37 HisCAC1320.49 trnH-GUG
LeuCUA3060.77 trnL-UAG GlnCAA6161.53 trnQ-UUG
LeuCUG1560.39 GlnCAG1910.47
IleAUU9831.5 AsnAAU8251.53
IleAUC3690.56 trnI-GAU AsnAAC2560.47 trnN-GUU
IleAUA6140.94 LysAAA9251.54 trnK-UUU
MetAUG5311trnfM-CAU, trnI-CAU, trnM-CAULysAAG2800.46
ValGUU4711.48 AspGAU7121.62
ValGUC1520.48 trnV-GAC AspGAC1680.38 trnD-GUC
ValGUA4741.48 trnV-UAC GluGAA9041.52 trnE-UUC
ValGUG1800.56 GluGAG2870.48
SerUCU4691.69 CysUGU2041.56
SerUCC2630.95 trnS-GGA CysUGC570.44 trnC-GCA
SerUCA3061.1 trnS-UGA StopUGA150.58
SerUCG1710.61 TrpUGG3961 trnW-CCA
ProCCU3521.47 ArgCGU3071.36 trnR-ACG
ProCCC1980.83 ArgCGC950.42
ProCCA2571.08 trnP-UGG ArgCGA3121.38
ProCCG1490.62 ArgCGG1030.46
ThrACU4651.59 SerAGU3491.25
ThrACC2240.76 trnT-GGU SerAGC1110.4 trnS-GCU
ThrACA3481.19 trnT-UGU ArgAGA3911.74 trnR-UCU
ThrACG1350.46 ArgAGG1440.64
AlaGCU5761.79 GlyGGU5241.32
AlaGCC2010.63 GlyGGC1920.48 trnG-GCC
AlaGCA3481.08 trnA-UGC GlyGGA5681.43 trnG-UCC
AlaGCG1610.5 GlyGGG3050.77
Average # codons = 22,768

2.3. Long Repeats and SSR Analysis

For long repeats analysis, the four cp genomes enclose long repeats with a total number ranging from 39 to 53 with at least 30 bp per repeat unit. Taking S. officinalis as an example, a number of 49 repeats were detected. These included 24 palindromic repeats, 17 forward repeats, six reverse repeats, and two complement repeats. Most repeats showed lengths between 30 and 44 bp and are in intergenic regions or intron sequences. SSRs, also called as microsatellites, are tandemly repeated sequences that consist of 1–6 nucleotide repeat units. SSRs are widely distributed in cp genomes in general and are important for studies of plant populations. Because of their high level of polymorphism, SSRs are widely used as molecular markers for species authentication, molecular breeding, and population genetics [22,23,24,25]. Here, we identified many SSRs in the cp genomes, ranging from 79 in S. tenuifolia var. alba to 91 in S. stipulata. Most of the SSRs are mononucleotide repeats, whose amount ranges from 55 (S. tenuifolia var. alba) to 69 (S. stipulata). The number of di-, tri-, tetra-, penta-, and hexanucleotide repeats found was 9~12, 3~4, 7~9, 0~1, and 1~2, respectively (Table 5). Most of the mononucleotide SSRs belonged to the A/T type in the four Sanguisorba species. The highest number of SSRs found was in S. stipulata, which showed 68 of 69 identified mononucleotide SSRs. The lowest number of SSRs found was 55 of the 59 found in S. officinalis. These results are consistent with those of previous studies that found that polyadenine (polyA) and polythymine (polyT) content were higher than polyguanine (polyG) and polycytosine (polyC) content in the cpSSRs of many plants [26]. We speculate that the abundance of A/T SSRs may be associated with the AT richness of these cp genomes [13,27].
Table 5

Types and numbers of SSRs found in the four Sanguisorba chloroplast genomes.

SSR TypeRepeat UnitNumber
S. officinalis S. filiformis S. stipulata S. tenuifolia var. alba
MonoA/T55566853
C/G4312
DiAT/AT119811
AG/CT1111
TriAAT/ATT3434
TetraAAAT/ATTT4354
AAAG/CTTT1111
ACAT/ATGT1111
AGAT/ATCT1111
AATT/AATT0110
PentaAAATT/AATTT1001
HexaAAAGGG/CCCTTT0200
AAAATC/ATTTTG0010
Total 82829179

2.4. IR Contraction and Expansion

It is well known that IRs are the most conserved regions in chloroplast genomes, and the contraction and expansion at the borders of IR regions are common evolutionary events. It is also a main cause of length variation in the chloroplast genomes [28,29]. In this study, we compared the IR/SSC and IR/LSC boundaries of the four Sanguisorba cp genomes (Figure 2). In the four Sanguisorba species, the IRb/SSC boundary extends into functional ycf1 genes, yielding a pseudogene ycf1, which have a length of 1106~1201 bp in the four species. A previous study reported that the pseudogene ycf1 may be useful for researching variation among cp genomes in higher plants or algae [30]. In addition, we found no overlap between the ycf1 pseudogene and ndhF in the four species. The ndhF gene is found in the SSC region, and was 138 bp, 90 bp, four bp, and 117 bp away from the IRb/SSC boundary in S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba, respectively. The trnH gene was found in the same position of the same LSC region in the four species, which is only two bp away from the IRb/SSC boundary. In the cp genome, variation in the IR/SSC and IR/LSC boundaries is governed by a dynamic and random process that is confined to conservative expansions and contractions [31,32]. There are many studies about the mechanisms responsible for IR expansion, and the leading view is that short IR expansions could be caused by gene conversion, but large IR expansions may be the result of double-strand DNA break repair (DSBR) [33,34]. In contrast, there are few reports on the mechanisms of IR contraction. However, Peery et al. proposed that DSBR theory was not only the main mechanism of IR region expansion, but also the main mechanism of IR region contraction [35].
Figure 2

Comparison of the border regions of the LSC, SSC, and IR among four chloroplast genomes. Ψ: pseudogenes.

2.5. Comparative Chloroplast Genomic Analysis

With the annotated S. officinalis cp genome as a reference, the whole cp genome of the four Sanguisorba species were compared and drawn by mVISTA to show sequence divergence (Figure 3), which is important for further phylogenetic analyses and species identification. Comparative genome analysis found that there is a high similarity between the cp genomes of all Sanguisorba species. The LSC and SSC regions are more divergent than the two IR regions, which is common in other higher plants and may be due to copy corrections between two IR regions by gene conversion [36]. Moreover, the coding regions have less variability proportions than the non-coding regions. The highest divergence among the four Sanguisorba cp genomes occurs in the intergenic spacers region, which contains trnE-trnT, trnS-psbZ, trnS-ycf3, trnF-ndhJ, accD-psal, and ycf1-ndhF. In this study, we found that the more conserved coding regions are the four rRNA located in IR region.
Figure 3

Comparison of the four Sanguisorba chloroplast genomes using mVISTA. CNS indicates conserved noncoding sequences. The Y-scale represents the percent identity between 50% and 100%.

2.6. Phylogenetic Analysis

Chloroplast genomes contain abundant genetic information that is widely applied in plant identification and phylogenetic studies [6,37,38,39]. Sanguisorba belongs to the subfamily Rosoideae in the Rosaceae. Previous studies have reported phylogenetic relationships within the Rosaceae that were analyzed based on chloroplast regions [40,41]. Here, the availability of the completed cp genomes and protein coding genes of the four Sanguisorba species provide us with sequence and gene information for studying the molecular evolution and phylogeny of the genus Sanguisorba [9,42]. In this study, two datasets (i.e., the whole complete cp genome and the set of protein coding genes) from the cp genomes of the four Sanguisorba species and one outgroup (Fragaria chiloensis) were used to perform phylogenetic analysis. Phylogenetic trees were generated using the maximum parsimony (MP) method based on two datasets with the same topologies (Figure 4 and Figure S4). For the four Sanguisorba species, S. officinalis has the closest relationship with S. tenuifolia var. alba, followed by S. stipulata, and has the least close relationship with the S. filiformis. In addition, both S. stipulata and S. filiformis group into a monophyletic clade.
Figure 4

Phylogenetic relationships between the four Sanguisorba species determined by whole cp genome sequences using the maximum parsimony (MP) method. Fragaria chiloensis was set as the outgroup.

3. Materials and Methods

3.1. Plant Materials and DNA Extraction

Fresh leaves of four Sanguisorba species were collected from Jilin and Yunan Provinces in China. Then we washed the leaves powder with HF buffer (100 mmol·L−1 Tris-HCl pH 8.0, 20 mmol·L−1 EDTA, 0.7 mol·L−1 NaCl, 2% PVP, and 0.2% 2-mercaptoethanol). HF buffer (600 μL) was added to leaves powder (~100 mg), the mixture vortexed vigorously for 3 min, centrifuged for 5 min at 12,000 rpm, and the supernatant discarded. Finally the total genomic DNA of each sample was isolated from the leaves powder by Plant Genomic DNA Kits (Tiangen Biotech Co., Beijing, China), according to the manufacturer’s instructions. The DNA quality and quantity of each sample was estimated by a NanoDrop 2000 Spectrophotometer (Nanodrop Technologies, Wilmington, DE, USA) and a Qubit3.0 Fluorometer (Thermo Scientific, Waltham, MA, USA), as well as by agarose gel electrophoresis.

3.2. Chloroplast Genome Sequencing, Assembly and Annotation

After DNA was purified and prepared, ~2 μg was used to construct shotgun libraries. Genomic DNA was taken and sheared into 450 bp contigs with the Covaris M220 Focused-ultrasonicator (Covaris, Woburn, MA, USA). The library was constructed by TruSeqTM DNA Sample Prep Kit (Illumina Inc., San Diego, CA, USA), according to the manufacturer’s instructions. An Illumina HiSeq X platform was used for sequencing. Clean reads were obtained by using the Fastqc trim tool [43]. We then extracted cp-like reads from trimmed reads by performing BLASTs [44] using reference sequences (Rosa roxburghii, accession No.: NC_032038). Sequence assembly was performed by using SOAPdenovo [45], and the contigs were aligned using SSPACE [46]. The complete chloroplast genomes of the four Sanguisorba species were annotated using the CpGAVAS web service [47]. The tRNA genes were confirmed using tRNAscan-SE [48,49]. OGDRAW software (http://ogdraw.mpimp-golm.mpg.de/) [50] was used to draw circular cp genome maps for each species. The validated complete cp genome of the four Sanguisorba species were deposited in GenBank (https://www.ncbi.nlm.nih.gov/): S. officinalis, MF678801; S. filiformis, MF678800; S. stipulata, MF678798; and S. tenuifolia var. alba, MF678799.

3.3. Genome Comparison and Structural Analyses

The IR and SC boundary regions of the four Sanguisorba species were compared and examined. Comparison of the four cp genomes was performed using the Shuffle-LAGAN mode in mVISTA [51,52], with the annotation of S. officinalis used as the reference. In addition, we analyzed the codon usage, relative synonymous codon usage values (RSCU), and GC content using MEGA5 [53]. SSRs were identified by MISA (http://pgrc.ipk-gatersleben.de/misa/) [54] with minimum repeat numbers of 10, 5, 4, 3, 3, and 3 for mono-, di-, tri-, tetra-, penta-, and hexanucleotides, respectively. The forward and inverted repeats in the Sanguisorba cp genome were detected using REPuter [55] with a minimal repeat sequence of 30 bp and a sequence identity of 90%.

3.4. Phylogenetic Analyses

Phylogenetic analyses were performed for the four Sanguisorba species using Fragaria chiloensis (Rosaceae) as an outgroup. The complete cp genome sequences and protein coding genes shared in four Sanguisorba species and Fragaria chiloensis (accession No.: NC_019601) [56] were aligned by ClustalW2 [57]. Phylogenetic trees were constructed using the maximum parsimony (MP) method in PAUP*4.0b10 [58]. A heuristic search was performed using the MULPARS option, with the random stepwise addition of sequences in 1000 replications and tree bisection reconnection (TBR) branch swapping. The branch support of the phylogenetic tree was 1000 bootstrap replicates.

4. Conclusions

The complete cp genome sequences of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba), the first four sequenced members of the genus Sanguisorba, were assembled, annotated and analyzed in this study. The genome structure, gene content, and gene order were similar in the four species. Long repeats and SSRs reported here provide opportunities for the development of new molecular markers to study medicinal plants in the genus Sanguisorba. Phylogenetic analysis strongly supported that S. officinalis has the closest relationship with S. tenuifolia var. alba, followed by S. stipulata, and then S. filiformis. The available genome data presented in this paper provides a basis for further research on the evolution of the genus Sanguisorba, as well as for species identification.
  43 in total

Review 1.  Alternative splicing: increasing diversity in the proteomic world.

Authors:  B R Graveley
Journal:  Trends Genet       Date:  2001-02       Impact factor: 11.639

2.  Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms.

Authors:  Michael J Moore; Charles D Bell; Pamela S Soltis; Douglas E Soltis
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-28       Impact factor: 11.205

3.  Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns.

Authors:  Robert K Jansen; Zhengqiu Cai; Linda A Raubeson; Henry Daniell; Claude W Depamphilis; James Leebens-Mack; Kai F Müller; Mary Guisinger-Bellian; Rosemarie C Haberle; Anne K Hansen; Timothy W Chumley; Seung-Bum Lee; Rhiannon Peery; Joel R McNeal; Jennifer V Kuehl; Jeffrey L Boore
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-28       Impact factor: 11.205

4.  Development and characterization of 18 novel EST-SSRs from the western flower Thrips, Frankliniella occidentalis (Pergande).

Authors:  Xian-Ming Yang; Jing-Tao Sun; Xiao-Feng Xue; Wen-Chao Zhu; Xiao-Yue Hong
Journal:  Int J Mol Sci       Date:  2012-03-05       Impact factor: 6.208

5.  Molecular Structure and Phylogenetic Analyses of Complete Chloroplast Genomes of Two Aristolochia Medicinal Species.

Authors:  Jianguo Zhou; Xinlian Chen; Yingxian Cui; Wei Sun; Yonghua Li; Yu Wang; Jingyuan Song; Hui Yao
Journal:  Int J Mol Sci       Date:  2017-08-24       Impact factor: 5.923

6.  The complete chloroplast genome of Capsicum annuum var. glabriusculum using Illumina sequencing.

Authors:  Sebastin Raveendar; Young-Wang Na; Jung-Ro Lee; Donghwan Shim; Kyung-Ho Ma; Sok-Young Lee; Jong-Wook Chung
Journal:  Molecules       Date:  2015-07-20       Impact factor: 4.411

7.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

8.  CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences.

Authors:  Chang Liu; Linchun Shi; Yingjie Zhu; Haimei Chen; Jianhui Zhang; Xiaohan Lin; Xiaojun Guan
Journal:  BMC Genomics       Date:  2012-12-20       Impact factor: 3.969

9.  Comparative chloroplast genomics: analyses including new sequences from the angiosperms Nuphar advena and Ranunculus macranthus.

Authors:  Linda A Raubeson; Rhiannon Peery; Timothy W Chumley; Chris Dziubek; H Matthew Fourcade; Jeffrey L Boore; Robert K Jansen
Journal:  BMC Genomics       Date:  2007-06-15       Impact factor: 3.969

10.  NCBI BLAST: a better web interface.

Authors:  Mark Johnson; Irena Zaretskaya; Yan Raytselis; Yuri Merezhuk; Scott McGinnis; Thomas L Madden
Journal:  Nucleic Acids Res       Date:  2008-04-24       Impact factor: 16.971

View more
  7 in total

1.  PCIR: a database of Plant Chloroplast Inverted Repeats.

Authors:  Rui Zhang; Fangfang Ge; Huayang Li; Yudong Chen; Ying Zhao; Ying Gao; Zhiguo Liu; Long Yang
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

2.  The complete chloroplast genome of Agrimonia pilosa Ledeb. isolated in Korea (Rosaceae): investigation of intraspecific variations on its chloroplast genomes.

Authors:  Kyeong-In Heo; Jongsun Park; Hong Xi; Juhyeon Min
Journal:  Mitochondrial DNA B Resour       Date:  2020-06-01       Impact factor: 0.658

3.  Antibacterial activity and synergy of antibiotics with sanguisorbigenin isolated from Sanguisorba officinalis L. against methicillin-resistant Staphylococcus aureus.

Authors:  S Wang; J Luo; X-Q Liu; O-H Kang; D-Y Kwon
Journal:  Lett Appl Microbiol       Date:  2021-01-08       Impact factor: 2.858

4.  A Comprehensive Study of the Genus Sanguisorba (Rosaceae) Based on the Floral Micromorphology, Palynology, and Plastome Analysis.

Authors:  Inkyu Park; Junho Song; Sungyu Yang; Goya Choi; Byeongcheol Moon
Journal:  Genes (Basel)       Date:  2021-11-05       Impact factor: 4.096

5.  Comparative chloroplast genomes and phylogenetic analyses of Pinellia.

Authors:  Ning Cui; Weixu Chen; Xiwen Li; Ping Wang
Journal:  Mol Biol Rep       Date:  2022-06-11       Impact factor: 2.742

6.  The Complete Chloroplast Genome of Endangered Species Stemona parviflora: Insight into the Phylogenetic Relationship and Conservation Implications.

Authors:  Ran Wei; Qiang Li
Journal:  Genes (Basel)       Date:  2022-07-29       Impact factor: 4.141

7.  Complete chloroplast genome studies of different apple varieties indicated the origin of modern cultivated apples from Malus sieversii and Malus sylvestris.

Authors:  Xueli Li; Zhijie Ding; Haoyu Miao; Jinbo Bao; Xinmin Tian
Journal:  PeerJ       Date:  2022-03-18       Impact factor: 2.984

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.