Literature DB >> 31242865

Comparative analyses of chloroplast genomes from 22 Lythraceae species: inferences for phylogenetic relationships and genome evolution within Myrtales.

Cuihua Gu1, Li Ma2, Zhiqiang Wu3, Kai Chen2, Yixiang Wang4.   

Abstract

BACKGROUND: Lythraceae belongs to the order Myrtales, which is part of Archichlamydeae. The family has 31 genera containing approximately 620 species of herbs, shrubs and trees. Of these 31 genera, five large genera each possess 35 or more species. They are Lythrum, with 35; Rotala, with 45; Nesaea, with 50; Lagerstroemia, with 56; and Cuphea, with 275 species.
RESULTS: We reported six newly sequenced chloroplast (cp) genomes (Duabanga grandiflora, Trapa natans, Lythrum salicaria, Lawsonia inermis, Woodfordia fruticosa and Rotala rotundifolia) and compared them with 16 other cp genomes of Lythraceae species. The cp genomes of the 22 Lythraceae species ranged in length from 152,049 bp to 160,769 bp. In each Lythraceae species, the cp genome contained 112 genes consisting of 78 protein coding genes, four ribosomal RNAs and 30 transfer RNAs. Furthermore, we detected 211-332 simple sequence repeats (SSRs) in six categories and 7-27 long repeats in four categories. We selected ten divergent hotspots (ndhF, matK, ycf1, rpl22, rpl32, trnK-rps16, trnR-atpA, rpl32-trnL, trnH-psbA and trnG-trnR) among the 22 Lythraceae species to be potential molecular markers. We constructed phylogenetic trees from 42 Myrtales plants with 8 Geraniales plants as out groups. The relationships among the Myrtales species were effectively distinguished by maximum likelihood (ML), maximum parsimony (MP) and Bayesian inference (BI) trees constructed using 66 protein coding genes. Generally, the 22 Lythraceae species gathered into one clade, which was resolved as sister to the three Onagraceae species. Compared with Melastomataceae and Myrtaceae, Lythraceae and Onagraceae differentiated later within Myrtales.
CONCLUSIONS: The study provided ten potential molecular markers as candidate DNA barcodes and contributed cp genome resources within Myrtales for further study.

Entities:  

Keywords:  Chloroplast genome; Lythraceae; Myrtales; Phylogenomic

Mesh:

Year:  2019        PMID: 31242865      PMCID: PMC6595698          DOI: 10.1186/s12870-019-1870-3

Source DB:  PubMed          Journal:  BMC Plant Biol        ISSN: 1471-2229            Impact factor:   4.215


Background

Lythraceae belongs to the order Myrtales and is named after the genus Lythrum [1]. The flowering family is composed of five subfamilies, Lythroideae, Punicoideae, Sonneratioideae, Duabangoideae and Trapoideae, with 31 genera. The subfamily Punicoideae was formerly the family Punicaceae, and the subfamily Trapoideae was formerly the Trapaceae. The genera Cuphea, Lagerstroemia, Nesaea, Rotala, and Lythrum represent the largest groups of Lythraceae. Lythraceae species are distributed around the world, with most in tropical regions and some in temperate climate regions [2-7]. Most Lythraceae species are herbs, while shrubs or trees are less common [8]. Lythraceae differ from other plant families by the petals, which are crumpled inside their buds, and the many-layered outer integument of their seeds [2, 3]. Many species occur in aquatic or semiaquatic habitats, such as Didiplis, Rotala, Morus and Trapa. Some species in the family are of high economic value, such as Punica granatum as a fruit tree, Trapa natans as edible food, Heimia myrtifolia as an important medicinal plant [9] and Lawsonia inermis as a natural dye. Overall, the species of Lythraceae have high economic and ornamental value and are widely used in horticulture [10, 11]. Past studies of Lythraceae have concentrated on morphology [2, 12], palynology [13, 14] and anatomy [15]. However, these studies did not distinguish the intraspecific relationship within Lythraceae. More recently, to deepen our understanding of the relationship among Lythraceae species, the modern branch method was used to make a preliminary estimate of the phylogeny within Lythraceae species [16]. Based on rbcL genome data, the psaA-ycf3 spacer in the cp genome and the ITS sequence of the nuclear ribosomale DNA, the phylogenetic relationship within Lythraceae ware preliminarily inferred [17]. These two noncoding regions improved the resolution between species in an rbcL bifurcation diagram [17]. However, due to the use of certain DNA fragments, these studies lead to incomplete conclusions. Complete cp genomes will provide better solutions to relationship reconstruction within Lythraceae and allow exploration of its phylogenetic position within Myrtales. The chloroplast is an essential organelle for land plants [18], and is mostly inherited maternally [19]. The cp genome usually consists of a two-stranded DNA molecule, and most cp genomes are 120–220 kb in length with 120–140 coding genes [20, 21]. The cp genome usually has four parts: a large single copy (LSC) region, a small single copy (SSC) region, and two copies of the inverted repeat region (IRA and IRB). Because the cp genome is more conserved and shorter in length than the nuclear and mitochondrial genomes, some cp genome sequence have been used to distinguish species and conduct phylogenetic studies [22-25]. An increasing number of cp genomes have recently been reported because complete cp genome sequences provides better data to distinguish marginal taxa, especially below the species level. In this study, we report six newly sequenced Lythraceae cp genomes and compare them with those of 16 other species within Lythraceae including nine published cp genomes (P. granatum, H. myrtifolia, Lagerstroemia fauriei, Lagerstroemia floribunda, Lagerstroemia guilinensis, Lagerstroemia indica, Lagerstroemia speciosa, Lagerstroemia subcostata and Lagerstroemia intermedia) downloaded from GenBank and seven unpublished Lagerstroemia cp genomes (Lagerstroemia excelsa, Lagerstroemia limii, Lagerstroemia villosa, Lagerstroemia siamica, Lagerstroemia tomentosa, Lagerstroemia venusta and Lagerstroemia calyculata). Our objectives were as follows: (1) To detect differences between the cp genomes of 22 Lythraceae species; (2) to select 10 highly variable regions to act as candidate barcodes for identifying related species of Lythraceae; (3) to reconstruct phylogenetic relationships to verify branch relationships within Lythraceae and explore its status in Myrtales.

Results

Chloroplast genome structure and content

The complete cp genomes of the 22 Lythraceae species ranged in length from 152,049 bp (L. subcostata) to 160,769 bp (L. villosa) (Table 1). All cp genomes had the typical four conjoined structures, including the LSC and SSC regions separated by two IR regions (Fig. 1). The LSC regions ranged from 83,817 bp (L. guilinensis) to 89,569 bp (W. fruticosa) and accounted for 55.10–56.90% of the total length. The SSC regions varied between 16,501 bp (D. grandiflora) and 33,301 bp (L. speciosa) and accounted for 10.60–21.80% of the total length. The IR regions ranged from 17,541 bp (L. floribunda) to 26,906 bp (L. villosa) and accounted for 11.50–17.00% of the total length.
Table 1

Summary of complete chloroplast genomes for 22 species in Lythraceae

L.excelsa L.limii L.villosa L.siamica L.tomentosa L.venusta L.calyculata L.fauriei L.floribunda L.guilinensis
Accession numberMK881635MK881627MK881633MK881628MK881632MK881630MK881636NC_029808NC_031825NC_029885
FamilyLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceae
Total length (bp)152,214152,153160,769152,519152,294152,521152,294152,440152,240152,193
GC(%)37.637.5836.9737.5837.6537.5737.6537.6137.7237.62
LSC
 Length (bp)84,05383,95488,70284,16684,01384,19484,01283,92683,96783,817
 GC(%)35.9435.9234.6935.8935.9835.8735.9735.9436.135.95
 length(%)55.255.255.255.255.255.255.255.155.255.1
SSC
 Length (bp)16,91716,90518,25516,86516,91716,83316,79816,93416,78716,909
 GC(%)31.0330.9630.7830.9531.0330.9731.1730.9231.2330.97
 length(%)11.111.111.411.111.111.011.011.111.011.1
IR
 Length (bp)25,62225,64726,90625,74425,62225,74725,74225,79025,78825,794
 GC(%)42.4942.4742.8342.5142.4942.5142.5142.5142.4842.47
 length(%)16.816.916.716.916.816.916.916.916.916.9
L. indica L.speciosa L.subcostata L.intermedia D.grandiflora T.natans L.salicaria L.intermis P.granatum W.fruticosa R.rotundifolia H.myrtifolia
NC_030484NC_031414NC_034952NC_034662MK881638MK881634MK881629MK881631NC_035240MK881637MK881626MG921615
LythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceae
152,205152,476152,049152,330156,084155,555158,483157,756158,639159,380157,753159,219
37.5937.5837.5937.5937.4936.4136.8136.8936.9236.6336.8937.00
LSC
 84,04684,05183,89083,98786,47188,50688,99988,42489,02289,56988,42288,571
 35.9335.8935.9235.9235.5934.1934.7534.7634.8934.5334.7635.00
 55.255.155.255.155.456.956.256.156.156.256.155.6
SSC
 16,91516,88616,90916,87116,50118,27418,53017,38618,68518,69717,38618,822
 30.9830.9730.9730.9331.2830.1830.6831.0130.6330.2331.0130.60
 11.111.011.111.110.611.711.711.011.811.711.011.8
IR
 25,62225,81725,62525,73626,55624,38825,47725,97325,46625,55725,97325,643
 42.542.5142.542.5142.542.7742.6342.542.7842.6542.542.60
 16.816.916.916.917.015.716.116.516.116.016.516.1

GC guanine-cytosine, LSC large single-copy region, SSC short single-copy region, IRs inverted repeats

Fig. 1

Structural map of the Lythraceae chloroplast genome. Genes drawn inside the circle are transcribed clockwise, and those outside are counterclockwise. Small single copy (SSC), Large single copy (LSC), and inverted repeats (IRa, IRb) are indicated. Genes belonging to different functional groups are color-coded

Summary of complete chloroplast genomes for 22 species in Lythraceae GC guanine-cytosine, LSC large single-copy region, SSC short single-copy region, IRs inverted repeats Structural map of the Lythraceae chloroplast genome. Genes drawn inside the circle are transcribed clockwise, and those outside are counterclockwise. Small single copy (SSC), Large single copy (LSC), and inverted repeats (IRa, IRb) are indicated. Genes belonging to different functional groups are color-coded A total of 112 unique genes were detected in the cp genomes of the 22 Lythraceae species, including 78 coding genes, 30 tRNA genes and 4 rRNA genes (Table 2). Among the 22 Lythraceae species, the lengths of the protein coding exons ranged from 73,401 bp (L. indica) to 81,047 bp (H. myrtifolia), rRNA ranged from 9022 bp (T. natans) to 9068 bp (L.fauriei), tRNA ranged from 2741 bp (L. guilinensis) to 2913 bp (L. excelsa), intergenic regions ranged from 44,031 bp (L. guilinensis) to 51,367 bp (L. villosa) and intronic regions ranged from 14,786 bp (L. calyculata) to 18,099 bp (L. villosa). Each of these accounted for 37.00–38.00%, 3.00–6.00%, 1.80–1.90%, 28.90–32.40% and 9.70–11.30% of the total length, respectively (Table 3).
Table 2

Genes contained in the sequenced Lythraceae chloroplast genome

Gene categoryGroups of genesName of genes
Self-replicationRibosomal RNAs rrn16 b ;rrn23 b ;rrn4.5 b ;rrn5 b
Transfer RNAs trnA-UGC a,b ;trnC-GCA;trnD-GUC;trnE-UUC;trnF-GAA;trnfM-CAU
trnG-UCC a ;trnG-GCC;trnH-GUG;trnI-CAU b ;trnI-GAU a,b ;trnK-UUU a
trnL-CAA b ;trnL-UAA a; trnL-UAG;trnM-CAU;trnN-GUU b ;trnP-UGG
trnQ-UUG;trnR-ACG b ;trnR-UCU;trnS-GCU;trnS-GGA;trnS-UGA
trnT-GGU;trnT-UGU;trnV-UAC a; trnW-CCA;trnY-GUA
Small subunit of ribosome rps2;rps3;rps4;rps7 b ;rps8;rps11;rps12 a,b ;rps14;rps15;rps16 a ;rps18;rps19
Large subunit of ribosome rpl2 a,b ;rpl14;rpl16 a ;rpl20;rpl23 b ;rpl32;rpl33;rpl36
DNA dependent RNA polymerase rpoA;rpoB;rpoC1 a ;rpoC2
PhotosynthesisSubunits of photosystem I psaA;psaB;psaC;psaI;psaJ
Subunits of photosystem II psbA;psbB;psbC;psbD;psbE;psbF;psbH;psbI;psbJ;psbK;psbL;psbM
psbN; psbT;psbZ
Subunits of cytochrome petA;petB a ;petD;petG;petL;petN
Subunits of ATP synthase atpA;atpB;atpE;atpF a ;atpH;atpI
ATP-dependent protease subunit p gene clpP a
Large subunit of Rubisco rbcL
Subunits of NADH dehydrogenase ndhA a ;ndhB a,b ;ndhC;ndhD;ndhE;ndhF;ndhG;ndhH;ndhI;ndhJ;ndhK
Other genesMaturase matK
Envelop membrane protein cemA
Acetyl-CoAcarboxylase accD
other ccsA;infA
Genes of unknown functionConserved open reading frames ycf1 b ;ycf2 b ; ycf3 a ; ycf4

aIntron-containing genes

bGenes located in the IR regions

Table 3

Distribution of genes and Intergenic regions for 22 species in Lythraceae

L.excelsa L.limii L.villosa L.siamica L.tomentosa L.venusta L.calyculata L.fauriei L.floribunda L.guilinensis
Accession numberMK881635MK881627MK881633MK881628MK881632MK881630MK881636NC_029808NC_031825NC_029885
FamilyLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceaeLythraceae
Protein Coding Genes
 Length (bp)79,04679,08079,64879,05679,06279,05979,06279,06278,85279,068
 GC(%)37.8637.8437.7737.837.8737.837.8637.8437.9237.86
 length(%)52525052525252525252
rRNA
 Length (bp)9038903890359040903890409038906890449044
 GC(%)55.7255.7255.4355.6255.6855.6455.6855.6755.5755.71
 length(%)6666666666
tRNA
 Length (bp)2913281928142808281328142813280927452741
 GC(%)52.853.3553.253.4553.4753.4553.4753.1553.5553.37
 length(%)2222222222
Intergenic Regions
 Length (bp)45,48246,15651,36745,61945,94145,48646,03344,13845,26644,031
 GC(%)32.5532.4331.332.5232.4932.5432.8432.3732.5432.37
 length(%)30303230303030293029
Intron
 Length (bp)15,33415,41918,09915,60715,59015,60614,78615,83415,87715,596
 GC(%)38.238.1937.8238.1938.3238.1837.6438.4238.738.33
 length(%)10101110101010101010
L. indica L.speciosa L.subcostata L.intermedia D.grandiflora T.natans L.salicaria L.intermis P.granatum W.fruticosa R.rotundifolia H.myrtifolia
NC_030484NC_031414NC_034952NC_034662MK881638MK881634MK881629MK881631NC_035240MK881637MK881626MG921615
LythraceaeLythraceaeLythraceaeLythraceaeSonneratiaceaeTrapaceaeLythraceaeLythraceaePunicaceaeLythraceaeLythraceaeLythraceae
Protein Coding Genes
 73,40179,04477,13979,03578,99378,84878,84979,00679,02978,97879,00081,047
 38.4537.7937.7637.8137.8837.2737.6237.5437.6337.5337.5437.00
 485251525151505050505051
rRNA
 905090469042904690409022903890389038903890389050
 55.6955.5655.6855.5855.5555.5155.1755.2855.2655.2855.2855.00
 666666666666
tRNA
 281727422828280729032812281328122817281928122817
 53.2553.6153.3953.4452.7753.2453.3653.3153.2853.4253.3153.00
 222222222222
Intergenic Regions
 44,53544,31344,18445,34645,92348,75550,85149,41751,35750,98949,44150,172
 32.3532.4532.6132.5832.3630.4631.3231.3831.5530.9931.6232.00
 292929302931323132323132
Intron
 16,22615,86116,20116,37515,87915,56415,94315,91515,92815,97315,91516,133
 37.9138.3337.8737.8938.2737.1437.7837.7937.937.737.7938.00
 111011111010101010101010

GC guanine-cytosine, LSC large single-copy region, SSC short single-copy region, IRs inverted repeats

Genes contained in the sequenced Lythraceae chloroplast genome aIntron-containing genes bGenes located in the IR regions Distribution of genes and Intergenic regions for 22 species in Lythraceae GC guanine-cytosine, LSC large single-copy region, SSC short single-copy region, IRs inverted repeats Among the 112 distinct genes, a total of 17 genes contained introns. Three genes (rps12 and ycf3) contained two introns, similar to Melastomataceae cp genomes [26]. Fourteen genes contained one intron, including eight coding genes (rps16, rpoC1, atpF, petB, petD, ndhB, ndhA, rpl16) and 6 tRNA genes (trnK-UUU, trnL-UAA, trnV-UAC, trnI-GAU, trnA-UGC, trnG-UCC). Of the 17 genes containing introns, one gene was distributed in the SSC regions, three genes was distributed in the IR regions and 13 genes in the LSC regions (Additional file 1: Table S1).

Codon usage

A total of 79 coding genes were used to estimate the codon usage frequency. They were encoded by 25,068 (L. indica) to 27,111 (L. guilinensis) codons. The termination codons were UGA, UAG and UAA. For the 22 species, the GCU encoded alanine had the highest RSCU value and the UAC encoded tyrosine had the lowest at approximately 0.45. Among most of the 22 Lythraceae species, the AAA encoded lysine had the highest number of occurrences, at more than 1000. This result was also reported in the cp genomes of H. myrtifolia, Aquilaria sinensis, Epipremnum aureum and Papaver rhoeas [9, 27–29]. The RSCU results (Table 4, Additional file 2: Table S2) showed that A or T had a higher nucleotide frequency than G or C in the third codon position. It is often the case in terrestrial species that the third codon position prefers A/T over C/G, and the richness of A/ T in the IR regions may be the main reason [30, 31].
Table 4

Codon content of 20 amino acid and stop codon of 79 coding genes of 7 species

D. grandiflora T.natans L. salicaria L. intermis P. granatum W. fruticosa R. rotundifolia
Amino acidCodonRSCUa
AlaGCU1.751.781.841.631.801.761.72
AlaGCG0.510.440.470.610.460.520.53
AlaGCC0.670.640.610.680.630.670.64
AlaGCA1.071.131.091.091.111.051.10
CysUGU1.381.431.411.231.431.281.20
CysUGC0.620.570.590.770.570.730.80
AspGAU1.571.561.591.571.591.571.56
AspGAC0.430.440.410.430.410.430.45
GluGAG0.490.490.500.500.480.500.47
GluGAA1.511.521.501.511.521.501.53
PheUUU1.301.251.311.321.301.311.31
PheUUC0.700.750.690.680.700.700.69
GlyGGU1.251.321.311.141.271.181.20
GlyGGG0.700.710.650.860.660.770.82
GlyGGC0.470.410.440.510.460.510.51
GlyGGA1.581.561.601.491.611.551.47
HisCAC0.500.560.510.510.470.490.55
HisCAU1.501.441.491.491.531.511.45
IleAUU1.421.341.441.481.431.451.52
IleAUA0.921.030.910.790.900.900.80
IleAUC0.660.630.650.730.670.660.68
LysAAA1.461.461.461.461.471.441.45
LysAAG0.540.550.540.540.530.560.55
LeuCUA1.001.241.021.071.021.051.09
LeuCUC0.640.590.640.640.660.690.61
LeuCUG0.550.580.540.580.530.510.53
LeuCUU1.811.591.801.711.791.751.78
LeuUUA1.181.201.181.201.191.181.22
LeuUUG0.820.800.820.800.810.820.78
MetAUG1.001.001.001.001.001.001.00
AsnAAC0.460.500.450.560.440.470.56
AsnAAU1.541.501.551.441.561.541.44
ProCCA1.211.261.221.211.201.2191.23
ProCCC0.840.820.780.840.770.7610.84
ProCCU1.491.441.501.391.551.5031.41
ProCCG0.460.480.500.570.470.5170.53
GlnCAA1.541.481.541.501.561.5581.51
GlnCAG0.460.520.460.500.440.4420.49
ArgAGA1.441.431.451.351.441.3981.38
ArgAGG0.560.570.550.650.560.6020.62
ArgCGA1.601.601.631.621.631.5991.68
ArgCGC0.430.460.410.410.410.4850.42
ArgCGG0.500.580.470.700.490.6060.67
ArgCGU1.471.361.491.271.471.311.23
SerAGC0.540.540.500.670.530.6640.69
SerAGU1.461.461.501.331.471.3361.31
SerUCA0.911.180.941.230.950.9781.24
SerUCC0.990.830.960.800.940.9310.80
SerUCG0.540.520.520.590.530.5960.61
SerUCU1.561.481.581.371.591.4951.35
ThrACC0.820.770.810.910.820.8690.89
ThrACA1.161.231.171.201.181.1221.23
ThrACG0.490.450.470.540.480.5390.54
ThrACU1.531.561.551.351.521.4691.33
ValGUU1.491.391.481.531.481.4881.54
ValGUG0.500.570.530.510.530.5490.50
ValGUC0.550.530.530.620.570.5930.60
ValGUA1.451.511.461.341.421.371.36
TrpUGG1.001.001.001.001.0011.00
TyrUAC0.430.460.430.540.440.4830.52
TyrUAU1.571.541.571.461.561.5171.48
StopbUGA0.881.210.881.170.891.1221.21
StopbUAG0.800.760.800.710.810.6530.74
StopbUAA1.321.031.321.121.301.2251.045

aRelative synonymous codon usage; bStop codon

Codon content of 20 amino acid and stop codon of 79 coding genes of 7 species aRelative synonymous codon usage; bStop codon

Comparative genomic analysis within 22 Lythraceae species

Taking the annotation of L. excelsa as a reference, MVISTA was carried out with the cp genome sequences of 22 Lythraceae species. After the 22 cp genomes were pair wise compared, we found that the similarity between the sequences was rather high. From Fig. 2, it is apparent that the 14 Lagerstroemia species are separated from the eight other Lythraceae species. The divergence among the 14 Lagerstroemia species was low. The LSC and SSC regions had more variation than the IR regions, and the noncoding regions had greater differentiation than the coding regions. Some regions contained more variation, such as ndhF, ndhH, matK, ycf2, rpl22, accD, rpoB, rbcL, psbK among the coding genes and psbM-trnD, trnI-trnA, ndhF-rp132, rp132-trnL, ndhD-psaC, atpA-atpF, trnI-GAU intron, trnK-rps16, trnH-psbA among the intergenic regions (Fig. 2). Similar divergence levels were measured for these regions previously [32, 33].
Fig. 2

Sequence alignment of whole chloroplast genomes using the Shuffle LAGAN alignment algorithm in mVISTA. Lagerstroemia fauriei was chosen to be the reference genome. The vertical scale indicates the percentage of identity, ranging from 50 to 100%

Sequence alignment of whole chloroplast genomes using the Shuffle LAGAN alignment algorithm in mVISTA. Lagerstroemia fauriei was chosen to be the reference genome. The vertical scale indicates the percentage of identity, ranging from 50 to 100% Compared to the LSC and SSC regions of the 22 cp genomes, the IR regions were most conserved in terms of the sequence and number of genes. However, large variations also existed in connections between the IR, LSC and SSC regions. Inversion and translocation were not detected in the compared genomes. IR amplification and contraction were the main reasons for the difference in the size of these 22 cp genomes. Significant differences in evolutionary rates were present among the genes across the 22 Lythraceae species analyzed. Overall, the mean Ka/Ks were less than 0.5 for most genes (92.21%). 17 genes showed Ka/Ks higher than 1 for at least one species. Among the 17 genes, seven genes (rbcL, psbJ, rpl2, rpl20, rpl23, ccsA and ycf4) presented these high rates for at least 15 species. The results showed that the seven genes may be under positive selection. Seven genes associated with photosynthesis (psbN, psbI, psaC, atpH, petD, psbD and psbM) showed the lowest rates of evolution (mean Ka/Ks = 0 to 0.0057), and showed uniform rates in most species evaluated. The Ka/Ks of psbN, psbI, psaC and atpH were 0 because there were no non-synonymous substitutions (Additional file 3: Table S3). In order to detect a possible evolutionary rate acceleration in particular phylogenetic branches, We analyzed three genes with most variable Ka/Ks, namely rpl23 (large subunit of ribosome), rbcL (large subunit of rubisco) and ycf4 (genes of unknown function). Since the Ka/Ks in comparison among 14 Lagerstroemia species were almost 0, we compared the Ka/Ks at rpl23, rbcL and ycf4 in comparison of 14 Lagerstroemia species and the remaining eight Lythraceae species. For the rpl23 gene, the Ka/Ks ranged from 0.891 to 1.8077 except for the comparison with D. grandiflora. There was no non-synonymous substitution between Lagerstroemia species and D. grandiflora in addition to L. excelsa. As seen in the phylogenetic tree, the relationship between the D. grandiflora and the 14 Lagerstroemia species was closer than the other seven Lythraceae species. For the rbcL gene, the Ka/Ks ranged from 0.1119 to 0.3849, which may be due to a low Ks value (0.0046–0.0177). For the ycf4 gene, in addition to the comparison with W. fruticosa (2.4259–2.8340), the ratio of Lagerstroemia species and other seven Lythraceae species ranged from 0.0305 to 0.8758. The result showed that the rpl23 gene evolved faster than rbcL and ycf4. The Ka/Ks for the three genes of clade L. intermis-R. rotundifolia were invalid due to the Ks was 0. The Ka/Ks for the ycf4 and rbcL of clade P. granatum-W. fruticosa were 0.04 and 2.205, for rpl23 was invalid (Additional file 3: Table S3).

Genome size differences among the 22 Lythraceae cp genomes

Of the 22 Lythraceae species, L. subcostata was the shortest (152,049 bp), and L. villosa was the longest (160,769 bp). Except for L. villosa, the lengths of the cp genomes of Lagerstroemia species varied between 152,049 bp and 152,519 bp, while the cp genomes of the other genera of Lythraceae varied from 155,555 bp to 159,380 bp (Table 1). In general, the cp genomes of 13 Lagerstroemia species were significantly smaller than those of other Lythraceae. The longer length of the cp genome of L. villosa resembled those of the 6 newly sequenced species of Lythraceae more than it resembled the Lagerstroemia species. The lengths of the intergenic regions (IGS) ranged from 44,031 bp to 46,156 bp among the 13 Lagerstroemia species and 45,923 bp to 51,357 bp among the remaining species of Lythraceae, which was in accord with the lengths of the complete cp genomes (Table 4). As in other angiosperm plants, differences in IGS length contributed greatly to the variation in genome size. The percentage of GC content in the chloroplast genomes of the 22 species was 36.41–37.72%, with an average of 37.34%. The average GC content of Lagerstroemia species was 37.56%, which was higher than that of the other genera (36.88%).

Contraction and expansion of inverted repeats (IRs)

The genomic structure, including the number and sequence of genes, was highly conserved among the 22 Lythraceae species. However, there were structural changes in the IRA and IRB boundaries (Fig. 3). Although the IR region is more conserved than the other regions, the enlargement and contraction of IR boundaries played a major role in genome size [34-36].
Fig. 3

Comparison of junctions between the LSC, SSC, and IR regions among 22 species. Distance in the figure is not to scale. LSC, Large single-copy; SSC, Small single -copy; IR, inverted repeat

Comparison of junctions between the LSC, SSC, and IR regions among 22 species. Distance in the figure is not to scale. LSC, Large single-copy; SSC, Small single -copy; IR, inverted repeat The sizes of the IRs varied from 24,421 bp (T. natans) to 26,907 bp (L. villosa). Within the IRA-LSC boundaries of the 22 species, the boundaries of 18 species fell within the rps19 coding gene and caused an rps19 pseudogene in the IRB region. The IRA-LSC boundary of L. villosa was located on the left side of the rps19 coding gene and the IRA-LSC borders of D. grandiflora, W. fruticosa and H. myrtifolia were located on the right of the rps19 coding gene. The distance between rps19 and the IRA-LSC boundary ranged from 3 bp to 279 bp. Except for the 14 Lagerstroemia species and W. fruticosa, the IRA-SSC boundary was embedded in the ndhF encoding gene and had a length of 7 bp (W. fruticosa) to 158 bp (L. guilinensis) in the IRA region. For the other 7 Lythraceae species, ndhF was located on the right side of the IRA-SSC at a distance of 28 bp to 55 bp from the boundary. For all species, the SSC-IRB boundary was located in the ycf1 gene with a length of 1062 bp to 2252 bp in the IRB region, causing a ycf1 pseudogene in the IRA region with a corresponding length. The trnH-GUG noncoding gene was located on the right side of the IRB-LSC boundary ranging from 69 bp to 75 bp at a distance of 0 to 33 bp from the IRB-LSC boundary.

Long repeat structure analysis

Twenty-two Lythraceae species had 383 long repeats of four types. Eighteen species had only forward and palindromic repeats, and only T. natans had all four kinds of repeats. L. indica had the largest number of repeats, including 22 forward and six palindromic repeats. W. fruticosa, P. granatum, L. salicaria and P. granatum had only seven long repeats. As a whole, H. myrtifolia and the 14 Lagerstroemia species had more long repeats than D. grandiflora, T. natans, L. salicaria, L. inermis, P. granatum, W. fruticosa and R. rotundifolia (Fig. 4a, Additional file 4: Table S4). The copy length ranged from 30 bp to 81 bp. Repeat sequences of 30, 31 and 41 accounted for most of the total length (Fig. 4b).
Fig. 4

Number of long repetitive repeats on the complete chloroplast genome sequence of 22 Lythraceae species. a Frequency of repeat type; b Frequency of the repeats more than 30 bp long

Number of long repetitive repeats on the complete chloroplast genome sequence of 22 Lythraceae species. a Frequency of repeat type; b Frequency of the repeats more than 30 bp long

Simple sequence repeat (SSR) analysis

SSRs, also called short tandem repeats or microsatellites, are made up of nucleotide repeat units 1–6 bp in length [37]. SSRs play a significant role in plant taxonomy and are widely applied as molecular markers [38, 39]. There were 211–332 SSRs in each Lythraceae species that ranged from 8 to 16 bp in length (Fig. 5, Additional file 5: Table S5). Six kinds of SSRs were discovered: mononucleotide, di-nucleotide, tri-nucleotide, tetra-nucleotide, penta-nucleotide and hexa-nucleotide. However, hexa-nucleotide repeats were detected in only the cp genomes of L. siamica, L. intermedia, T. natans and L. salicaria. Among each Lythraceae species, mononucleotide repeats were the most common, with numbers ranging from 123 to 212; followed by trinucleotide ranging from 56 to 68; dinucleotide ranging from 16 to 52; tetranucleotide ranging from 6 to 12; pentanucleotide ranging from 0 to 2 and hexa-nucleotide ranging from 0 to 1. (Fig. 5a). It was previously found that mono-nucleotide repeats were richest in Fritillaria, Lilium and Epimedium [22, 40]. As a result, mononucleotide repeats may play a more important role in genetic variation than the other SSRs.
Fig. 5

The comparison of simple sequence repeats (SSR) distribution in 22 chloroplast genomes. a Number of different SSR types detected in 22 chloroplast genomes; b Frequency of common motifs in the 22 chloroplast genomes; c Frequency of SSRs in the LSC, IR, SSC region; d Frequency of SSRs in the intergenic regions, protein-coding genes and introns

The comparison of simple sequence repeats (SSR) distribution in 22 chloroplast genomes. a Number of different SSR types detected in 22 chloroplast genomes; b Frequency of common motifs in the 22 chloroplast genomes; c Frequency of SSRs in the LSC, IR, SSC region; d Frequency of SSRs in the intergenic regions, protein-coding genes and introns In the 22 Lythraceae species, A/T mononucleotide repeats accounted for 45.30 and 50.00%, respectively. C/G mononucleotide repeats accounted for 1.40 and 3.30%, respectively. Most of the other SSRs were composed of A/T, which may have led to the high AT content covering 62.66% of the whole cp genomes within the 22 Lythraceae species (Fig. 5b). Similar biases were also reported in Quercus [41]. Moreover, the number of A/T mononucleotide repeats in D. grandiflora, T. natans, L. salicaria, L. intermis, P. granatum, W. fruticosa, R. rotundifolia and H. myrtifolia were more than 13 Lagerstroemia species, ranging from 71 to 92/71–103. Among the 14 Lagerstroemia species, the number of A mononucleotide repeats ranged from 54 to 58, with T mononucleotide repeats ranging from 65 to 71, except in L. villosa. These results show that the A/T mononucleotide repeats numbers in the same genus are similar. However, the number of A/T mononucleotide repeats of L. villosa was 88/117, which was much higher than those of the other 13 Lagerstroemia species. We can infer that the longer intergenic spacers are the main reason. SSRs were much more frequently located in the LSC regions (62.90%) than in the IR regions (23.20%) and the SSC regions (13.90%) (Fig. 5c). Furthermore, SSRs in the cp genomes of the Lythraceae species were located mainly in the intergenic spacers, with an average of 132. SSRs dispersed in coding genes were second, with an average of 92. The fewest SSRs were located in the introns, with an average of 37 (Fig. 5d). The SSR loci were located in 31 coding genes (matK, atpI, rpoC2, rpoB, trnS-UGA, rps14, psaB, psaA, ndhK, accD, ycf4, cemA, petA, psaJ, psbB, rpoA, rpl22, rps19, rpl2, ycf2, rrn23, ndhF, rpl32, ccsA, ndhD, ndhA, ycf1, trnI-GAU, ndhB, ycf2) and 57 intergenic regions of the 22 Lythraceae species. Yu et al. found 20 SSRs located in 9 coding genes (matK, rpoC1, rpoC2, cemA, ndhD, ndhG, ndhH, ycf2 and ycf1) of the Fritillaria cp genome [23]. These results indicate that SSRs with large variation in cp genomes can be applied to identify related species and used in research on phylogeny.

Divergence hotspots among 22 Lythraceae species

Divergent hotspots on cp genomes can be utilized to identify closely related species and provide information about phylogeny [42, 43]. The nucleotide diversity (Pi) values of the coding regions and intergenic regions of the 22 cp genomes within Lythraceae were computed using the program DnaSP 5.1. It can be seen in Fig. 6 that the values for the intergenic regions were higher than those for the coding regions, indicating that intergenic regions were more differentiated. For the coding regions, the Pi values of the IR region ranged from 0.0029–0.0144, the Pi values of LSC ranged from 0.00261–0.04547 and the Pi values of SSC ranged from 0.01254–0.04532. For the intergenic regions, the Pi value of the IR region ranged from 0.00232–0.15964, the Pi values of the LSC ranged from 0 to 0.22362 and the Pi values of the SSC ranged from 0.03567–0.17653 (Fig. 6, Additional file 6: Table S6). A total of 10 hotspots with high divergence were selected as potential molecular markers to identify related species and examine phylogeny within Myrtales.
Fig. 6

The nucleotide variability (Pi) value in the 22 aligned Lythraceae chloroplast genomes. a Protein-coding genes; b Intergenic regions

The nucleotide variability (Pi) value in the 22 aligned Lythraceae chloroplast genomes. a Protein-coding genes; b Intergenic regions Combining the results of DnaSP and mVISTA, we assessed the ability of 10 regions to distinguish the 22 Lythraceae species using ML trees. In the coding regions, the four most variable genes were ndhF, matK, rbcL, and rpl22. For the intergenic regions, trnK-rps16, rpl32-trnL, trnM-atpE, psbM-trnD, trnH-psbA and ndhF-rpl32 were the most variable. The regions with the greatest divergence according to their Pi values were similar to the regions obtained from the mVISTA program. Among the 10 divergent hotspots, 7 hotspots were distributed in the LSC region, and the other 3 hotspots were located in the SSC region. The IR regions were so conserved that no highly divergent hotspots were detected. According to the ML trees, trnK-rps16, ndhF, and rpl32-trnL had the highest resolution. The trnK-rps16 gene clearly separated all the genera within Lythraceae, but the 14 Lagerstroemia species could only be divided into five large branches. The ndhF gene could also divide all the genera within Lythraceae with bootstrap values of 36–100%, and it separated all 14 Lagerstroemia species. Except for the node subtending L. venusta, L. intermedia and L. speciosa with the bootstrap value of 22%, the 14 Lagerstroemia species were separated with bootstrap values of 64–100%. The rpl32-trnL gene divided all the genera except for Lythrum and Heimia, and the 14 Lagerstroemia species could only be divided into five large branches. Compared with trnK-rps16 and rpl32-trnL, ndhF had the highest resolution and was the best candidate marker for barcoding.

Phylogenetic analysis of 22 Lythraceae species with related cp genomes within Myrtales

MP, ML and BI trees were constructed based on the 66 shared protein coding genes of 50 cp genomes (Additional file 7: Table S7). These cp genomes included those of 22 Lythraceae species, 12 Myrtaceae species, three Onagraceae species, five Melastomataceae species and eight species included as out groups. The 22 Lythraceae species included H. myrtifolia, P. granatum, 14 Lagerstroemia species and 6 newly sequenced species (D. grandiflora, T. natans, L. inermis, R. rotundifolia, L. salicaria and W. fruticosa). The topological structures of the ML trees, MP trees and BI trees were consistent, and the four families (Lythraceae, Onagraceae, Myrtaceae and Melastomataceae) were classified into four monophyletic clades. In addition, Melastomataceae was identified as the basal group in Myrtales. The five subfamilies of the Lythraceae gathered into one clade, demonstrating that P. granatum and T. natans, formerly considered to belong to Punicaceae, and Trapaceae belong to Lythraceae. The 14 Lagerstroemia species gathered into one clade. Only two nodes with bootstrap values under 90% in the ML tree. The remaining nodes had support values of more than 92%. The bootstrap values of all nodes reached 100% in the MP tree (Fig. 7). The results showed that the Melastomataceae family, which was sister to the other families within Myrtales, was the earliest differentiating group. The next family to diverge was the Myrtaceae family, followed by the Onagraceae and Lythraceae. The 22 Lythraceae species gathered into one clade, which was resolved as sister to three Onagraceae species (Ludwigia octovalvis, Oenothera argillicola and Oenothera biennis). As a whole, the phylogenetic tree showed clear internal relationships among Myrtales species.
Fig. 7

The phylogenetic tree is based on 66 shared protein-coding genes of 50 species. Numbers indicated the bootstrap values from the BI (left), ML (middle) analyses and MP (right) analyses

The phylogenetic tree is based on 66 shared protein-coding genes of 50 species. Numbers indicated the bootstrap values from the BI (left), ML (middle) analyses and MP (right) analyses

Discussion

Each of the 22 Lythraceae cp genomes had four conjoined structures and contained 110–112 distinctive genes consisting of 76–78 coding genes, 29–30 tRNAs and 4 rRNAs. The genome length ranged from 152,049 to 16,0769 bp with GC content between 36.41 and 37.72%. It was clear that the 22 cp genomes were highly conserved in genome size, structure and organization, which were also consistent with the cp genomes of Melastomataceae species reported previously [26]. The largest location of variation among the 22 Lythraceae cp genomes was in the intergenic areas, which is a common phenomenon in cp genomes [10, 44, 45]. The slow evolutionary rate and the low Ka/Ks detected in the analyzed Lythraceae species were within expectations, and Ka/Ks varied among groups of different functional genes. As a common evolutionary pattern for photosynthetic plants, photosynthesis genes (psbN, psbI, psaC, atpH, petD, psbD and psbM) had the lowest evolutionary rates. The genes rpl2, rpl20 and rpl23 involved in replication, rbcL and psbJ involved in photosynthesis, ycf4 of unknown functions and other genes including ccsA evolved more quickly and had high Ka/Ks (≥1). The seven genes evolved faster among 22 Lythraceae species analyzed were also found in Capsicum and Sesamum indicum species [23, 46]. Some genes are species-specific in terms of the rates of evolution, such as clpP gene. Although it is highly conserved in most green plants, it is by far the fastest evolving plastid-encoded gene in some angiosperms. The rates of evolution in the plastid Clp protease complex are extreme different [47]. The mean Ka/Ks of the clpP gene within Lythraceae species was 0.0395, which was different from the high ratio of Ka/Ks in some plants. Williams also found that clpP1 has undergone remarkably frequent bouts of accelerated sequence evolution, which may result from the intron loss in many lineages, such as Oenothera. However, the clpP gene contained two introns across 22 Lythraceae species, which may be the reason for its low Ka/Ks. The clpP experiencing negative (purifying) selection among Lythraceae species may result from conserved lengths (591 bp). Genes under positive selection typically have large insertions of more or less repeating amino acid sequence motifs [48]. Genes under positive selection may also be bound up with a recent increase in diversification rate after adapted to novel ecological conditions [49]. The boundaries between the four cp genomes regions are important in the evolution of some taxa [50]. For example, pseudogenes such as ψycf1 or ψrps19 were produced by contraction and expansion of the IR region. The ψycf1 pseudogene exists in all 22 Lythraceae species while the ψrps19 pseudogene was absent in 4 Lythraceae species. The rps19 gene was located in the LSC regions of H. myrtifolia, W. fruticosa and D. grandiflora. In the cp genome of L. villosa, the rps19 gene was fully duplicated in IRA, as has also been reported in some Malpighiales species [51]. In previous studies, comparative analysis based on complete cp genomes was scarce due to the limited number of published cp genomes of Lythraceae species, and the phylogenetic relationships within Lythraceae were not clear. P. granatum and T. natans were placed alone in the Punicaceae family and the Trapaceae family respectively. The relationship between T. natans and the other species within Myrtales could not be confirmed because of the large morphological variation in T. natans, so DNA data were necessary to confirm the location of T. natans in Myrtales. The rbcL gene, the pasA-ycf3 spacer, and the ITS sequences have been used to establish trees and infer phylogenetic relationships within Lythraceae, and these relationships were corroborated by our results. The sister relationship between Trapa and Sonneratia was strongly supported, while the sister relationship between Trapa and Lythrum was weakly supported. Overall, the position of T. natans in the family Lythraceae was confirmed in our phylogenetic analysis. Our results further show that P. granatum belong to the Lythraceae.

Conclusion

In this study, the newly sequenced cp genomes of D. grandiflora, T. natans, L. salicaria, L. inermis, W. fruticosa and R. rotundifolia were reported and combined with those of 16 other species to compare a total of 22 Lythraceae cp genomes. The cp genomes of the 22 Lythraceae species were similar in structure, composition and gene order, showing that they are highly conserved. Three phylogenetic trees showed that 42 Myrtales species were completely divided into four branches representing four families with high bootstrap values. From previously existing cp genomes, the evolutionary history of Myrtales had been preliminarily understood. The results of this study provide additional rich genetic resources for phylogenetic research and will play an important role in further study within Myrtales.

Materials and methods

DNA extraction of plant materials and sequencing

The fresh leaves of six species of Lythraceae within Myrtales (D. grandiflora, T. natans, L. salicaria, L. inermis, W. fruticosa and R. rotundifolia) were obtained from the nursery of Zhejiang A&F University, and then immediately stored in silica gel. A CTAB method was used to extract the genomic DNA [52]. A NanoDrop 2000 Micro spectrophotometer and an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA) were employed to evaluate the concentration and quality of the extracted DNA. Following the manufacturer’s instructions, the purified DNA was used to build a sequencing library. The Illumina HiSeq 2000 sequencer (Illumina Biotechnology Company, San Diego, CA) was used to obtain paired-end (PE) reads of 150 bp [9].

Chloroplast genome assembly, annotation, and structure

Trimmomatic v0.3 was used to trim and filter raw reads with a Phred quality score ≤ 20. The other parameters in Trimmomatic v0.3 were set as follows: the sliding window was set to 4:15, the trailing was set to 3, the leading was set to 3 and the minlen was set to 50 [53]. CLC version 9.11 (Qiagen Company, Hilden) with default parameters was used to perform de novo assembly. Four to eight different contigs were created for each species [54]. The BLAST algorithm was used with the L. fauriei cp genome as a reference to align all contigs. The ends of each contig could be overlapped by 50 to 80 bp and combined as one large cp genome. The Re-read mapping was also conducted to validate the genome. The coverage of each genome varied from 500x to 900x. DOGMA v1.2 was used to perform genome annotation [8–10, 55]. OGDRAW (http://ogdraw.mpimp-golm.mpg.de/) was used to draw the circular cp genome map of the Lythraceae species and then manually edited [56]. The relative synonymous codon usage (RSCU) is the ratio of the frequency of the specific codon to the expected frequency [57]. An RSCU > 1.00 means that a codon is used more frequently than expected, while an RSCU < 1.00 denotes that a codon is used less frequently than expected. The RSCU was obtained using DAMBE5 [58].

Genome comparative analysis and molecular marker identification

A total of 22 Lythraceae species were compared. Taking the L. excelsa annotation as the reference, the mVISTA in LAGAN mode was used to make pairwise alignments among the 22 cp Lythraceae species genomes [59]. The 77 protein coding regions of 22 Lythraceae species were used to evaluate evolutionary rate variation. DnaSP 5.1 was to calculate the rates of nonsynonymous (Ka) and synonymous substitutions (Ks) [60]. A total of 13,318 Ka/Ks were obtained; the value could not be calculated if Ks = 0. MEGA 6 was used to align the cp genomes after manual adjustments in BioEdit software [61]. Then, DnaSP 5.1 was used to separately evaluate the Pi values of the coding and noncoding sequences. Pi values across the complete cp genomes, LSC, SSC, and IR regions were also calculated using DnaSP 5.1 [62].

Identification of long repetitive sequences and simple sequence repeats (SSRs)

REPuter was used to detect four kinds of long repeats: forward, reverse, palindromic, and complementary repeats [63]. The parameters were set as follows: (1) the minimum repeat was more than 30 bp; (2) the sequence identity was more than 90%; (3) the Hamming distance was equal to 3. Msatcommander 0.8.2.0 was used to detect the location and number of SSRs [64] with the following settings: mononucleotides ≥8; dinucleotides ≥4; trinucleotides, tetranucleotides, pentanucleotide and hexanucleotide SSRs ≥3.

Phylogenetic analysis

To reconstruct the phylogenetic relationships and examine the phylogenetic status of Lythraceae within Myrtales, the complete cp genomes of 42 Myrtales species were used for analysis. Clustal X 2.1 software with default parameter settings was used to align 66 protein coding gene sequences, with manual adjustments to the alignment ends when necessary [65]. The data matrix used in phylogenetic analysis is provided as supplementary data. Evolutionary relationships were analyzed using MEGA 6 for maximum likelihood (ML) and maximum parsimony (MP), MrBayes 3.1.2 for Bayesian inference (BI) trees [60, 66]. If the bootstrap values of the nodes were equal to 100%, they were not marked on the tree. In all analyses, eight species were considered outgroups. The phylogenetic trees were plotted in FigTree [67]. Table S1. The genes having intron in the 22 Lythraceae chloroplast genomes. (XLSX 50 kb) Table S2. Codon usage and codon-anticodon recognition pattern of 22 Lythraceae species. (XLSX 117 kb) Table S3. The rates of Ka、Ks and Ka/Ks of 77 genes among 22 Lythraceae species. (XLSX 1497 kb) Table S4. The comparison of Long repeats among 22 Lythraceae species. (XLSX 72 kb) Table S5. The comparison of SSRs among 22 Lythraceae species. (XLSX 525 kb) Table S6. The nucleotide variability (Pi) value of Protein-coding genes and Intergenic regions. (XLSX 23 kb) Table S7. The GenBank accession numbers of 50 species using in phylogenetic. (DOCX 17 kb)
  23 in total

1.  The complete chloroplast genome sequences of three lilies: genome structure, comparative genomic and phylogenetic analyses.

Authors:  Yuan Li; LiNa Zhang; TianXi Wang; ChaoChao Zhang; RuiJia Wang; Da Zhang; YuQi Xie; NingNing Zhou; WeiZhen Wang; HuiMin Zhang; Bin Hu; WenHan Li; QingQing Zhao; LiHua Wang; XueWei Wu
Journal:  J Plant Res       Date:  2022-10-19       Impact factor: 3.000

2.  Complete chloroplast genome of novel Adrinandra megaphylla Hu species: molecular structure, comparative and phylogenetic analysis.

Authors:  Huu Quan Nguyen; Thi Ngoc Lan Nguyen; Thi Nhung Doan; Thi Thu Nga Nguyen; Mai Huong Phạm; Tung Lam Le; Danh Thuong Sy; Hoang Ha Chu; Hoang Mau Chu
Journal:  Sci Rep       Date:  2021-06-03       Impact factor: 4.379

3.  Comparative analysis of chloroplast genome structure and molecular dating in Myrtales.

Authors:  Xiao-Feng Zhang; Jacob B Landis; Hong-Xin Wang; Zhi-Xin Zhu; Hua-Feng Wang
Journal:  BMC Plant Biol       Date:  2021-05-15       Impact factor: 4.215

4.  Chloroplast phylogenomics and divergence times of Lagerstroemia (Lythraceae).

Authors:  Wenpan Dong; Chao Xu; Yanlei Liu; Jipu Shi; Wenying Li; Zhili Suo
Journal:  BMC Genomics       Date:  2021-06-09       Impact factor: 3.969

5.  The complete chloroplast genome sequence of Trapa kozhevnikoviorum Pshenn. (Lythraceae).

Authors:  Godfrey Kinyori Wagutu; Xiangrong Fan; Wuchao Wang; Wei Li; Yuanyuan Chen
Journal:  Mitochondrial DNA B Resour       Date:  2021-05-19       Impact factor: 0.658

6.  Next-Generation Genome Sequencing of Sedum plumbizincicola Sheds Light on the Structural Evolution of Plastid rRNA Operon and Phylogenetic Implications within Saxifragales.

Authors:  Hengwu Ding; Ran Zhu; Jinxiu Dong; Lan Jiang; Juhua Zeng; Qingyu Huang; Huan Liu; Wenzhong Xu; Longhua Wu; Xianzhao Kan
Journal:  Plants (Basel)       Date:  2019-09-29

7.  Complete chloroplast genome sequence of Punica granatum 'Nana' (Lythraceae) and phylogenetic analysis.

Authors:  Jie Wang; Zhi-Qiang Wu; Li Ma; Cui-Hua Gu
Journal:  Mitochondrial DNA B Resour       Date:  2020-05-14       Impact factor: 0.658

8.  Holocene chloroplast genetic variation of shrubs (Alnus alnobetula, Betula nana, Salix sp.) at the siberian tundra-taiga ecotone inferred from modern chloroplast genome assembly and sedimentary ancient DNA analyses.

Authors:  Stefano Meucci; Luise Schulte; Heike H Zimmermann; Kathleen R Stoof-Leichsenring; Laura Epp; Pernille Bronken Eidesen; Ulrike Herzschuh
Journal:  Ecol Evol       Date:  2021-01-31       Impact factor: 2.912

9.  The complete chloroplast genome sequence of Anogeissus acuminata (combretaceae).

Authors:  Jiaojun Yu; Tongmei Xu; Hongyu Wang; Dange Chen; Hongjin Dong
Journal:  Mitochondrial DNA B Resour       Date:  2020-05-12       Impact factor: 0.658

10.  Characteristics and Mutational Hotspots of Plastomes in Debregeasia (Urticaceae).

Authors:  Ruo-Nan Wang; Richard I Milne; Xin-Yu Du; Jie Liu; Zeng-Yuan Wu
Journal:  Front Genet       Date:  2020-07-08       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.