| Literature DB >> 31242865 |
Cuihua Gu1, Li Ma2, Zhiqiang Wu3, Kai Chen2, Yixiang Wang4.
Abstract
BACKGROUND: Lythraceae belongs to the order Myrtales, which is part of Archichlamydeae. The family has 31 genera containing approximately 620 species of herbs, shrubs and trees. Of these 31 genera, five large genera each possess 35 or more species. They are Lythrum, with 35; Rotala, with 45; Nesaea, with 50; Lagerstroemia, with 56; and Cuphea, with 275 species.Entities:
Keywords: Chloroplast genome; Lythraceae; Myrtales; Phylogenomic
Mesh:
Year: 2019 PMID: 31242865 PMCID: PMC6595698 DOI: 10.1186/s12870-019-1870-3
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Summary of complete chloroplast genomes for 22 species in Lythraceae
|
|
|
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Accession number | MK881635 | MK881627 | MK881633 | MK881628 | MK881632 | MK881630 | MK881636 | NC_029808 | NC_031825 | NC_029885 | |
| Family | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | |
| Total length (bp) | 152,214 | 152,153 | 160,769 | 152,519 | 152,294 | 152,521 | 152,294 | 152,440 | 152,240 | 152,193 | |
| GC(%) | 37.6 | 37.58 | 36.97 | 37.58 | 37.65 | 37.57 | 37.65 | 37.61 | 37.72 | 37.62 | |
| LSC | |||||||||||
| Length (bp) | 84,053 | 83,954 | 88,702 | 84,166 | 84,013 | 84,194 | 84,012 | 83,926 | 83,967 | 83,817 | |
| GC(%) | 35.94 | 35.92 | 34.69 | 35.89 | 35.98 | 35.87 | 35.97 | 35.94 | 36.1 | 35.95 | |
| length(%) | 55.2 | 55.2 | 55.2 | 55.2 | 55.2 | 55.2 | 55.2 | 55.1 | 55.2 | 55.1 | |
| SSC | |||||||||||
| Length (bp) | 16,917 | 16,905 | 18,255 | 16,865 | 16,917 | 16,833 | 16,798 | 16,934 | 16,787 | 16,909 | |
| GC(%) | 31.03 | 30.96 | 30.78 | 30.95 | 31.03 | 30.97 | 31.17 | 30.92 | 31.23 | 30.97 | |
| length(%) | 11.1 | 11.1 | 11.4 | 11.1 | 11.1 | 11.0 | 11.0 | 11.1 | 11.0 | 11.1 | |
| IR | |||||||||||
| Length (bp) | 25,622 | 25,647 | 26,906 | 25,744 | 25,622 | 25,747 | 25,742 | 25,790 | 25,788 | 25,794 | |
| GC(%) | 42.49 | 42.47 | 42.83 | 42.51 | 42.49 | 42.51 | 42.51 | 42.51 | 42.48 | 42.47 | |
| length(%) | 16.8 | 16.9 | 16.7 | 16.9 | 16.8 | 16.9 | 16.9 | 16.9 | 16.9 | 16.9 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
| NC_030484 | NC_031414 | NC_034952 | NC_034662 | MK881638 | MK881634 | MK881629 | MK881631 | NC_035240 | MK881637 | MK881626 | MG921615 |
| Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae |
| 152,205 | 152,476 | 152,049 | 152,330 | 156,084 | 155,555 | 158,483 | 157,756 | 158,639 | 159,380 | 157,753 | 159,219 |
| 37.59 | 37.58 | 37.59 | 37.59 | 37.49 | 36.41 | 36.81 | 36.89 | 36.92 | 36.63 | 36.89 | 37.00 |
| LSC | |||||||||||
| 84,046 | 84,051 | 83,890 | 83,987 | 86,471 | 88,506 | 88,999 | 88,424 | 89,022 | 89,569 | 88,422 | 88,571 |
| 35.93 | 35.89 | 35.92 | 35.92 | 35.59 | 34.19 | 34.75 | 34.76 | 34.89 | 34.53 | 34.76 | 35.00 |
| 55.2 | 55.1 | 55.2 | 55.1 | 55.4 | 56.9 | 56.2 | 56.1 | 56.1 | 56.2 | 56.1 | 55.6 |
| SSC | |||||||||||
| 16,915 | 16,886 | 16,909 | 16,871 | 16,501 | 18,274 | 18,530 | 17,386 | 18,685 | 18,697 | 17,386 | 18,822 |
| 30.98 | 30.97 | 30.97 | 30.93 | 31.28 | 30.18 | 30.68 | 31.01 | 30.63 | 30.23 | 31.01 | 30.60 |
| 11.1 | 11.0 | 11.1 | 11.1 | 10.6 | 11.7 | 11.7 | 11.0 | 11.8 | 11.7 | 11.0 | 11.8 |
| IR | |||||||||||
| 25,622 | 25,817 | 25,625 | 25,736 | 26,556 | 24,388 | 25,477 | 25,973 | 25,466 | 25,557 | 25,973 | 25,643 |
| 42.5 | 42.51 | 42.5 | 42.51 | 42.5 | 42.77 | 42.63 | 42.5 | 42.78 | 42.65 | 42.5 | 42.60 |
| 16.8 | 16.9 | 16.9 | 16.9 | 17.0 | 15.7 | 16.1 | 16.5 | 16.1 | 16.0 | 16.5 | 16.1 |
GC guanine-cytosine, LSC large single-copy region, SSC short single-copy region, IRs inverted repeats
Fig. 1Structural map of the Lythraceae chloroplast genome. Genes drawn inside the circle are transcribed clockwise, and those outside are counterclockwise. Small single copy (SSC), Large single copy (LSC), and inverted repeats (IRa, IRb) are indicated. Genes belonging to different functional groups are color-coded
Genes contained in the sequenced Lythraceae chloroplast genome
| Gene category | Groups of genes | Name of genes |
|---|---|---|
| Self-replication | Ribosomal RNAs |
|
| Transfer RNAs |
| |
|
| ||
|
| ||
|
| ||
|
| ||
| Small subunit of ribosome |
| |
| Large subunit of ribosome |
| |
| DNA dependent RNA polymerase |
| |
| Photosynthesis | Subunits of photosystem I |
|
| Subunits of photosystem II |
| |
|
| ||
| Subunits of cytochrome |
| |
| Subunits of ATP synthase |
| |
| ATP-dependent protease subunit p gene |
| |
| Large subunit of Rubisco |
| |
| Subunits of NADH dehydrogenase |
| |
| Other genes | Maturase |
|
| Envelop membrane protein |
| |
| Acetyl-CoAcarboxylase |
| |
| other |
| |
| Genes of unknown function | Conserved open reading frames |
|
aIntron-containing genes
bGenes located in the IR regions
Distribution of genes and Intergenic regions for 22 species in Lythraceae
|
|
|
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Accession number | MK881635 | MK881627 | MK881633 | MK881628 | MK881632 | MK881630 | MK881636 | NC_029808 | NC_031825 | NC_029885 | |
| Family | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | Lythraceae | |
| Protein Coding Genes | |||||||||||
| Length (bp) | 79,046 | 79,080 | 79,648 | 79,056 | 79,062 | 79,059 | 79,062 | 79,062 | 78,852 | 79,068 | |
| GC(%) | 37.86 | 37.84 | 37.77 | 37.8 | 37.87 | 37.8 | 37.86 | 37.84 | 37.92 | 37.86 | |
| length(%) | 52 | 52 | 50 | 52 | 52 | 52 | 52 | 52 | 52 | 52 | |
| rRNA | |||||||||||
| Length (bp) | 9038 | 9038 | 9035 | 9040 | 9038 | 9040 | 9038 | 9068 | 9044 | 9044 | |
| GC(%) | 55.72 | 55.72 | 55.43 | 55.62 | 55.68 | 55.64 | 55.68 | 55.67 | 55.57 | 55.71 | |
| length(%) | 6 | 6 | 6 | 6 | 6 | 6 | 6 | 6 | 6 | 6 | |
| tRNA | |||||||||||
| Length (bp) | 2913 | 2819 | 2814 | 2808 | 2813 | 2814 | 2813 | 2809 | 2745 | 2741 | |
| GC(%) | 52.8 | 53.35 | 53.2 | 53.45 | 53.47 | 53.45 | 53.47 | 53.15 | 53.55 | 53.37 | |
| length(%) | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | |
| Intergenic Regions | |||||||||||
| Length (bp) | 45,482 | 46,156 | 51,367 | 45,619 | 45,941 | 45,486 | 46,033 | 44,138 | 45,266 | 44,031 | |
| GC(%) | 32.55 | 32.43 | 31.3 | 32.52 | 32.49 | 32.54 | 32.84 | 32.37 | 32.54 | 32.37 | |
| length(%) | 30 | 30 | 32 | 30 | 30 | 30 | 30 | 29 | 30 | 29 | |
| Intron | |||||||||||
| Length (bp) | 15,334 | 15,419 | 18,099 | 15,607 | 15,590 | 15,606 | 14,786 | 15,834 | 15,877 | 15,596 | |
| GC(%) | 38.2 | 38.19 | 37.82 | 38.19 | 38.32 | 38.18 | 37.64 | 38.42 | 38.7 | 38.33 | |
| length(%) | 10 | 10 | 11 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
| NC_030484 | NC_031414 | NC_034952 | NC_034662 | MK881638 | MK881634 | MK881629 | MK881631 | NC_035240 | MK881637 | MK881626 | MG921615 |
| Lythraceae | Lythraceae | Lythraceae | Lythraceae | Sonneratiaceae | Trapaceae | Lythraceae | Lythraceae | Punicaceae | Lythraceae | Lythraceae | Lythraceae |
| Protein Coding Genes | |||||||||||
| 73,401 | 79,044 | 77,139 | 79,035 | 78,993 | 78,848 | 78,849 | 79,006 | 79,029 | 78,978 | 79,000 | 81,047 |
| 38.45 | 37.79 | 37.76 | 37.81 | 37.88 | 37.27 | 37.62 | 37.54 | 37.63 | 37.53 | 37.54 | 37.00 |
| 48 | 52 | 51 | 52 | 51 | 51 | 50 | 50 | 50 | 50 | 50 | 51 |
| rRNA | |||||||||||
| 9050 | 9046 | 9042 | 9046 | 9040 | 9022 | 9038 | 9038 | 9038 | 9038 | 9038 | 9050 |
| 55.69 | 55.56 | 55.68 | 55.58 | 55.55 | 55.51 | 55.17 | 55.28 | 55.26 | 55.28 | 55.28 | 55.00 |
| 6 | 6 | 6 | 6 | 6 | 6 | 6 | 6 | 6 | 6 | 6 | 6 |
| tRNA | |||||||||||
| 2817 | 2742 | 2828 | 2807 | 2903 | 2812 | 2813 | 2812 | 2817 | 2819 | 2812 | 2817 |
| 53.25 | 53.61 | 53.39 | 53.44 | 52.77 | 53.24 | 53.36 | 53.31 | 53.28 | 53.42 | 53.31 | 53.00 |
| 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| Intergenic Regions | |||||||||||
| 44,535 | 44,313 | 44,184 | 45,346 | 45,923 | 48,755 | 50,851 | 49,417 | 51,357 | 50,989 | 49,441 | 50,172 |
| 32.35 | 32.45 | 32.61 | 32.58 | 32.36 | 30.46 | 31.32 | 31.38 | 31.55 | 30.99 | 31.62 | 32.00 |
| 29 | 29 | 29 | 30 | 29 | 31 | 32 | 31 | 32 | 32 | 31 | 32 |
| Intron | |||||||||||
| 16,226 | 15,861 | 16,201 | 16,375 | 15,879 | 15,564 | 15,943 | 15,915 | 15,928 | 15,973 | 15,915 | 16,133 |
| 37.91 | 38.33 | 37.87 | 37.89 | 38.27 | 37.14 | 37.78 | 37.79 | 37.9 | 37.7 | 37.79 | 38.00 |
| 11 | 10 | 11 | 11 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 |
GC guanine-cytosine, LSC large single-copy region, SSC short single-copy region, IRs inverted repeats
Codon content of 20 amino acid and stop codon of 79 coding genes of 7 species
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|
| Amino acid | Codon | RSCUa | ||||||
| Ala | GCU | 1.75 | 1.78 | 1.84 | 1.63 | 1.80 | 1.76 | 1.72 |
| Ala | GCG | 0.51 | 0.44 | 0.47 | 0.61 | 0.46 | 0.52 | 0.53 |
| Ala | GCC | 0.67 | 0.64 | 0.61 | 0.68 | 0.63 | 0.67 | 0.64 |
| Ala | GCA | 1.07 | 1.13 | 1.09 | 1.09 | 1.11 | 1.05 | 1.10 |
| Cys | UGU | 1.38 | 1.43 | 1.41 | 1.23 | 1.43 | 1.28 | 1.20 |
| Cys | UGC | 0.62 | 0.57 | 0.59 | 0.77 | 0.57 | 0.73 | 0.80 |
| Asp | GAU | 1.57 | 1.56 | 1.59 | 1.57 | 1.59 | 1.57 | 1.56 |
| Asp | GAC | 0.43 | 0.44 | 0.41 | 0.43 | 0.41 | 0.43 | 0.45 |
| Glu | GAG | 0.49 | 0.49 | 0.50 | 0.50 | 0.48 | 0.50 | 0.47 |
| Glu | GAA | 1.51 | 1.52 | 1.50 | 1.51 | 1.52 | 1.50 | 1.53 |
| Phe | UUU | 1.30 | 1.25 | 1.31 | 1.32 | 1.30 | 1.31 | 1.31 |
| Phe | UUC | 0.70 | 0.75 | 0.69 | 0.68 | 0.70 | 0.70 | 0.69 |
| Gly | GGU | 1.25 | 1.32 | 1.31 | 1.14 | 1.27 | 1.18 | 1.20 |
| Gly | GGG | 0.70 | 0.71 | 0.65 | 0.86 | 0.66 | 0.77 | 0.82 |
| Gly | GGC | 0.47 | 0.41 | 0.44 | 0.51 | 0.46 | 0.51 | 0.51 |
| Gly | GGA | 1.58 | 1.56 | 1.60 | 1.49 | 1.61 | 1.55 | 1.47 |
| His | CAC | 0.50 | 0.56 | 0.51 | 0.51 | 0.47 | 0.49 | 0.55 |
| His | CAU | 1.50 | 1.44 | 1.49 | 1.49 | 1.53 | 1.51 | 1.45 |
| Ile | AUU | 1.42 | 1.34 | 1.44 | 1.48 | 1.43 | 1.45 | 1.52 |
| Ile | AUA | 0.92 | 1.03 | 0.91 | 0.79 | 0.90 | 0.90 | 0.80 |
| Ile | AUC | 0.66 | 0.63 | 0.65 | 0.73 | 0.67 | 0.66 | 0.68 |
| Lys | AAA | 1.46 | 1.46 | 1.46 | 1.46 | 1.47 | 1.44 | 1.45 |
| Lys | AAG | 0.54 | 0.55 | 0.54 | 0.54 | 0.53 | 0.56 | 0.55 |
| Leu | CUA | 1.00 | 1.24 | 1.02 | 1.07 | 1.02 | 1.05 | 1.09 |
| Leu | CUC | 0.64 | 0.59 | 0.64 | 0.64 | 0.66 | 0.69 | 0.61 |
| Leu | CUG | 0.55 | 0.58 | 0.54 | 0.58 | 0.53 | 0.51 | 0.53 |
| Leu | CUU | 1.81 | 1.59 | 1.80 | 1.71 | 1.79 | 1.75 | 1.78 |
| Leu | UUA | 1.18 | 1.20 | 1.18 | 1.20 | 1.19 | 1.18 | 1.22 |
| Leu | UUG | 0.82 | 0.80 | 0.82 | 0.80 | 0.81 | 0.82 | 0.78 |
| Met | AUG | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| Asn | AAC | 0.46 | 0.50 | 0.45 | 0.56 | 0.44 | 0.47 | 0.56 |
| Asn | AAU | 1.54 | 1.50 | 1.55 | 1.44 | 1.56 | 1.54 | 1.44 |
| Pro | CCA | 1.21 | 1.26 | 1.22 | 1.21 | 1.20 | 1.219 | 1.23 |
| Pro | CCC | 0.84 | 0.82 | 0.78 | 0.84 | 0.77 | 0.761 | 0.84 |
| Pro | CCU | 1.49 | 1.44 | 1.50 | 1.39 | 1.55 | 1.503 | 1.41 |
| Pro | CCG | 0.46 | 0.48 | 0.50 | 0.57 | 0.47 | 0.517 | 0.53 |
| Gln | CAA | 1.54 | 1.48 | 1.54 | 1.50 | 1.56 | 1.558 | 1.51 |
| Gln | CAG | 0.46 | 0.52 | 0.46 | 0.50 | 0.44 | 0.442 | 0.49 |
| Arg | AGA | 1.44 | 1.43 | 1.45 | 1.35 | 1.44 | 1.398 | 1.38 |
| Arg | AGG | 0.56 | 0.57 | 0.55 | 0.65 | 0.56 | 0.602 | 0.62 |
| Arg | CGA | 1.60 | 1.60 | 1.63 | 1.62 | 1.63 | 1.599 | 1.68 |
| Arg | CGC | 0.43 | 0.46 | 0.41 | 0.41 | 0.41 | 0.485 | 0.42 |
| Arg | CGG | 0.50 | 0.58 | 0.47 | 0.70 | 0.49 | 0.606 | 0.67 |
| Arg | CGU | 1.47 | 1.36 | 1.49 | 1.27 | 1.47 | 1.31 | 1.23 |
| Ser | AGC | 0.54 | 0.54 | 0.50 | 0.67 | 0.53 | 0.664 | 0.69 |
| Ser | AGU | 1.46 | 1.46 | 1.50 | 1.33 | 1.47 | 1.336 | 1.31 |
| Ser | UCA | 0.91 | 1.18 | 0.94 | 1.23 | 0.95 | 0.978 | 1.24 |
| Ser | UCC | 0.99 | 0.83 | 0.96 | 0.80 | 0.94 | 0.931 | 0.80 |
| Ser | UCG | 0.54 | 0.52 | 0.52 | 0.59 | 0.53 | 0.596 | 0.61 |
| Ser | UCU | 1.56 | 1.48 | 1.58 | 1.37 | 1.59 | 1.495 | 1.35 |
| Thr | ACC | 0.82 | 0.77 | 0.81 | 0.91 | 0.82 | 0.869 | 0.89 |
| Thr | ACA | 1.16 | 1.23 | 1.17 | 1.20 | 1.18 | 1.122 | 1.23 |
| Thr | ACG | 0.49 | 0.45 | 0.47 | 0.54 | 0.48 | 0.539 | 0.54 |
| Thr | ACU | 1.53 | 1.56 | 1.55 | 1.35 | 1.52 | 1.469 | 1.33 |
| Val | GUU | 1.49 | 1.39 | 1.48 | 1.53 | 1.48 | 1.488 | 1.54 |
| Val | GUG | 0.50 | 0.57 | 0.53 | 0.51 | 0.53 | 0.549 | 0.50 |
| Val | GUC | 0.55 | 0.53 | 0.53 | 0.62 | 0.57 | 0.593 | 0.60 |
| Val | GUA | 1.45 | 1.51 | 1.46 | 1.34 | 1.42 | 1.37 | 1.36 |
| Trp | UGG | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 1.00 |
| Tyr | UAC | 0.43 | 0.46 | 0.43 | 0.54 | 0.44 | 0.483 | 0.52 |
| Tyr | UAU | 1.57 | 1.54 | 1.57 | 1.46 | 1.56 | 1.517 | 1.48 |
| Stopb | UGA | 0.88 | 1.21 | 0.88 | 1.17 | 0.89 | 1.122 | 1.21 |
| Stopb | UAG | 0.80 | 0.76 | 0.80 | 0.71 | 0.81 | 0.653 | 0.74 |
| Stopb | UAA | 1.32 | 1.03 | 1.32 | 1.12 | 1.30 | 1.225 | 1.045 |
aRelative synonymous codon usage; bStop codon
Fig. 2Sequence alignment of whole chloroplast genomes using the Shuffle LAGAN alignment algorithm in mVISTA. Lagerstroemia fauriei was chosen to be the reference genome. The vertical scale indicates the percentage of identity, ranging from 50 to 100%
Fig. 3Comparison of junctions between the LSC, SSC, and IR regions among 22 species. Distance in the figure is not to scale. LSC, Large single-copy; SSC, Small single -copy; IR, inverted repeat
Fig. 4Number of long repetitive repeats on the complete chloroplast genome sequence of 22 Lythraceae species. a Frequency of repeat type; b Frequency of the repeats more than 30 bp long
Fig. 5The comparison of simple sequence repeats (SSR) distribution in 22 chloroplast genomes. a Number of different SSR types detected in 22 chloroplast genomes; b Frequency of common motifs in the 22 chloroplast genomes; c Frequency of SSRs in the LSC, IR, SSC region; d Frequency of SSRs in the intergenic regions, protein-coding genes and introns
Fig. 6The nucleotide variability (Pi) value in the 22 aligned Lythraceae chloroplast genomes. a Protein-coding genes; b Intergenic regions
Fig. 7The phylogenetic tree is based on 66 shared protein-coding genes of 50 species. Numbers indicated the bootstrap values from the BI (left), ML (middle) analyses and MP (right) analyses