| Literature DB >> 36118878 |
Lucun Yang1,2, Jingjing Li3, Guoying Zhou1,2.
Abstract
Swertia L. is a large genus in the family Gentianaceae. Different chloroplast gene segments have been used to study systematic evolutionary relationships between species of Swertia L. However, as gene fragment-based phylogenies lack sufficient resolution, the systematic evolutionary relationships between Swertia L. species have remained unclear. We sequenced and annotated the complete chloroplast genomes of four Swertia species, namely, S. bifolia, S. tetraptera, S. franchetian, and S. przewalskii, using next generation sequencing and the plastid genome annotator tool. The chloroplast genome sequences of 19 additional species of Swertia L. were downloaded from the NCBI database and also assessed. We found that all 23 Swertia L. species had a similar genetic structure, that is, a ring tetrad structure, but with some clear differences. The chloroplast genomes of the 23 Swertia L. species were 149036-153691 bp long, averaging 152385 bp; the genomes contained 134 functional genes: 38 tRNA, eight rRNA, and 88 protein-encoding genes. A comparative analysis showed that chloroplasts genome of Swertia was conserved in terms of genome structure, codon preference, and repeat sequences, but it differed in terms of genome sizes, gene contents, and SC/IR boundary. Using Swertia wolfangiana as a reference, we found clear divergences in most of the non-coding and intergenic regions of the complete chloroplast genomes of these species; we also found that rpoC1, ccsA, ndhI, ndhA, and rps15 protein-coding genes had large variations. These highly variable hotspots will be useful for future phylogenetic and population genetic studies. Phylogenetic analysis with high bootstrap support showed that Swertia L. was not monophyletic. The classification of subgen. Swertia and subgen. Ophelia was supported by molecular data, which also partly supported the division of sect. Ophelia, sect. Platynema, sect. Poephila, sect. Swertia, and sect. Macranthos. However, the systematic positions of other groups and species require further exploration. The Swertia L formed at 29.60 Ma. Speciation of 10 species occurred in succession after 12 Ma and 13 species occurred in succession after 2.5 Ma. Our analysis provides insight into the unresolved evolutionary relationships of Swertia L. species.Entities:
Keywords: Swertia; chloroplast genome; comparative analysis; phylogenetic analysis; repeat sequences
Year: 2022 PMID: 36118878 PMCID: PMC9470856 DOI: 10.3389/fgene.2022.895146
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Complete genome features of Swertia L. species.
| Species | All length (bp) | GC (%) | LSC length (bp) | GC (%) | SSC length (bp) | GC (%) | IR length (bp) | GC (%) | GenBank accession numbers |
|---|---|---|---|---|---|---|---|---|---|
|
| 153,242 | 38.06 | 83,496 | 36.16 | 18,200 | 31.89 | 25,773 | 43.33 | ON018645 |
|
| 153,751 | 38.03 | 84,156 | 36.02 | 18,089 | 32.07 | 25,753 | 43.39 | MW344296 |
|
| 149,089 | 38.20 | 80,481 | 36.34 | 17,946 | 31.79 | 25,331 | 43.42 | MZ261898 |
|
| 153,429 | 38.05 | 83,612 | 36.16 | 18,037 | 31.75 | 25,890 | 43.3 | NC_054359 |
|
| 152,977 | 37.5 | 83,622 | 35.55 | 18,092 | 31.25 | 25,069 | 43.02 | MZ261899.1 |
|
| 150,057 | 38.17 | 81,310 | 36.28 | 17,887 | 31.79 | 25,430 | 43.42 | MW344298 |
|
| 153,691 | 38.10 | 83,859 | 36.20 | 18,300 | 31.9 | 25,766 | 43.5 | NC057681.1 |
|
| 153,039 | 38.10 | 83,372 | 36.18 | 18,249 | 31.89 | 25,709 | 43.33 | MW344299 |
|
| 153,428 | 38.2 | 83,564 | 34.66 | 18,342 | 33.22 | 25, 749 | 43.28 | NC_056357 |
|
| 149,488 | 38.19 | 80,727 | 36.30 | 17,903 | 31.81 | 25,429 | 43.42 | NC_044474 |
|
| 153,475 | 38.15 | 83,595 | 36.23 | 18,348 | 31.93 | 25,766 | 43.47 | MZ261902 |
|
| 153,015 | 38.17 | 83,048 | 36.35 | 18,395 | 31.90 | 25,785 | 43.44 | NC_045301 |
|
| 152,737 | 38.22 | 83,046 | 36.31 | 18,231 | 31.99 | 25,730 | 43.50 | MZ261903 |
|
| 152,190 | 38.10 | 82,893 | 36.25 | 18,343 | 31.82 | 25,477 | 43.35 | NC_050660 |
|
| 153,499 | 38.16 | 83,591 | 36.23 | 18,336 | 31.95 | 25,761 | 43.50 | KU641021 |
|
| 153,690 | 38.12 | 83,864 | 36.25 | 18,254 | 31.82 | 25,786 | 43.37 | NC_057596 |
|
| 151,079 | 38.1 | 81,780 | 33.22 | 18,193 | 33.66 | 25,553 | 42.16 | ON017794 |
|
| 149,036 | 38.19 | 80,432 | 36.33 | 17,936 | 31.81 | 25,334 | 43.42 | MZ261905 |
|
| 153,448 | 38.15 | 83,535 | 36.25 | 18,345 | 31.88 | 25,784 | 43.47 | MZ261896 |
|
| 152,804 | 38.08 | 83,195 | 36.17 | 18,105 | 31.89 | 25,752 | 43.33 | NC_052874 |
|
| 152,787 | 38.1 | 83,177 | 32.18 | 18,305 | 32.18 | 25,679 | 44.38 | ON164641 |
|
| 151,682 | 38.14 | 82,623 | 36.26 | 18,335 | 31.83 | 25,362 | 43.48 | MF795137 |
|
| 153,225 | 38.06 | 83,528 | 36.17 | 18,219 | 31.88 | 25,739 | 43.34 | MW344307 |
FIGURE 1Structure and characteristics of the complete chloroplast genomes of 23 Swertia L. species. Genes inside and outside the circle are transcribed clockwise and counterclockwise separately. Darker and lighter grey in the inner circle each represent GC and AT content.
Gene composition of chloroplast genome of all Swertia L. species.
| Category | Group of genes | Name of genes |
|---|---|---|
| Photosynthesis | Photosystem I |
|
| Photosystem II |
| |
| NADH dehydrogenase |
| |
| Cytochrome b/f complex |
| |
| ATP synthase |
| |
| Self-replication | Ribosomal proteins (SSU) |
|
| Ribosomal proteins (LSU) |
| |
| Ribosomal RNAs |
| |
| Transfer RNAs | tRNA-Lys | |
| DNA-dependent RNA polymerase |
| |
| Other genes | Maturase | matK |
| Protease | clpP | |
| Envelope membrane protein | cemA | |
| Subunit acetyl-CoA carboxylase | Accd | |
| c-Type cytochrome synthesis gene | ccsA | |
| Genes of unkown function | Conserved open reading frames | ycf1, 2a, 3 |
represents a gene with one intron.
represents a gene with two introns.
represents trans-splice gene.
FIGURE 2Type of repeated sequences in the 23 Swertia L. plastid genomes. (A) Number of repeat sequences by length; (B) number of four repeat types (Note: BIF represents S. bifolia; BIM represents S. bimaculata; CIN represents S. cincta; COR represents S. cordata; DIC represents S. dichotoma; DILA represents S. dilatata; DIL represents S. diluta; ERY represents S. erythrosticta; FRA represents S. franchetiana; HIS represents S. hispidicalyx; KOU represents S. kouitchensis; LED represents S. leducii; MAC represents S. macrosperma; MUL represents S. multicaulis; MUS represents S. mussotii; NER represents S. nervosa; PRZ represents S. przewalskii; PUB represents S. pubescens; PUN represents S. punicea; SOU represents S. souliei; TET represents S. tetraptera; VET represents S. verticillifolia; and WOL represent S. wolfgangiana); (C) pie chart showing the numbers of four repeat types.
FIGURE 3Simple sequence repeats (SSRs) in the 23 Swertia L. plastid genomes.
FIGURE 4Comparison and analysis based on chloroplast genome of 23 Swertia L. species. Orientation of genes was pointed out by arrows up the alignments. Purple, blue, pink, and grey bars correspond to exons, untranslated regions, non-coding sequences, and mRNA, respectively. The Y-axis indicates the genetic similarity percentage. Genetic similarity among 50%–100% were showed in the figure (for interpretation of the references to color in this figure legend, the reader is referred to the web version of this article).
FIGURE 5MAUVE alignment of 23 Swertia L. species chloroplast genomes. The S wolfgangiana genome is shown at the top as the reference genome.
FIGURE 6Percentages of variable characters in homologous regions among chloroplast genomes of 23 Swertia L. species. (A) Coding region. (B) Non-coding region. The homologous regions are oriented according to their locations in the chloroplast genome.
FIGURE 7Comparative analysis of chloroplast genomic boundaries of the 23 Swertia L. plastid genomes.
FIGURE 8Phylogenetic tree of 23 Swertia L. species using Bayesian inference (BI) analyses based on whole chloroplast genomes.
FIGURE 9Divergence time estimated using BEAST.