| Literature DB >> 35283921 |
Jing Ren1,2,3, Jing Tian2,3,4, Hui Jiang2,3,4, Xin-Xin Zhu5, Fredrick Munyao Mutie2,3,4, Vincent Okelo Wanga2,3,4, Shi-Xiong Ding2,3,4, Jia-Xin Yang2,3,4, Xiang Dong2,3,4, Ling-Ling Chen1, Xiu-Zhen Cai1, Guang-Wan Hu2,3,4.
Abstract
Coleanthus subtilis (Tratt.) Seidel (Poaceae) is an ephemeral grass from the monotypic genus Coleanthus Seidl, which grows on wet muddy areas such as fishponds or reservoirs. As a rare species with strict habitat requirements, it is protected at international and national levels. In this study, we sequenced its whole chloroplast genome for the first time using the next-generation sequencing (NGS) technology on the Illumina platform, and performed a comparative and phylogenetic analysis with the related species in Poaceae. The complete chloroplast genome of C. subtilis is 135,915 bp in length, with a quadripartite structure having two 21,529 bp inverted repeat regions (IRs) dividing the entire circular genome into a large single copy region (LSC) of 80,100 bp and a small single copy region (SSC) of 12,757 bp. The overall GC content is 38.3%, while the GC contents in LSC, SSC, and IR regions are 36.3%, 32.4%, and 43.9%, respectively. A total of 129 genes were annotated in the chloroplast genome, including 83 protein-coding genes, 38 tRNA genes, and 8 rRNA genes. The accD gene and the introns of both clpP and rpoC1 genes were missing. In addition, the ycf1, ycf2, ycf15, and ycf68 were pseudogenes. Although the chloroplast genome structure of C. subtilis was found to be conserved and stable in general, 26 SSRs and 13 highly variable loci were detected, these regions have the potential to be developed as important molecular markers for the subfamily Pooideae. Phylogenetic analysis with species in Poaceae indicated that Coleanthus and Phippsia were sister groups, and provided new insights into the relationship between Coleanthus, Zingeria, and Colpodium. This study presents the initial chloroplast genome report of C. subtilis, which provides an essential data reference for further research on its origin.Entities:
Keywords: Coleanthus subtilis; chloroplast genome; comparative analysis; monotypic genus; phylogeny
Year: 2022 PMID: 35283921 PMCID: PMC8908325 DOI: 10.3389/fpls.2022.828467
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
FIGURE 1Chloroplast genome map of C. subtilis. The genes located inside the circles are transcribed in a clockwise direction, while those outside the circle are transcribed counterclockwise. Different colored genes represent different functions, as shown in the legend at the bottom left. The inverted boundaries and GC content are drawn in the inner circle.
Features of the chloroplast genomes of C. subtilis and other Poaceae species.
| Species | Genome length (bp) | GC content (%) | Gene number | |||||||||
| Total | LSC | SSC | IR | Total | LSC | SSC | IR | Total | PCG | tRNA | rRNA | |
|
| 135915 | 80100 | 12757 | 21529 | 38.3 | 36.3 | 32.4 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 135613 | 79805 | 12750 | 21529 | 38.3 | 36.3 | 32.4 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 135353 | 79594 | 12727 | 21516 | 38.3 | 36.3 | 32.6 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 135086 | 79683 | 12609 | 21397 | 38.3 | 36.2 | 32.6 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 135943 | 80121 | 12390 | 21491 | 38.3 | 36.2 | 32.7 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 136705 | 80604 | 12769 | 21666 | 38.5 | 36.4 | 32.8 | 44.0 | 129 | 83 | 38 | 8(4) |
|
| 136408 | 80511 | 12835 | 21531 | 38.3 | 36.2 | 32.4 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 136726 | 80711 | 12701 | 21657 | 38.6 | 36.5 | 32.9 | 44.1 | 129 | 83 | 38 | 8(4) |
|
| 135551 | 79626 | 12671 | 21627 | 38.2 | 36.1 | 32.5 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 135946 | 80111 | 12625 | 21605 | 38.5 | 36.4 | 32.6 | 44.0 | 129 | 83 | 38 | 8(4) |
|
| 136330 | 81252 | 12666 | 21206 | 38.6 | 36.6 | 32.7 | 44.1 | 129 | 83 | 38 | 8(4) |
|
| 136823 | 79707 | 12722 | 22917 | 38.3 | 36.2 | 32.6 | 43.7 | 129 | 83 | 38 | 8(4) |
|
| 136934 | 80964 | 12566 | 21702 | 38.3 | 36.3 | 32.3 | 43.9 | 128 | 83 | 37 | 8(4) |
|
| 136682 | 80660 | 12688 | 21667 | 38.6 | 36.5 | 32.9 | 44.0 | 129 | 83 | 38 | 8(4) |
|
| 133798 | 78819 | 12497 | 21241 | 38.4 | 36.3 | 32.6 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 133608 | 78636 | 12474 | 21249 | 38.3 | 36.2 | 32.7 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 135272 | 79826 | 12598 | 21424 | 38.4 | 36.4 | 32.8 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 136395 | 80645 | 12466 | 21642 | 38.5 | 36.4 | 33.0 | 44.0 | 129 | 83 | 38 | 8(4) |
|
| 135175 | 79848 | 12485 | 21421 | 38.3 | 36.1 | 32.4 | 43.9 | 129 | 83 | 38 | 8(4) |
|
| 134710 | 80478 | 12570 | 20831 | 38.5 | 36.5 | 32.8 | 44.0 | 129 | 83 | 38 | 8(4) |
|
| 135794 | 79728 | 12760 | 21653 | 38.5 | 36.4 | 32.9 | 44.0 | 129 | 83 | 38 | 8(4) |
|
| 135568 | 80009 | 12823 | 21368 | 38.4 | 36.3 | 32.6 | 44.0 | 129 | 83 | 38 | 8(4) |
|
| 135466 | 79629 | 12685 | 21576 | 38.3 | 36.2 | 32.4 | 43.8 | 129 | 83 | 38 | 8(4) |
|
| 137370 | 81202 | 12842 | 21663 | 38.8 | 36.9 | 32.9 | 44.0 | 129 | 83 | 38 | 8(4) |
List of the annotated genes in the chloroplast genomes of C. subtilis.
| Category | Groups of genes | Name of genes |
| Self-replication | Ribosomal RNA | |
| Transfer RNA | ||
| Small subunit of ribosome | ||
| Large subunit of ribosome | ||
| RNA polymerase subunits | ||
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| Subunits of cytochrome | ||
| ATP synthase | ||
| NADH-dehydrogenase | ||
| Other genes | Rubisco large subunit |
|
| Translational initiation factor |
| |
| Maturase K |
| |
| Envelope membrane protein |
| |
| Proteolysis |
| |
| Cytochrome c biogenesis |
|
FIGURE 2Comparison of the chloroplast genome structures among 24 Poaceae species. The different colored squares represent different types of genes. Black represents transfer RNA (tRNA), or green if the tRNA has introns (rRNA). Red represents ribosomal RNA, while white represents protein coding genes (PCGs).
FIGURE 3A plot of comparative analysis of the boundary features of the 24 Poaceae species. The comparative regions are the boundaries of large single copy (LSC), small single copy (SSC) and inverted repeat (IR) regions.
FIGURE 4The Shuffle-LAGAN alignment was used in mVISTA to compare the contiguity of the chloroplast genomes of 24 species, with C. subtilis as the reference. The vertical scale in the figure indicates the degree of identity between 50% and 100%, while the horizontal scale shows the sequence information of the chloroplast genomes. Gray lines indicate gene direction, order and position.
FIGURE 5Relative synonymous codon usage (RSCU) values for amino acids and stop codons of the 76 protein-coding regions of C. subtilis. The colors of the histograms correspond to the colors of the codons.
FIGURE 6Comparative analysis plots of RSCU values for the five species. Each amino acid corresponds to five histograms, and their heights represent the RSCU value. The histogram from left to right is Coleanthus subtilis, Phippsia algida, Puccinellia nuttalliana, Sclerochloa dura, and Zingeria biebersteiniana.
FIGURE 7(A) Type of repeats in the whole chloroplast genomes of C. subtilis and its related species. (B) Size of repeats in the chloroplast genome of C. subtilis and its related species. (C) Distribution of repeats in functional regions of the plastid genome. (D) Distribution of repeats in regions of the chloroplast genomes. IR, inverted repeat; LSC, large single copy; SSC, small single copy; LSC/IR show those repeats for which one copy of the repeat exists in one region and a second copy exists in another region. CDS, protein-coding sequence; IGS, intergenic spacer region.
FIGURE 8(A) The type of SSRs in the cp genome of C. subtilis and its related species. (B) The region of SSRs in the cp genome of C. subtilis and its related species. (C) The unit of SSRs in the cp genome of C. subtilis and its related species. IR, inverted repeat; LSC, large single copy; SSC, small single copy.
FIGURE 9Nucleotide diversity of the chloroplast genomes of C. subtilis and its related species.
dN/dS ratios of the chloroplast genomes of C. subtilis and its related species.
| M8 | Gene | Region | Selected sites | Pr( | Number of sites |
| Naive Empirical Bayes (NEB) |
| LSC | 133 A | 0.958 | 1 |
|
| LSC | 1247 A | 1.000 | 1 | |
|
| LSC | 1646 L | 0.998 | 1 | |
|
| SSC | 1824 F | 0.963 | 1 | |
|
| LSC | 1989 L/2043 R/2092 F | 0.969*/0.958*/0.986 | 3 | |
|
| LSC | 2370 E | 0.964 | 1 | |
|
| LSC | 2521 H | 0.966 | 1 | |
|
| LSC | 2558 C/2599 S/2711 I/2713 I/2805 L/2807 L/2882 V/2937 Q | 0.958*/0.954*/0.951*/0.956*/0.956*/0.965*/0.953*/0.970 | 8 | |
|
| SSC | 3322 G | 0.954 | 1 | |
|
| SSC | 4425 I | 0.995 | 1 | |
|
| SSC | 5107 S/5119 I | 0.952*/0.951 | 2 | |
|
| LSC | 6446 K | 0.959 | 1 | |
|
| LSC | 7436 L/7917 V | 0.954*/0.960 | 2 | |
|
| LSC | 8195 S/8372 L | 0.954*/0.950 | 2 | |
|
| LSC | 10060 F/10233 F | 0.951*/0.953 | 2 | |
|
| LSC | 11311 V | 0.955 | 1 | |
|
| LSC | 12867 G | 0.959 | 1 | |
|
| LSC | 14684 G/14758 R | 0.967*/0.959 | 2 | |
|
| LSC | 15415 N/15426 I/15775 A/15836 F/15841 S/15860 K/15891 K/15907 F/16026 E/16204 Q/16260 A | 0.965*/0.968*/0.998**/0.958*/0.953*/0.962*/0.962*/0.964*/0.964*/0.961*/0.965 | 11 | |
|
| LSC | 16812 T | 0.967 | 1 | |
|
| LSC | 17399 E | 0.955 | 1 | |
| Bayes Empirical Bayes (BEB) |
| LSC | 1247 A | 0.974 | 1 |
The level of significance is indicated by the number of “*”, where “*” represents significant, and “**” indicates highly significant.
FIGURE 10Phylogenetic trees were generated based on the 76 shared protein-coding sequences of 53 species using maximum likelihood (ML) and Bayesian (BI) methods. The ML tree and BI tree have a consistent topology. The ML bootstrap values/Bayesian posterior probabilities are displayed on the nodes. To make Coleanthus subtilis more visible, it was marked with a star.