| Literature DB >> 35108324 |
Xiaoyang Wang1,2, Tashi Dorjee1,2, Yiru Chen1,2, Fei Gao1,2, Yijun Zhou1,2.
Abstract
Tetraena mongolica, Zygophyllum xanthoxylon, and Z. fabago are three typical dryland plants with important ecological values in subfamily Zygophylloideae of Zygophyllaceae. Studies on the chloroplast genomes of them are favorable for understanding the diversity and phylogeny of Zygophyllaceae. Here, we sequenced and assembled the whole chloroplast genomes of T. mongolica, Z. xanthoxylon, and Z. fabago, and performed comparative genomic and phylogenetic analysis. The total size, structure, gene content and orders of these three chloroplast genomes were similar, and the three chloroplast genomes exhibited a typical quadripartite structure with a large single-copy region (LSC; 79,696-80,291 bp), a small single-copy region (SSC; 16,462-17,162 bp), and two inverted repeats (IRs; 4,288-4,413 bp). A total of 107 unique genes were identified from the three chloroplast genomes, including 70 protein-coding genes, 33 tRNAs, and 4 rRNAs. Compared with other angiosperms, the three chloroplast genomes were significantly reduced in overall length due to an unusual 16-24 kb shrinkage of IR regions and loss of the 11 genes which encoded subunits of NADH dehydrogenase. Genome-wide comparisons revealed similarities and variations between the three species and others. Phylogenetic analysis based on the three chloroplast genomes supported the opinion that Zygophyllaceae belonged to Zygophyllales in Fabids, and Z. xanthoxylon and Z. fabago belonged to Zygophyllum. The genome-wide comparisons revealed the similarity and variations between the chloroplast genomes of the three Zygophylloideae species and other plant species. This study provides a valuable molecular biology evidence for further studies of phylogenetic status of Zygophyllaceae.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35108324 PMCID: PMC8809528 DOI: 10.1371/journal.pone.0263253
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The complete chloroplast genome of T. mongolica (a), Z. xanthoxylon (b) and Z. fabago (c).
The predicted genes are displayed and colors indicate functional classifications in the legend. The genes outside the circle are transcribed clockwise, whereas those inside the circle are transcribed counterclockwise. The inner gray circle describes the GC content. The large single copy (LSC), small single copy (SSC) and inverted repeat (IR) regions are marked in the inner circle.
Genes identified from the chloroplast genomes of T. mongolica, Z. xanthoxylon and Z. fabago.
| Category | Function | Gene names |
|---|---|---|
| Transcription and translation | Ribosomal proteins (LSU) | |
| Ribosomal proteins (SSU) | ||
| RNA polymerase | ||
| Ribosomal RNAs | ||
| tRNA genes | ||
| Photosynthesis | ATP synthase | |
| Cytochrome b/f complex | ||
| Photosystem I | ||
| Photosystem II | ||
| Other genes | Maturase |
|
| Envelop membrane protein |
| |
| Subunit Acetyl-CoA-Carboxylate |
| |
| c-type cytochrome synthesis gene |
| |
| ATP-dependent protease subunit gene |
| |
| Unknown | Proteins of unknown function |
* Duplicate genes.
Summary of major features of the three chloroplast genomes.
| Item | |||
|---|---|---|---|
| Total length (bp) | 106,081 | 105,423 | 104,984 |
| LSC length (bp) | 80,291 | 79,877 | 79,696 |
| SSC length (bp) | 17,162 | 16,970 | 16,462 |
| IR length (bp) | 4,315 | 4,288 | 4,413 |
| GC (%) | 33.70 | 34.06 | 36.00 |
| Total number of gene | 107 | 107 | 107 |
| No. of protein-coding genes | 70 | 70 | 70 |
| No. of rRNA | 4 | 4 | 4 |
| No. of tRNA | 33 | 33 | 33 |
Fig 2Comparison of sequencing results of PCR products among chloroplast genomes of tobacco, T. mongolica, Z. xanthoxylon, and Z. fabago to show the loss of ndh genes.
Tobacco chloroplast genome as the reference. (a) Comparison of the fragment psaC-ndhE-ndhG-ndhI-ndhA-ndhH-rps15 in tobacco and corresponding regions in the chloroplast genomes of T. mongolica, Z. xanthoxylon, and Z. fabago. (b) Comparison of the fragment rps7-ndhB-trnL-CAA in tobacco and corresponding regions in chloroplast genomes of T. mongolica, Z. xanthoxylon, and Z. fabago. The screenshots indicated by the blue arrows were excerpts from the PCR sequencing results of each gene.
Intron-containing genes in chloroplast genomes of T. mongolica, Z. xanthoxylon and Z. fabago.
| Gene | Location | Intron length in | Intron length in | Intron length in |
|---|---|---|---|---|
|
| SSC | 2551 | 2544 | 2551 |
|
| SSC | 691 | 695 | 682 |
|
| SSC | 760 | 723 | 650 |
|
| SSC | 814 | 774 | 784 |
|
| SSC | 752, 757 | 736, 743 | 786, 747 |
|
| SSC | 460 | 455 | 465 |
|
| SSC | 603 | 621 | 612 |
|
| SSC | 866 | 823 | 837 |
|
| SSC | 749 | 793 | 791 |
|
| SSC | 781 | 761 | 772 |
|
| SSC | 943 | 855 | 940 |
|
| IRB | 668 | 668 | 630 |
|
| SSC | 712 | 716 | 716 |
|
| SSC | 845 | 753 | 834 |
|
| IRB | 668 | 671 | 666 |
* The gene has two introns.
Types and numbers of SSRs in chloroplast genomes of T. mongolica, Z. xanthoxylon and Z. fabago.
| SSRs type | Repeat unit | Number in | Number in | Number in |
|---|---|---|---|---|
| Mono | A/T | 75 | 61 | 76 |
| Di | AT/TA | 1 | 4 | 2 |
| Tri | -- | 0 | 0 | 0 |
| Tetra | -- | 0 | 0 | 0 |
| penta | -- | 0 | 0 | 0 |
| hexa | -- | 0 | 0 | 0 |
| total | -- | 76 | 65 | 78 |
The summary of SSRs distribution in different regions of three chloroplast genomes.
| Species | LSC | IRA | SSC | IRB |
|---|---|---|---|---|
| 64 | 1 | 10 | 1 | |
| 52 | 3 | 7 | 3 | |
| 66 | 3 | 6 | 3 |
Fig 3Long repeat sequences in chloroplast genomes of T. mongolica, Z. xanthoxylon, and Z. fabago.
(a) Number of long repeats; (b) Number of different long repeats types; (c) Sequence length of long repeats.
Fig 4The nucleotide diversity values (Pi) of all regions.
Regions with Pi = 0 are excluded and are not exhibited in the figure. The black starts show 8 polymorphic loci with the length>300 bp and Pi>0.1. The X-axis indicates chloroplast regions and the y-axis nucleotide diversity values (Pi).
Polymorphic loci identified based on comparative analysis of chloroplast genomes of T. mongolica, Z. xanthoxylon, and Z. fabago.
| Serial number | Region | Nucleotide diversity | Number of polymorphic sites | Alignment length | Conserved length |
|---|---|---|---|---|---|
| 1 |
| 0.28721 | 204 | 729 | 477 |
| 2 |
| 0.23601 | 179 | 871 | 524 |
| 3 |
| 0.18088 | 105 | 777 | 387 |
| 4 |
| 0.17105 | 130 | 918 | 760 |
| 5 |
| 0.14917 | 80 | 536 | 362 |
| 6 |
| 0.10503 | 103 | 1065 | 676 |
| 7 |
| 0.10490 | 180 | 2775 | 1716 |
| 8 |
| 0.10426 | 136 | 1114 | 908 |
Fig 5Proportion of codon preference in chloroplast genome of T. mongolica, Z. xanthoxylon and Z. fabago.
Fig 6Codon content of 20 amino acids and stop codons in all protein-coding genes of chloroplast genomes of T. mongolica, Z. xanthoxylon, and Z. fabago.
Fig 7Gene order comparison of nine chloroplast genomes (A. carambola, L. usitatissimum, E. novogranatense, G. maderense, E. carvifolium, T. mongolica, Z. xanthoxylon and Z. fabago).
A. trichopoda chloroplast genome as reference, utilizing MAUVE software. The boxes above the line indicate the gene sequences in clockwise direction and the boxes below the line indicate gene sequences in counterclockwise direction.
Fig 8Comparison of the three chloroplast genomes utilizing T. terrestris as reference.
Gray arrows and thick black lines above the alignment represent gene direction. Purple bars show exons, sky-blue bars show transfer RNA (tRNA) and ribosomal RNA (rRNA), red bars show non-coding sequences (CNS) and white peaks show the differences between chloroplast genomes. The y-axis indicates the identity percentage ranging from 50 to 100%.
Fig 9The phylogenetic tree of the sequences from 69 species, using Maximum Likelihood (ML) based on concatenated sequences of 50 genes implemented in MEGA X.
A. trichopoda was set as the outgroup. Bootstrap supports were calculated from 1000 replicates. ▲ Represent the bootstrap value. ★ Represent the missing evolutionary branch of the 11 gene.
Fig 10IR expansion and contraction in three chloroplast genomes.
Gene names are shown in boxes, and genes lengths in the corresponding regions are marked above the boxes. These features are not to scale.