| Literature DB >> 32735595 |
Dong-Mei Li1, Yuan-Jun Ye1, Ye-Chun Xu1, Jin-Mei Liu1, Gen-Fa Zhu1.
Abstract
Zingiber montanum (Z. montanum) and Zingiber zerumbet (Z. zerumbet) are important medicinal and ornamental herbs in the genus Zingiber and family Zingiberaceae. Chloroplast-derived markers are useful for species identification and phylogenetic studies, but further development is warranted for these two Zingiber species. In this study, we report the complete chloroplast genomes of Z. montanum and Z. zerumbet, which had lengths of 164,464 bp and 163,589 bp, respectively. These genomes had typical quadripartite structures with a large single copy (LSC, 87,856-89,161 bp), a small single copy (SSC, 15,803-15,642 bp), and a pair of inverted repeats (IRa and IRb, 29,393-30,449 bp). We identified 111 unique genes in each chloroplast genome, including 79 protein-coding genes, 28 tRNAs and 4 rRNA genes. We analyzed the molecular structures, gene information, amino acid frequencies, codon usage patterns, RNA editing sites, simple sequence repeats (SSRs) and long repeats from the two chloroplast genomes. A comparison of the Z. montanum and Z. zerumbet chloroplast genomes detected 489 single-nucleotide polymorphisms (SNPs) and 172 insertions/deletions (indels). Thirteen highly divergent regions, including ycf1, rps19, rps18-rpl20, accD-psaI, psaC-ndhE, psbA-trnK-UUU, trnfM-CAU-rps14, trnE-UUC-trnT-UGU, ccsA-ndhD, psbC-trnS-UGA, start-psbA, petA-psbJ, and rbcL-accD, were identified and might be useful for future species identification and phylogeny in the genus Zingiber. Positive selection was observed for ATP synthase (atpA and atpB), RNA polymerase (rpoA), small subunit ribosomal protein (rps3) and other protein-coding genes (accD, clpP, ycf1, and ycf2) based on the Ka/Ks ratios. Additionally, chloroplast SNP-based phylogeny analyses found that Zingiber was a monophyletic sister branch to Kaempferia and that chloroplast SNPs could be used to identify Zingiber species. The genome resources in our study provide valuable information for the identification and phylogenetic analysis of the genus Zingiber and family Zingiberaceae.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32735595 PMCID: PMC7394419 DOI: 10.1371/journal.pone.0236590
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Circular gene map of the chloroplast genomes of two Zingiber species.
The gray arrowheads indicate the direction of the genes. Genes shown inside the circle are transcribed clockwise, and those outside the circle are transcribed counterclockwise. Different genes are color coded. The innermost darker gray corresponds to GC content, whereas the lighter gray corresponds to AT content. IR, inverted repeat; LSC, large single copy region; SSC, small single copy region.
Characteristics of the chloroplast genomes of ten Zingiberaceae species.
| Genome characteristics | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| MK262727 | MK262726 | NC_044775 | MK209001 | MK209002 | MK262734 | MK262732 | MK262736 | MK262725 | MK262730 | |
| 164,464 | 163,589 | 162,621 | 163,811 | 163,555 | 162,135 | 162,176 | 163,949 | 163,300 | 163,608 | |
| 87,856 | 89,161 | 87,486 | 88,405 | 88,020 | 86,966 | 86,984 | 88,581 | 87,498 | 88,680 | |
| 15,803 | 15,642 | 15,577 | 15,812 | 15,989 | 15,737 | 15,694 | 15,808 | 15,568 | 15,288 | |
| 30,356/30,449 | 29,393 | 29,779 | 29,797 | 29,773 | 29,716 | 29,749 | 29,780 | 30,117 | 29,820 | |
| 141(111) | 141(111) | 133(113) | 133(111) | 133(113) | 141 (111) | 141 (111) | 141(111) | 141(111) | 133(111) | |
| 87(79) | 87(79) | 87(79) | 87(79) | 87(79) | 87 (79) | 87 (79) | 87(79) | 87(79) | 87(79) | |
| 46(28) | 46(28) | 38(30) | 38(28) | 38(30) | 46 (28) | 46 (28) | 46(28) | 46(28) | 38(28) | |
| 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | |
| 35.75 | 36.27 | 36.10 | 36.10 | 36.10 | 36.20 | 36.21 | 36.09 | 36.00 | 36.08 | |
| 36.72 | 36.95 | 37.10 | 36.90 | 37.20 | 36.94 | 36.92 | 36.96 | 36.85 | 36.91 | |
| 33.63 | 34.31 | 33.80 | 33.90 | 33.90 | 34.02 | 34.00 | 33.85 | 33.78 | 33.71 | |
| 29.24 | 29.64 | 29.70 | 29.50 | 29.40 | 29.60 | 29.66 | 29.53 | 29.59 | 30.06 | |
| 40.51/40.46 | 41.02 | 41.10 | 41.00 | 41.10 | 41.14 | 41.16 | 41.15 | 40.89 | 41.14 | |
| 19 | 19 | 20 | 18 | 17 | 18 | 18 | 18 | 17 | 18 | |
| 61 | 61 | 60 | 61 | 61 | 61 | 61 | 61 | 61 | 61 | |
| 12 | 12 | 11 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | |
| 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | |
| 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | |
| 40(20) | 40(20) | 40(20) | 40(20) | 40(20) | 40(20) | 40(20) | 40(20) | 40(20) | 40(20) |
CDS, protein-coding genes; LSC, large single copy region; SSC, small single copy region; IR, inverted repeat.
Genes present in the chloroplast genomes of Z. montanum and Z. zerumbet.
| Category | Function | Genes |
|---|---|---|
| Photosystem Ⅰ | ||
| Photosystem Ⅱ | ||
| Cytochrome b/f | ||
| ATP synthase | ||
| NADH dehydrogenase | ||
| Rubisco | ||
| RNA polymerase | ||
| Large subunit ribosomal proteins | ||
| Small subunit ribosomal proteins | ||
| Ribosomal RNAs | ||
| Transfer RNAs | ||
| Other proteins | ||
| Proteins of unknown function |
×2, Gene with two copies; ×4, Gene with four copies
*, Genes containing one intron
**, Genes containing two introns.
Fig 2Amino acid proportion in Z. montanum and Z. zerumbet protein-coding sequences.
Fig 3Predicted RNA editing sites of protein-coding genes in the chloroplast genomes of Z. montanum and Z. zerumbet.
Fig 4SNP and indel statistics for the Z. zerumbet chloroplast genome.
The Z. montamum chloroplast genome was used as the reference sequence for SNP and indel analyses. (A) Synonymous and nonsynonymous SNPs belonging to different protein-coding genes. The genes with zero SNP were not shown. (B) Insertion, deletion and total indel statistics. (C) Indels belonging to different protein-coding genes.
Fig 5Comparison of simple sequence repeats among four chloroplast genomes of Zingiber species.
(A) SSRs distribution between coding and noncoding regions detected in the four Zingiber species chloroplast genomes. (B) Frequencies of identified SSRs in LSC, SSC and IR regions. (C) Number of different SSR types detected in four Zingiber species chloroplast genomes. (D) Frequency of identified SSRs in different repeat class types.
Fig 6Analysis of long repeat sequences in the chloroplast genomes of the four Zingiber species.
(A) Total of four long repeat types; (B) frequency of palindromic repeats by length; (C) frequency of forward repeats by length; and (D) frequency of reverse repeats by length.
Fig 7Sequence alignment of the four Zingiber chloroplast genomes in mVISTA.
The chloroplast genome of Z. montanum was used as a reference. Gray arrows and thick black lines above the alignment indicate gene orientation. Purple bars represent exons, sky-blue bars represent transfer RNA (tRNA) and ribosomal RNA (rRNA) and red bars represent noncoding sequences (CNS). The horizontal axis indicates the coordinates within the chloroplast genome. The vertical scale represents the identity percentage ranging from 50% to 100%. White represents regions with sequence variation among the four species.
Fig 8Sliding window analysis of the whole chloroplast genomes among four Zingiber species.
Window length: 800 bp; step size: 200 bp. X-axis: position of the window midpoint.
Fig 9Phylogenetic relationships constructed with SNPs from 31 chloroplast genomes using the maximum likelihood method.
The bootstrap values were based on 1,000 replicates and are indicated next to the branches.