| Literature DB >> 31409043 |
Yingxian Cui1,2, Liping Nie1,2, Wei Sun3, Zhichao Xu1,2, Yu Wang1,2, Jing Yu1, Jingyuan Song1,2, Hui Yao4,5.
Abstract
Zingiber officinale, commonly known as ginger, is an important plant of the family Zingiberaceae and is widely used as an herbal medicine and condiment. The lack of chloroplast genomic information hinders molecular research and phylogenetic analysis on ginger. We introduced the complete chloroplast genome of Z. officinale and identified its phylogenetic position in Zingiberaceae. The chloroplast genome of Z. officinale is 162,621 bp with a four-part circular structure and 36.1% GC content. All 113 unique genes were annotated. A total of 78 simple sequence repeats (SSRs) and 42 long repeat sequences, which are potential areas for species authentication, were found. Comparative analysis revealed some highly variable regions, including rps16-trnQ-UUG, atpH-atpI, trnT-UGU-trnL-UAA, ycf1, and psaC-ndhE. Moreover, the small single-copy (SSC) region was the most variable region in all four shared regions, indicating that it may be undergoing rapid nucleotide substitution in the family Zingiberaceae. Phylogenetic analysis based on all available chloroplasts of Zingiberales in the National Center for Biotechnology Information indicated that Zingiber is a sister branch to Kaempferia species. The availability of the Z. officinale chloroplast genome provided invaluable data for species-level authentication and phylogenetic analysis and can thus benefit further investigations on species in the family Zingiberaceae.Entities:
Keywords: Zingiber officinale; Zingiberaceae; comparative analysis; complete chloroplast genome; phylogenetic analysis
Year: 2019 PMID: 31409043 PMCID: PMC6724139 DOI: 10.3390/plants8080283
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Base composition of the Z. officinale chloroplast genome.
| Region | Positions | T(U) (%) | C (%) | A (%) | G (%) | Length (bp) |
|---|---|---|---|---|---|---|
| Total | 32.2 | 18.4 | 31.7 | 17.7 | 162,621 | |
| IRa | 30.1 | 21.3 | 28.8 | 19.8 | 29,779 | |
| IRb | 28.8 | 19.8 | 30.1 | 21.3 | 29,779 | |
| LSC | 33.7 | 17.4 | 32.4 | 16.5 | 87,486 | |
| SSC | 34.3 | 15.8 | 36.0 | 13.9 | 15,577 | |
| CDS | 31.7 | 17.2 | 31.2 | 19.9 | 79,167 | |
| 1st position | 24 | 18.3 | 31.3 | 26.5 | 26,389 | |
| 2nd position | 33 | 20.1 | 29.7 | 17.7 | 26,389 | |
| 3rd position | 39 | 13.2 | 32.6 | 15.6 | 26,389 |
Figure 1Gene map of the Zingiber officinale complete chloroplast genome. Genes that are inside and outside the circle are transcribed clockwise and counterclockwise, respectively. The darker gray area in the inner circle corresponds to GC content, whereas the lighter gray area corresponds to AT content.
Gene content of the Z. officinale chloroplast genome.
| Group of Genes | Gene Names | Amount |
|---|---|---|
| Photosystem I | 5 | |
| Photosystem II | 15 | |
| Cytochrome b/f complex | 6 | |
| ATP synthase | 6 | |
| NADH dehydrogenase | 12 | |
| RubisCO large subunit |
| 1 |
| RNA polymerase | 4 | |
| Ribosomal proteins (SSU) | 15 | |
| Ribosomal proteins (LSU) | 11 | |
| Proteins of unknown function | 6 | |
| Transfer RNAs | 38 | 38 |
| Ribosomal RNAs | 8 | |
| Other genes | 6 |
(×2) indicates that the number of the repeating unit is 2; * indicates introns of genes.
Figure 2Codon content of all protein-coding genes in the Z. officinale chloroplast genome.
Types and amounts of simple sequence repeats (SSRs) in the Z. officinale chloroplast genome.
| SSR Type | Repeat Unit | Amount | Ratio (%) |
|---|---|---|---|
| Mono | A/T | 27 | 100.0 |
| Di | AG/CT | 2 | 8.7 |
| AT/TA | 21 | 91.3 | |
| Tri | AAG/CTT | 1 | 20.0 |
| AAT/ATT | 4 | 80.0 | |
| Tetra | AAAC/GTTT | 1 | 4.8 |
| AAAG/CTTT | 4 | 19.0 | |
| AAAT/ATTT | 13 | 61.9 | |
| AACT/AGTT | 1 | 4.8 | |
| AATG/ATTC | 2 | 9.5 | |
| Penta | AAAAT/ATTTT | 1 | 50.0 |
| AATAG/ATTCT | 1 | 50.0 |
Figure 3Repeat sequences in the Z. officinale chloroplast genome. F, forward; P, palindrome; R, reverse; and C, complement.
Figure 4Comparison of the complete chloroplast genomes of seven species in the family Zingiberaceae using mVISTA, with Z. officinale as a reference. Blue block, conserved genes; sky-blue block, transfer RNA (tRNA) and ribosomal RNA (rRNA); and red block, conserved noncoding sequences (CNS). White represents regions with sequence variation among the seven species.
Figure 5Sliding window analysis of the Z. officinale chloroplast genome. The red line indicates the comparison of 10 species in the family Zingiberaceae (Z. officinale, Z. spectabile, K. elegans, K. galangal, C. longa, C. flaviflora, C. roscoeana, Amomum compactum, A. krervanh, and Alpinia oxyphylla); the blue line indicates the comparison of four species in the family Zingiberaceae (Z. officinale, Z. spectabile, K. elegans, and K. galang). Window length: 800 bp; step size 200 bp. x axis: Position of the midpoint of a window. y axis: nucleotide diversity of each window.
Figure 6Likelihood (ML) phylogenetic tree of 19 species in order Zingiberales based on the concatenated sequences of 75 shared protein-coding genes of chloroplast genomes. Numbers at nodes are values for bootstrap support. Dioscorea polystachya and D. collettii were used as outgroups.