| Literature DB >> 35547259 |
Shuai Guo1,2, Xuejiao Liao1,2, Shiyu Chen1,2, Baosheng Liao2, Yiming Guo3, Ruiyang Cheng2, Shuiming Xiao2, Haoyu Hu2, Jun Chen4, Jin Pei1, Yangjin Chen5, Jiang Xu2, Shilin Chen1,2.
Abstract
Polygonum is a generalized genus of the Polygonaceae family that includes various herbaceous plants. In order to provide aid in understanding the evolutionary and phylogenetic relationship in Polygonum at the chloroplast (cp) genome-scale level, we sequenced and annotated the complete chloroplast genomes of four Polygonum species using next-generation sequencing technology and CpGAVAS. Then, repeat sequences, IR contractions, and expansion and transformation sites of chloroplast genomes of four Polygonum species were studied, and a phylogenetic tree was built using the chloroplast genomes of Polygonum. The results indicated that the chloroplast genome construction of Polygonum also displayed characteristic four types of results, comparable to the published chloroplast genome of recorded angiosperms. The chloroplast genomes of the four Polygonum plants are highly consistent in genome size (159,015 bp-163,461 bp), number of genes (112 genes, including 78 protein-coding genes, 30 tRNA genes, and 4 rRNA genes), gene types, gene order, codon usage, and repeat sequence distribution, which identifies the high preservation among the Polygonum chloroplast genomes. The Polygonum phylogenetic tree was recreated by a full sequence of the chloroplast genome, which illustrates that the P. bistorta, P. orientale, and P. perfoliatum are divided into the same branch, and P. aviculare belongs to Fallopia. The precise system site of lots base parts requires further verification, but the study would provide a basis for developing the available genetic resources and evolutionary relationships of Polygonum.Entities:
Keywords: Polygonum; comparative analysis; complete chloroplast genome; phylogenetic analysis; repeats analysis
Year: 2022 PMID: 35547259 PMCID: PMC9084321 DOI: 10.3389/fgene.2022.764534
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Chloroplast genome features of Polygonum.
| Species | All | LSC | SSC | IR | Accession numbers | ||||
|---|---|---|---|---|---|---|---|---|---|
| Length (bp) | GC% | Length (bp) | GC% | Length (bp) | GC% | Length (bp) | GC% | ||
|
| 163,461 | 37.43 | 88,021 | 35.48 | 13,306 | 32.46 | 31,067 | 41.27 | MZ748474 |
|
| 159,476 | 37.37 | 84,360 | 36.07 | 12,968 | 32.99 | 31,074 | 41.35 | MZ748475 |
|
| 159,015 | 38.21 | 83,583 | 36.59 | 13,154 | 33.19 | 31,139 | 41.45 | MZ748476 |
|
| 160,680 | 37.96 | 85,384 | 36.20 | 12,928 | 33.09 | 31,184 | 41.39 | MZ748477 |
FIGURE 1Gene map of the Polygonum chloroplast genome. Genes drawn inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. Genes belonging to different functional groups are color coded. The darker gray in the inner circle corresponds to DNA G + C content, while the lighter gray corresponds to A + T content.
Gene composition of the chloroplast genome of Polygonum.
| Category | Group of genes | Name of genes |
|---|---|---|
| Self-replication | Large subunit of ribosomal proteins |
|
| Small subunit of ribosomal proteins |
| |
| DNA-dependent RNA polymerase |
| |
| rRNA genes |
| |
| tRNA genes |
| |
| Photosynthesis | Photosystem I |
|
| Photosystem II |
| |
| NADH oxidoreductase |
| |
| Cytochrome b6/f complex |
| |
| ATP synthase |
| |
| Rubisco |
| |
| Other genes | Maturase |
|
| Protease |
| |
| Envelope membrane protein |
| |
| Subunit acetyl-CoA-carboxylase |
| |
| c-type cytochrome synthesis gene |
| |
| Conserved open reading frames |
|
Genes containing introns.
Duplicated gene (genes present in the IR regions).
Length of exons and introns in four Polygonum chloroplast genomes.
| Gene | Region | Exon 1 | Intron 1 | Exon 2 | Intron 2 | Exon 3 | |
|---|---|---|---|---|---|---|---|
|
|
| LSC | 411 | 770 | 144 | — | — |
|
| IR | 327 | 576 | 291 | 915 | 69 | |
|
| SSC | 541 | 1099 | 551 | — | — | |
|
| IR | 756 | 675 | 777 | — | — | |
|
| IR | 6 | 769 | 642 | — | — | |
|
| IR | 8 | 759 | 475 | — | — | |
|
| IR | 399 | 931 | 9 | — | — | |
|
| IR | 435 | 662 | 393 | — | — | |
|
| LSC | 1613 | 780 | 430 | — | — | |
|
| IR | 114 | 27587 | 232 | 527 | 26 | |
|
| IR | 38 | 801 | 35 | — | — | |
|
| IR | 42 | 945 | 35 | — | — | |
|
| LSC | 35 | 2510 | 37 | — | — | |
|
| LSC | 37 | 579 | 50 | — | — | |
|
| LSC | 37 | 583 | 38 | — | — | |
|
| LSC | 155 | 744 | 227 | 739 | 128 | |
|
|
| LSC | 411 | 770 | 144 | — | — |
|
| LSC | 327 | 576 | 291 | 915 | 69 | |
|
| SSC | 541 | 1099 | 551 | — | — | |
|
| IR | 756 | 674 | 777 | — | — | |
|
| LSC | 6 | 769 | 642 | — | — | |
|
| LSC | 8 | 759 | 475 | — | — | |
|
| LSC | 399 | 931 | 9 | — | — | |
|
| IR | 435 | 662 | 393 | — | — | |
|
| LSC | 1613 | 780 | 430 | — | — | |
|
| LSC+IR | 114 | 75184 | 232 | 527 | 26 | |
|
| IR | 38 | 801 | 35 | — | — | |
|
| IR | 42 | 945 | 35 | — | — | |
|
| IR | 35 | 2510 | 37 | — | — | |
|
| LSC | 37 | 579 | 50 | — | — | |
|
| LSC | 37 | 583 | 38 | — | — | |
|
| IR | 155 | 744 | 227 | 739 | 128 | |
|
|
| LSC | 411 | 770 | 144 | — | — |
|
| LSC | 327 | 576 | 291 | 915 | 69 | |
|
| SSC | 541 | 559 | 551 | — | — | |
|
| IR | 777 | 675 | 756 | — | — | |
|
| LSC | 6 | 796 | 642 | — | — | |
|
| LSC | 8 | 759 | 475 | — | — | |
|
| LSC | 399 | 931 | 9 | — | — | |
|
| LSC | 1613 | 780 | 430 | — | — | |
|
| LSC+IR | 27 | 527 | 231 | — | — | |
|
| LSC | 114 | 75184 | 231 | 527 | 27 | |
|
| IR | 38 | 801 | 35 | — | — | |
|
| IR | 42 | 945 | 35 | — | — | |
|
| LSC | 35 | 2510 | 37 | — | — | |
|
| LSC | 37 | 579 | 50 | — | — | |
|
| LSC | 37 | 583 | 38 | — | — | |
|
| LSC | 155 | 744 | 227 | 739 | 128 | |
|
|
| LSC | 411 | 770 | 144 | — | — |
|
| LSC | 327 | 576 | 291 | 915 | 69 | |
|
| SSC | 541 | 1099 | 551 | — | — | |
|
| IR | 756 | 675 | 777 | — | — | |
|
| LSC | 6 | 769 | 642 | — | — | |
|
| LSC | 8 | 759 | 475 | — | — | |
|
| LSC | 399 | 931 | 9 | — | — | |
|
| IR | 435 | 662 | 393 | — | — | |
|
| LSC | 1613 | 780 | 430 | — | — | |
|
| IR | 114 | 75080 | 232 | 527 | 26 | |
|
| LSC | 223 | 863 | 44 | — | — | |
|
| IR | 38 | 801 | 35 | — | — | |
|
| IR | 42 | 945 | 35 | — | — | |
|
| LSC | 35 | 2510 | 37 | — | — | |
|
| LSC | 37 | 583 | 38 | — | — | |
|
| LSC | 155 | 744 | 227 | 739 | 128 |
Codon–anticodon recognition patterns and codon usage of four Polygonum chloroplast genomes
| AA | Codon | RSCU value | |||
|---|---|---|---|---|---|
|
|
|
|
| ||
| Stop | UAA | 1.66 | 1.64 | 1.64 | 1.66 |
| UAG | 0.76 | 0.73 | 0.75 | 0.67 | |
| UGA | 0.59 | 0.63 | 0.61 | 0.67 | |
| Ala | GCA | 1.13 | 1.11 | 1.13 | 1.13 |
| GCC | 0.7 | 0.74 | 0.72 | 0.68 | |
| GCG | 0.46 | 0.52 | 0.51 | 0.52 | |
| GCU | 1.7 | 1.64 | 1.64 | 1.68 | |
| Cys | UGC | 0.52 | 0.54 | 0.59 | 0.55 |
| UGU | 1.48 | 1.46 | 1.41 | 1.45 | |
| Asp | GAC | 0.43 | 0.43 | 0.41 | 0.41 |
| GAU | 1.57 | 1.57 | 1.59 | 1.59 | |
| Glu | GAA | 1.5 | 1.46 | 1.46 | 1.46 |
| GAG | 0.5 | 0.54 | 0.54 | 0.54 | |
| Phe | UUC | 0.66 | 0.67 | 0.68 | 0.65 |
| UUU | 1.34 | 1.33 | 1.32 | 1.35 | |
| Gly | GGA | 1.54 | 1.56 | 1.53 | 1.57 |
| GGC | 0.5 | 0.48 | 0.51 | 0.48 | |
| GGG | 0.74 | 0.74 | 0.73 | 0.7 | |
| GGU | 1.21 | 1.22 | 1.23 | 1.25 | |
| His | CAC | 0.49 | 0.47 | 0.48 | 0.5 |
| CAU | 1.51 | 1.53 | 1.52 | 1.5 | |
| Ile | AUA | 0.93 | 0.97 | 0.96 | 0.95 |
| AUC | 0.56 | 0.55 | 0.55 | 0.54 | |
| AUU | 1.51 | 1.48 | 1.49 | 1.51 | |
| Lys | AAA | 1.49 | 1.48 | 1.48 | 1.49 |
| AAG | 0.51 | 0.52 | 0.52 | 0.51 | |
| Leu | CUA | 0.85 | 0.85 | 0.83 | 0.84 |
| CUC | 0.43 | 0.45 | 0.42 | 0.42 | |
| CUG | 0.38 | 0.38 | 0.4 | 0.39 | |
| CUU | 1.29 | 1.26 | 1.31 | 1.28 | |
| UUA | 1.85 | 1.84 | 1.77 | 1.83 | |
| UUG | 1.21 | 1.22 | 1.27 | 1.24 | |
| Met | AUG | 1 | 1 | 1 | 1 |
| Asn | AAC | 0.51 | 0.46 | 0.5 | 0.51 |
| AAU | 1.49 | 1.54 | 1.5 | 1.49 | |
| Pro | CCA | 1.04 | 1.12 | 1.07 | 1.04 |
| CCC | 0.76 | 0.78 | 0.82 | 0.79 | |
| CCG | 0.69 | 0.61 | 0.6 | 0.62 | |
| CCU | 1.51 | 1.49 | 1.51 | 1.55 | |
| Gln | CAA | 1.52 | 1.51 | 1.5 | 1.51 |
| CAG | 0.48 | 0.49 | 0.5 | 0.49 | |
| Arg | AGA | 1.68 | 1.76 | 1.69 | 1.71 |
| AGG | 0.77 | 0.74 | 0.8 | 0.76 | |
| CGA | 1.43 | 1.42 | 1.43 | 1.4 | |
| CGC | 0.36 | 0.4 | 0.36 | 0.37 | |
| CGG | 0.47 | 0.42 | 0.47 | 0.48 | |
| CGU | 1.29 | 1.27 | 1.26 | 1.29 | |
| Ser | AGC | 0.42 | 0.42 | 0.42 | 0.44 |
| AGU | 1.17 | 1.2 | 1.16 | 1.16 | |
| UCA | 1.15 | 1.14 | 1.16 | 1.17 | |
| UCC | 0.97 | 1.03 | 0.99 | 0.97 | |
| UCG | 0.63 | 0.61 | 0.6 | 0.58 | |
| UCU | 1.66 | 1.6 | 1.66 | 1.67 | |
| Thr | ACA | 1.23 | 1.23 | 1.27 | 1.25 |
| ACC | 0.73 | 0.73 | 0.7 | 0.7 | |
| ACG | 0.51 | 0.55 | 0.52 | 0.53 | |
| ACU | 1.53 | 1.49 | 1.51 | 1.52 | |
| Val | GUA | 1.46 | 1.43 | 1.44 | 1.46 |
| GUC | 0.56 | 0.57 | 0.6 | 0.58 | |
| GUG | 0.54 | 0.55 | 0.54 | 0.53 | |
| GUU | 1.44 | 1.45 | 1.42 | 1.43 | |
| Trp | UGG | 1 | 1 | 1 | 1 |
| Tyr | UAC | 0.38 | 0.39 | 0.41 | 0.41 |
| UAU | 1.62 | 1.61 | 1.59 | 1.59 | |
FIGURE 2Repeat sequences analysis of the four Polygonum cp genomes. (A) Repeat types in the four cp genomes; (B) tandem repeats in the four cp genomes; (C) forward repeats in the four cp genomes; (D) palindromic repeats in the four cp genomes. In (A), different colors show different repeat types; in (B–D), different colors show different lengths. The ordinate represents the number of repeats.
FIGURE 3SSR analysis of four cp genomes. The ordinate represents the number of SSRs.
FIGURE 4Comparative analysis of chloroplast genome differences in six Polygonum cp genomes. Gray arrows and thick black lines above the alignment indicate gene orientation. Purple bars represent exons, blue bars represent untranslated regions (UTRs), pink bars represent non-coding sequences (CNS), and gray bars represent mRNA. The y-axis represents the percentage identity (shown: 50–100%).
FIGURE 5Nucleotide diversity (Pi) among cp genomes of six Polygonum species.
FIGURE 6Comparison of the chloroplast genome boundaries in four Polygonum cp genomes.
FIGURE 7ML phylogenetic tree reconstruction containing the cp genomes of 19 plants. Chrysanthemum x morifolium was set as the out-group.