| Literature DB >> 35795352 |
Changjie Chen1, Yuhuan Miao1, Dandan Luo1, Jinxin Li1, Zixin Wang1, Ming Luo1, Tingting Zhao1, Dahui Liu1.
Abstract
Artemisia argyi Levl. et Van is an important Asteraceae species with a high medicinal value. There are abundant A. argyi germplasm resources in Asia, especially in China, but the evolutionary relationships of these varieties and the systematic localization of A. argyi in the family Asteraceae are still unclear. In this study, the chloroplast (cp) genomes of 72 A. argyi varieties were systematically analyzed. The 72 varieties originated from 47 regions in China at different longitudes, latitudes and altitudes, and included both wild and cultivated varieties. The A. argyi cp genome was found to be ∼151 kb in size and to contain 114 genes, including 82 protein-coding, 28 tRNA, and 4 rRNA genes. The number of short sequence repeats (SSRs) in A. argyi cp genomes ranged from 35 to 42, and most of them were mononucleotide A/T repeats. A total of 196 polymorphic sites were detected in the cp genomes of the 72 varieties. Phylogenetic analysis demonstrated that the genetic relationship between A. argyi varieties had a weak relationship with their geographical distribution. Furthermore, inverted repeat (IR) boundaries of 10 Artemisia species were found to be significantly different. A sequence divergence analysis of Asteraceae cp genomes showed that the variable regions were mostly located in single-copy (SC) regions and that the coding regions were more conserved than the non-coding regions. A phylogenetic tree was constructed using 43 protein-coding genes common to 67 Asteraceae species. The resulting tree was consistent with the traditional classification system; Artemisia species were clustered into one group, and A. argyi was shown to be closely related to Artemisia lactiflora and Artemisia montana. In summary, this study systematically analyzed the cp genome characteristics of A. argyi and compared cp genomes of Asteraceae species. The results provide valuable information for the definitive identification of A. argyi varieties and for the understanding of the evolutionary relationships between Asteraceae species.Entities:
Keywords: Artemisia argyi; Asteraceae; chloroplast genome; phylogenetic analysis; variety
Year: 2022 PMID: 35795352 PMCID: PMC9252292 DOI: 10.3389/fpls.2022.906725
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
FIGURE 1Collection locations for Artemisia argyi germplasm resources. The green symbols indicate collection points for Artemisia argyi varieties. There are 47 markers located in 15 domestic provinces in China. Scale bar = 20 km.
FIGURE 2Gene map of the Artemisia argyi chloroplast (cp) genome. Genes drawn inside the circle are transcribed in the clockwise direction, and those on the outside are transcribed in the counterclockwise direction. The genes are color-coded based on their function. In the inner circle, darker gray corresponds to GC content and lighter gray corresponds to AT content of the Artemisia argyi cp genome. The SSC and LSC regions are separated by two darker areas representing inverted repeat (IR) regions (IRa and IRb).
Base composition of Artemisia argyi cp genomes.
| Region | T (%) | C (%) | A (%) | G (%) | GC (%) | Length (bp) |
| LSC | 32.4 | 17.5 | 32 | 18.1 | 35.60 | 82862∼82935 |
| SSC | 34.1 | 16.1 | 35 | 14.7 | 30.80 | 18333∼18371 |
| IRa | 28.3 | 22.3 | 28.6 | 20.8 | 43.10 | 24959∼24968 |
| IRb | 28.6 | 20.8 | 28.3 | 22.3 | 43.10 | 24959∼24968 |
| Total | 31.3 | 18.7 | 31.2 | 18.8 | 37.50 | 151115∼151202 |
| CDS | 31.7∼31.8 | 17.8∼18.0 | 29.9∼30.5 | 20.0∼20.3 | 37.7∼38.3 | 71805 ∼78654 |
| CDS-1st | 24.7∼24.8 | 18.5∼18.8 | 30.3∼31.0 | 25.6∼26.5 | 44.3∼45.0 | 23935∼26218 |
| CDS-2nd | 32.4∼32.7 | 20.4∼19.9 | 28.6∼29.5 | 18.0∼18.4 | 37.9∼38.7 | 23935∼26218 |
| CDS-3rd | 37.9∼38.5 | 14.1∼15.0 | 30.6∼31.5 | 15.9∼16.5 | 30.0∼31.5 | 23935∼26218 |
Gene contents in the Artemisia argyi cp genome.
| Gene category | Gene group | Gene name | Gene number |
| Photosynthesis | Subunits of photosystem I | 7 | |
| Submits of photosystem II | 16 | ||
| Subunits of ATP synthase | 6 | ||
| Subunits of cytochrome b/f complex | 6 | ||
| Subunits of NADH dehydrogenase | 11 | ||
| Large subunit of rubisco |
| 1 | |
| Self-replication | Subunits of RNA polymerase | 4 | |
| Large ribosomal subunit | 10 | ||
| Small ribosomal subunit | 12 | ||
| Ribosomal RNAs | 4 | ||
| Transfer RNAs | * | 28 | |
| Biosynthesis | Translational initiation factor |
| 1 |
| c-type cytochrome synthesis gene |
| 1 | |
| Envelope membrane protein |
| 1 | |
| Subunit of Acetyl-CoA-carboxylase |
| 1 | |
| Protease |
| 1 | |
| Maturase |
| 1 | |
| Unknown function | Conserved open reading frames | 3 |
*Genes with two copies.
The length of exon and intron in intron-containing genes of the Artemisia argyi cp genome.
| Gene | Location | Size (bp) | ExonI (bp) | IntronI (bp) | ExonII (bp) | IntronII (bp) | ExonIII (bp) |
|
| LSC | 1254 | 145 | 699 | 410 | ||
|
| LSC | 1394 | 6 | 746 | 642 | ||
|
| LSC | 1427 | 9 | 1019 | 399 | ||
|
| LSC | 648 | 38 | 573 | 37 | ||
|
| LSC | 1808 | 69 | 799 | 291 | 610 | 39 |
|
| LSC | 2823 | 450 | 732 | 1641 | ||
|
| SSC | 2169 | 552 | 1077 | 540 | ||
|
| LSC | 505 | 37 | 418 | 50 | ||
|
| LSC | 2640 | 37 | 2568 | 35 | ||
|
| LSC | 1950 | 126 | 703 | 228 | 740 | 153 |
|
| LSC | 799 | 47 | 729 | 23 | ||
|
| LSC | 1101 | 40 | 864 | 197 | ||
|
| LSC | 1158 | 8 | 675 | 475 |
Codon usage and codon-anticodon recognition patterns of the Artemisia argyi cp genome.
| Amino acid | Codon | Number | RSCU (%) | Amino acid | Codon | Number | RSCU (%) |
| Phe (F) | UUU | 954 | 1.27 | Tyr (Y) | UAU | 765 | 1.53 |
| Phe (F) | UUC | 552 | 0.73 | Tyr (Y) | UAC | 233 | 0.47 |
| Leu (L) | UUA | 779 | 1.64 | Stop | UAA | 183 | 1.11 |
| Leu (L) | UUG | 598 | 1.26 | Stop | UAG | 158 | 0.96 |
| Leu (L) | CUU | 613 | 1.29 | His (H) | CAU | 494 | 1.44 |
| Leu (L) | CUC | 269 | 0.56 | His (H) | CAC | 192 | 0.56 |
| Leu (L) | CUA | 332 | 0.70 | Gln (Q) | CAA | 721 | 1.43 |
| Leu (L) | CUG | 267 | 0.56 | Gln (Q) | CAG | 287 | 0.57 |
| Ile (I) | AUU | 1071 | 1.40 | Asn (N) | AAU | 923 | 1.48 |
| Ile (I) | AUC | 532 | 0.70 | Asn (N) | AAC | 326 | 0.52 |
| Ile (I) | AUA | 692 | 0.90 | Lys (K) | AAA | 935 | 1.36 |
| Met (M) | AUG | 636 | 1.00 | Lys (K) | AAG | 441 | 0.64 |
| Val (V) | GUU | 559 | 1.39 | Asp (D) | GAU | 803 | 1.52 |
| Val (V) | GUC | 228 | 0.57 | Asp (D) | GAC | 257 | 0.48 |
| Val (V) | GUA | 568 | 1.41 | Glu (E) | GAA | 982 | 1.42 |
| Val (V) | GUG | 254 | 0.63 | Glu (E) | GAG | 399 | 0.58 |
| Ser (S) | UCU | 579 | 1.52 | Cys (C) | UGU | 270 | 1.27 |
| Ser (S) | UCA | 391 | 1.03 | Cys (C) | UGC | 155 | 0.73 |
| Ser (S) | UCA | 373 | 0.98 | Stop | UGA | 152 | 0.92 |
| Ser (S) | UCG | 230 | 0.61 | Trp (W) | UGG | 508 | 1 |
| Pro (P) | CCU | 467 | 1.42 | Arg (R) | CGU | 368 | 1.17 |
| Pro (P) | CCC | 253 | 0.77 | Arg (R) | CGC | 149 | 0.47 |
| Pro (P) | CCA | 367 | 1.12 | Arg (R) | CGA | 381 | 1.21 |
| Pro (P) | CCG | 228 | 0.69 | Arg (R) | CGG | 202 | 0.64 |
| Thr (T) | ACU | 541 | 1.52 | Arg (R) | AGA | 467 | 1.23 |
| Thr (T) | ACC | 313 | 0.88 | Arg (R) | AGG | 239 | 0.63 |
| Thr (T) | ACA | 391 | 1.10 | Ser (S) | AGU | 511 | 1.62 |
| Thr (T) | ACG | 181 | 0.51 | Ser (S) | AGC | 278 | 0.88 |
| Ala (A) | GCU | 660 | 1.66 | Gly (G) | GGU | 636 | 1.24 |
| Ala (A) | GCC | 273 | 0.69 | Gly (G) | GGC | 283 | 0.55 |
| Ala (A) | GCA | 434 | 1.09 | Gly (G) | GGA | 733 | 1.43 |
| Ala (A) | GCG | 225 | 0.57 | Gly (G) | GGG | 401 | 0.78 |
FIGURE 3Types and distributions of repeat sequences and short sequence repeats (SSRs) in Artemisia argyi chloroplast (cp) genomes. (A) Number and position of repeat sequences in four Artemisia argyi cp genomes (S07, S09, S29, and S71). (B) Proportion of SSRs in 72 Artemisia argyi cp genomes. (C) Distribution of repeats classified by type. (D) The number of SSR loci in different cp genome regions.
FIGURE 4Sliding window analysis of 72 complete chloroplast (cp) genomes of Artemisia argyi. The x-axis represents the midpoint of the window and the y-axis represents the nucleotide diversity (Pi) of each window. The window length is 600 bp with a 200-bp step size.
FIGURE 5Plant morphology of S07, S09, S29, and S71 (A) and three polymorphic sites in chloroplast (cp) genomes of these four varieties (B).
FIGURE 6Phylogenetic analyses of 72 Artemisia argyi varieties. The tree is constructed using the ML method based on full-length chloroplast (cp) genomes. A. annua is used as the outgroup. Branches are color-coded. Sample numbers marked with light orange shadow are wild germplasm resources. Bootstrap values above 70 are marked in the figure.
FIGURE 7Comparison of the large single-copy (LSC), small single-copy (SSC), and inverted repeat (IR) regional boundaries of ten Artemisia species chloroplast (cp) genomes. JSA denotes the junction between IRa and SSC; JLA denotes the junction between IRa and LSC; JLB denotes the junction between IRb and LSC; JSB denotes the junction between IRb and SSC.
FIGURE 8Full-length chloroplast (cp) genome alignment of Campanulales species with mVISTA. Using the cp genome of Artemisia argyi as a reference, the x-axis indicates the coordinates in cp genomes and the y-axis indicates the average percent identity, ranging from 50 to 100%. Arrows indicate gene orientation; cp genome regions are color-coded as exons, untranslated regions, conserved non-coding sequences, and mRNA.
FIGURE 9Phylogenetic tree constructed using 43 protein-coding genes common to 67 different Asteraceae species. Codonopsis minima and Platycodon grandiflorus from the family Campanulaceae are used as outgroup. The numbers above the branches represent the ML bootstrap values. Different tribes are color-coded. Non-Artemisia species of Anthemideae Cass are followed by gray circles, Artemisia species are followed by red circles, and Artemisia argyi is indicated with a red star.