| Literature DB >> 32365609 |
Swati Tyagi1, Jae-A Jung2, Jung Sun Kim1, So Youn Won1.
Abstract
Aster spathulifolius, a common ornamental and medicinal plant, is widely distributed in Korea and Japan, and is genetically classified into mainland and island types. Here, we sequenced the whole chloroplast genome of mainland A. spathulifolius and compared it with those of the island type and other Aster species. The chloroplast genome of mainland A. spathulifolius is 152,732 bp with a conserved quadripartite structure, has 37.28% guanine-cytosine (GC) content, and contains 114 non-redundant genes. Comparison of the chloroplast genomes between the two A. spathulifolius lines and the other Aster species revealed that their sequences, GC contents, gene contents and orders, and exon-intron structure were well conserved; however, differences were observed in their lengths, repeat sequences, and the contraction and expansion of the inverted repeats. The variations were mostly in the single-copy regions and non-coding regions, which, together with the detected simple sequence repeats, could be used for the development of molecular markers to distinguish between these plants. All Aster species clustered into a monophyletic group, but the chloroplast genome of mainland A. spathulifolius was more similar to the other Aster species than to that of the island A. spathulifolius. The accD and ndhF genes were detected to be under positive selection within the Aster lineage compared to other related taxa. The complete chloroplast genome of mainland A. spathulifolius presented in this study will be helpful for species identification and the analysis of the genetic diversity, evolution, and phylogenetic relationships in the Aster genus and the Asteraceae.Entities:
Keywords: Aster; Asteraceae; chloroplast genome; codon usage; intron loss; phylogenetic analysis
Year: 2020 PMID: 32365609 PMCID: PMC7285121 DOI: 10.3390/plants9050568
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Gene map of the mainland Aster spathulifolius chloroplast genome. Thick lines indicate the extent of the inverted repeat regions (IRa and IRb), which separate the genome into the small (SSC) and large (LSC) single-copy regions. Genes drawn inside the circle are transcribed clockwise, while those marked outside the circle are transcribed counterclockwise. Genes belonging to different functional groups are color coded. The dark gray in the inner circle corresponds to the GC content, while the light gray corresponds to the adenine-thymine (AT) content. Genes containing introns are marked with an asterisk. The map was drawn using OGDRAW.
Base composition and length of regions of the mainland Aster spathulifolius chloroplast genome.
| Category | T/U (%) | C (%) | A (%) | G (%) | Length (bp) |
|---|---|---|---|---|---|
| Genome | 31.53 | 18.37 | 31.19 | 18.91 | 152,732 |
| LSC | 32.61 | 17.27 | 32.23 | 17.88 | 84,552 |
| SSC | 34.87 | 14.82 | 33.81 | 16.50 | 18,268 |
| IRa | 28.62 | 20.79 | 28.32 | 22.28 | 24,956 |
| IRb | 28.32 | 22.28 | 28.62 | 20.79 | 24,956 |
| tRNA genes | 25.00 | 23.56 | 22.09 | 29.35 | 2784 |
| rRNA genes | 18.79 | 23.63 | 26.04 | 31.54 | 9048 |
| Protein-coding genes | 31.45 | 17.72 | 30.53 | 20.31 | 78,654 |
| 1st position | 23.82 | 18.87 | 30.51 | 26.80 | 26,218 |
| 2nd position | 32.58 | 20.31 | 29.30 | 17.81 | 26,218 |
| 3rd position | 37.93 | 13.98 | 31.78 | 16.31 | 26,218 |
List of genes in the mainland Aster spathulifolius chloroplast genome.
| Category | Group of Genes | Name of Genes |
|---|---|---|
| Self-replication | Large subunit of ribosomal proteins | |
| Small subunit of ribosomal proteins | ||
| DNA dependent RNA polymerase |
| |
| rRNA genes | ||
| tRNA genes | ||
| Photosynthesis | Photosystem I |
|
| Photosystem II |
| |
| NADH dehydrogenase | ||
| Cytochrome b6/f complex |
| |
| ATP synthase |
| |
| Rubisco |
| |
| Other genes | Translational initiation factor |
|
| Maturase |
| |
| Protease |
| |
| Envelop membrane protein |
| |
| Subunit acetyl-CoA carboxylase |
| |
| C-type cytochrome synthesis gene |
| |
| Unknown | Conserved open reading frame |
* Genes containing introns; ** Trans-spliced genes; (2X) indicates copy numbers.
Relative synonymous codon usage (RSCU), following the codon frequency in the protein-coding sequences of the mainland Aster spathulifolius chloroplast genome.
| Amino Acid | Codon | No | RSCU | tRNA | Amino Acid | Codon | No | RSCU | tRNA |
|---|---|---|---|---|---|---|---|---|---|
| Phe | UUU | 983 | 1.31 | Thr | ACU | 529 | 1.61 | ||
| Phe | UUC | 516 | 0.69 |
| Thr | ACC | 238 | 0.73 |
|
| Leu | UUA | 865 | 1.84 |
| Thr | ACA | 406 | 1.24 |
|
| Leu | UUG | 588 | 1.25 |
| Thr | ACG | 138 | 0.42 | |
| Leu | CUU | 607 | 1.29 | Ala | GCU | 624 | 1.74 | ||
| Leu | CUC | 194 | 0.41 | Ala | GCC | 236 | 0.66 | ||
| Leu | CUA | 379 | 0.81 |
| Ala | GCA | 408 | 1.14 |
|
| Leu | CUG | 182 | 0.39 | Ala | GCG | 166 | 0.46 | ||
| Ile | AUU | 1074 | 1.47 | Tyr | UAU | 802 | 1.64 | ||
| Ile | AUC | 427 | 0.58 |
| Tyr | UAC | 177 | 0.36 |
|
| Ile | AUA | 689 | 0.94 | STOP | UAA | 51 | 1.76 | ||
| Met | AUG | 635 | 1.00 |
| STOP | UAG | 21 | 0.72 | |
| Val | GUU | 511 | 1.46 | STOP | UGA | 15 | 0.52 | ||
| Val | GUC | 178 | 0.51 |
| His | CAU | 457 | 1.50 | |
| Val | GUA | 524 | 1.49 |
| His | CAC | 153 | 0.50 |
|
| Val | GUG | 191 | 0.54 | Gln | CAA | 727 | 1.53 |
| |
| Ser | UCU | 590 | 1.77 | Gln | CAG | 221 | 0.47 | ||
| Ser | UCC | 311 | 0.93 |
| Asn | AAU | 973 | 1.53 | |
| Ser | UCA | 403 | 1.21 |
| Asn | AAC | 298 | 0.47 |
|
| Ser | UCG | 169 | 0.51 | Lys | AAA | 1034 | 1.48 |
| |
| Ser | AGU | 401 | 1.21 | Lys | AAG | 366 | 0.52 | ||
| Ser | AGC | 122 | 0.37 |
| Asp | GAU | 851 | 1.61 | |
| Arg | AGA | 495 | 1.87 |
| Asp | GAC | 209 | 0.39 |
|
| Arg | AGG | 174 | 0.66 | Glu | GAA | 990 | 1.48 |
| |
| Arg | CGU | 353 | 1.33 |
| Glu | GAG | 352 | 0.52 | |
| Arg | CGC | 109 | 0.41 | Cys | UGU | 207 | 1.41 | ||
| Arg | CGA | 346 | 1.31 | Cys | UGC | 86 | 0.59 |
| |
| Arg | CGG | 113 | 0.43 | Trp | UGG | 462 | 1.00 |
| |
| Pro | CCU | 409 | 1.48 | Gly | GGU | 574 | 1.29 | ||
| Pro | CCC | 211 | 0.76 | Gly | GGC | 199 | 0.45 |
| |
| Pro | CCA | 316 | 1.14 |
| Gly | GGA | 684 | 1.53 |
|
| Pro | CCG | 170 | 0.61 | Gly | GGG | 329 | 0.74 |
Figure 2Predicted RNA-editing sites in the protein-coding genes in the mainland Aster spathulifolius chloroplast genome. Genes without RNA-editing sites are not plotted.
Summary of the complete chloroplast genomes of the six Aster species used in this study.
| Attributes | Mainland | Island |
|
|
|
|
|---|---|---|---|---|---|---|
| Size (bp) | 152,732 | 149,473 | 152,446 | 152,345 | 152,885 | 152,992 |
| GC content (%) | 37.2 | 37.7 | 37.3 | 37.3 | 37.0 | 37.3 |
| LSC size (bp) | 84,552 | 81,998 | 84,240 | 84,124 | 84,610 | 84,698 |
| SSC size (bp) | 18,268 | 17,973 | 18,196 | 18,243 | 18,269 | 18,250 |
| IR size (bp) | 24,956 | 24,751 | 25,005 | 24,989 | 25,003 | 25,022 |
| No. of genes | 114 | 114 | 114 | 114 | 114 | 114 |
| Protein-coding genes | 80 | 80 | 80 | 80 | 80 | 80 |
| tRNA genes | 30 | 30 | 30 | 30 | 30 | 30 |
| rRNA genes | 4 | 4 | 4 | 4 | 4 | 4 |
| Genes duplicated | 18 | 18 | 18 | 18 | 18 | 18 |
| Genes with intron | 17 | 17 | 17 | 17 | 17 | 17 |
| Pseudogene | 3 | 3 | 2 | 2 | 2 | 2 |
Comparison of intron lengths in the mainland Aster spathulifolius chloroplast genome with those of other related species.
| Genes | Location | Mainland | Island |
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
|
| LSC | 715 | 699 | 709 | 709 | 709 | 718 | |
|
| Intron1 | LSC | 812 | 800 | 812 | 811 | 814 | 812 |
| 616 | 623 | 614 | 615 | 615 | 614 | |||
|
| SSC | 1064 | 1105 | 1046 | 1077 | 1064 | 1055 | |
|
| IR | 676 | 670 | 674 | 674 | 674 | 674 | |
|
| LSC | 840 | 745 | 798 | 796 | 823 | 830 | |
|
| LSC | 767 | 724 | 760 | 785 | 809 | 745 | |
|
| LSC | 1058 | 1017 | 1087 | 1053 | 1098 | 1144 | |
|
| IR | 671 | 671 | 671 | 671 | 671 | 671 | |
|
| LSC | 742 | 721 | 743 | 742 | 742 | 743 | |
|
| 535 | 535 | 535 | 535 | 535 | 535 | ||
|
| LSC | 887 | 817 | 890 | 877 | 876 | 885 | |
|
| Intron1 | LSC | 692 | 697 | 691 | 690 | 702 | 690 |
| 744 | 739 | 739 | 739 | 739 | 739 | |||
|
| IR | 820 | 820 | 819 | 820 | 820 | 820 | |
|
| LSC | 732 | 722 | 732 | 733 | 732 | 732 | |
|
| IR | 780 | 781 | 780 | 775 | 780 | 781 | |
|
| LSC | 2542 | 2502 | 2528 | 2538 | 2539 | 2535 | |
|
| LSC | 438 | 423 | 438 | 438 | 438 | 438 | |
|
| LSC | 573 | 573 | 573 | 573 | 573 | 573 |
Figure 3Comparison of large single-copy regions (LSC), inverted repeat (IR), and small single-copy regions (SSC) junction positions among six Aster chloroplast genomes. Ψ indicates a pseudogene. Genes above the longer box are transcribed in rightward direction and genes below the longer box are transcribed in leftward direction.
Figure 4Comparison of the chloroplast genome of mainland Aster spathulifolius with those of other related species. SSC region in A. altaicus and A. tataricus was reoriented for comparison. Gray arrows above the alignment indicate the orientation of genes. Dark blue bars represent protein-coding genes, pale blue bars represent rRNA genes, and red bars represent conserved non-coding sequences. A cut-off of a 70% identity was used for the plots. The y-scale axis represents the percentage identity (50–100%). mVISTA was used to perform the comparison.
Figure 5Pairwise sequence distances of the protein-coding genes in the chloroplast genomes of mainland Aster spathulifolius and related species. The mainland A. spathulifolius chloroplast genome was compared with those of the island A. spathulifolius, A. altaicus, A. hersileoides, A. indicus, and A. tataricus. Of the 80 protein-coding genes analyzed, the 64 showing noticeable divergences are displayed here. Genes with zero divergence were omitted.
Figure 6Ka/Ks values of protein-coding genes in the Aster species. Mainland A. spathulifolius was compared with island A. spathulifolius, A. altaicus, A. hersileoides, A. indicus, and A. tataricus. Genes with zero Ka/Ks value in all comparisons were omitted.
Figure 7Analyses of the simple sequence repeats (SSRs) in the Aster chloroplast genomes. (A) The frequency of simple sequence repeats (SSRs) by unit length in six Aster chloroplast genomes. (B) The frequency of SSRs per sequence type. (C) Frequency of SSRs in exon, intron, and intergenic spacer (IGS). (D) Frequency of SSRs in large single copy (LSC), small single copy (SSC), and inverted repeat (IR) regions.
Figure 8Analyses of the long-dispersed repeats (LDRs) in the Aster chloroplast genomes. (A) Frequency of four LDR types. (B) Frequency of LDRs by length. (C) Frequency of LDRs in intergenic spacer (IGS), coding region (CDS), and intron. IGS/CDS represents repeats shared in IGS and CDS. (D) Frequency of LDRs in large single copy (LSC), small single copy (SSC), and inverted repeat (IR) regions. The frequency in (C,D) was determined for the first copy and second copy separately.
Figure 9Phylogenetic tree of the complete chloroplast genomes of various Asteraceae members, generated using the maximum likelihood method. Numbers at nodes indicate bootstrap values (1000 repeats). The scale bar represents the expected number of nucleotide substitutions per site. Lactuca sativa was used as the outgroup.
Figure 10The partial alignment of positively selected genes in the Aster species. Two genes were determined by branch-site model and Bayesian Empirical Bayes test. (A) The partial alignment of amino acid sequences of accD. (B) The partial alignment of amino acid sequences of ndhF.