| Literature DB >> 30248930 |
Xiaofeng Shen1, Shuai Guo2, Yu Yin3, Jingjing Zhang4, Xianmei Yin5, Conglian Liang6, Zhangwei Wang7, Bingfeng Huang8, Yanhong Liu9, Shuiming Xiao10,11, Guangwei Zhu12,13.
Abstract
We sequenced and analyzed the complete chloroplast genome of Aster tataricus (family Asteraceae), a Chinese herb used medicinally to relieve coughs and reduce sputum. The A. tataricus chloroplast genome was 152,992 bp in size, and harbored a pair of inverted repeat regions (IRa and IRb, each 24,850 bp) divided into a large single-copy (LSC, 84,698 bp) and a small single-copy (SSC, 18,250 bp) region. Our annotation revealed that the A. tataricus chloroplast genome contained 115 genes, including 81 protein-coding genes, 4 ribosomal RNA genes, and 30 transfer RNA genes. In addition, 70 simple sequence repeats (SSRs) were detected in the A. tataricus chloroplast genome, including mononucleotides (36), dinucleotides (1), trinucleotides (23), tetranucleotides (1), pentanucleotides (8), and hexanucleotides (1). Comparative chloroplast genome analysis of three Aster species indicated that a higher similarity was preserved in the IR regions than in the LSC and SSC regions, and that the differences in the degree of preservation were slighter between A. tataricus and A. altaicus than between A. tataricus and A. spathulifolius. Phylogenetic analysis revealed that A. tataricus was more closely related to A. altaicus than to A. spathulifolius. Our findings offer valuable information for future research on Aster species identification and selective breeding.Entities:
Keywords: Aster tataricus; chloroplast genome; phylogeny
Mesh:
Year: 2018 PMID: 30248930 PMCID: PMC6222381 DOI: 10.3390/molecules23102426
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Summary of complete chloroplast genomes for three Aster species.
| Species |
|
|
|
|---|---|---|---|
|
| |||
| Length (bp) | 84,240 | 81,998 | 84,698 |
| G + C (%) | 35.3 | 31.4 | 35.2 |
| Length (%) | 55.3 | 54.9 | 55.4 |
|
| |||
| Length (bp) | 18,196 | 17,973 | 18,250 |
| G + C (%) | 31.3 | 35.8 | 31.3 |
| Length (%) | 11.9 | 12.0 | 11.9 |
|
| |||
| Length (bp) | 25,005 | 24,751 | 25,022 |
| G + C (%) | 43.0 | 43.2 | 43.0 |
| Length (%) | 16.4 | 16.6 | 16.4 |
|
| |||
| Length (bp) | 152,446 | 149,473 | 152,992 |
| G + C (%) | 37.3 | 37.7 | 37.3 |
Genes in the A. tataricus chloroplast genome.
| Category | Gene Group | Gene Names |
|---|---|---|
| Self-replication | Large subunit of ribosomal proteins | |
| Small subunit of ribosomal proteins | ||
| DNA-dependent RNA polymerase | ||
| rRNA genes | ||
| tRNA genes | ||
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| NADH oxidoreductase | ||
| Cytochrome b6/f complex | ||
| ATP synthase | ||
| Rubisco |
| |
| Other genes | Maturase |
|
| Protease |
| |
| Envelope membrane protein |
| |
| Subunit acetyl-CoA-carboxylase |
| |
| c-Type cytochrome synthesis gene |
| |
| Conserved open reading frames |
** Genes containing introns. a Duplicated gene (genes present in the inverted repeat (IR) regions).
Figure 1Gene map of the A. tataricus chloroplast genome. Genes drawn inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. Genes belonging to different functional groups are color-coded. The darker gray in the inner circle corresponds to DNA G + C content, while the lighter gray corresponds to A + T content.
Codon–anticodon recognition patterns and codon usage for the A. tataricus chloroplast genome.
| Amino Acid | Codon | No. | RSCU * | tRNA | Amino Acid | Codon | No. | RSCU * | tRNA |
|---|---|---|---|---|---|---|---|---|---|
| Phe | UUU | 1064 | 1.34 | Tyr | UAU | 793 | 1.39 | ||
| Phe | UUC | 528 | 0.66 |
| Tyr | UAC | 348 | 0.61 |
|
| Leu | UUA | 554 | 1.55 |
| Stop | UAA | 508 | 1.09 | |
| Leu | UUG | 492 | 1.37 |
| Stop | UAG | 368 | 0.79 | |
| Leu | CUU | 459 | 1.28 | His | CAU | 374 | 1.36 | ||
| Leu | CUC | 205 | 0.57 | His | CAC | 175 | 0.64 |
| |
| Leu | CUA | 272 | 0.76 |
| Gln | CAA | 482 | 1.4 |
|
| Leu | CUG | 169 | 0.47 | Gln | CAG | 206 | 0.6 | ||
| Ile | AUU | 798 | 1.46 | Asn | AAU | 792 | 1.39 | ||
| Ile | AUC | 397 | 0.73 |
| Asn | AAC | 347 | 0.61 |
|
| Ile | AUA | 441 | 0.81 |
| Lys | AAA | 914 | 1.42 |
|
| Met | AUG | 415 | 1 |
| Lys | AAG | 376 | 0.58 | |
| Val | GUU | 364 | 1.45 | Asp | GAU | 471 | 1.43 | ||
| Val | GUC | 176 | 0.7 |
| Asp | GAC | 187 | 0.57 |
|
| Val | GUA | 318 | 1.26 |
| Glu | GAA | 550 | 1.39 |
|
| Val | GUG | 148 | 0.59 | Glu | GAG | 241 | 0.61 | ||
| Ser | UCU | 509 | 1.37 | Cys | UGU | 345 | 1.15 | ||
| Ser | UCC | 329 | 0.89 |
| Cys | UGC | 253 | 0.85 |
|
| Ser | UCA | 493 | 1.33 |
| Stop | UGA | 524 | 1.12 | |
| Ser | UCG | 292 | 0.79 | Trp | UGG | 491 | 1 |
| |
| Pro | CCU | 259 | 1.29 | Arg | CGU | 204 | 0.73 |
| |
| Pro | CCC | 156 | 0.78 |
| Arg | CGC | 108 | 0.39 | |
| Pro | CCA | 224 | 1.12 |
| Arg | CGA | 282 | 1.01 | |
| Pro | CCG | 164 | 0.82 | Arg | CGG | 173 | 0.62 | ||
| Thr | ACU | 321 | 1.22 | Arg | AGA | 329 | 0.89 |
| |
| Thr | ACC | 246 | 0.93 |
| Arg | AGG | 277 | 0.75 | |
| Thr | ACA | 314 | 1.19 |
| Ser | AGU | 578 | 2.06 | |
| Thr | ACG | 174 | 0.66 | Ser | AGC | 337 | 1.2 |
| |
| Ala | GCU | 250 | 1.25 | Gly | GGU | 315 | 0.95 | ||
| Ala | GCC | 169 | 0.85 | Gly | GGC | 205 | 0.62 |
| |
| Ala | GCA | 242 | 1.21 |
| Gly | GGA | 466 | 1.4 |
|
| Ala | GCG | 138 | 0.69 | Gly | GGG | 342 | 1.03 |
* RSCU: relative synonymous codon usage.
A comparison of exon and intron length in genes with introns in the A. tataricus and A. spathulifolius chloroplast genomes.
| Gene | Location | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) |
|---|---|---|---|---|---|---|
|
| LSC | 37 | 2497 | 38 | ||
| 37 | 2502 | 35 | ||||
|
| LSC | 23 | 732 | 48 | ||
| 23 | 723 | 47 | ||||
|
| LSC | 34 | 441 | 50 | ||
| 37 | 423 | 50 | ||||
|
| LSC | 36 | 575 | 37 | ||
| 38 | 573 | 37 | ||||
|
| IR | 38 | 781 | 35 | ||
| 43 | 776 | 35 | ||||
|
| IR | 38 | 820 | 35 | ||
| 38 | 820 | 35 | ||||
| LSC | 234 | 535 | 25 | —— | 114 | |
| 114 | —— | 243 | —— | 243 | ||
|
| LSC | 234 | 820 | 40 | ||
| 39 | 826 | 216 | ||||
|
| LSC | 402 | 1008 | 10 | ||
| —— | —— | —— | ||||
|
| IR | 391 | 671 | 434 | ||
| 393 | 668 | 435 | ||||
|
| LSC | 431 | 709 | 1639 | ||
| 429 | 721 | 1641 | ||||
|
| SSC | 552 | 1055 | 540 | ||
| 553 | 1105 | 540 | ||||
|
| IR | 777 | 674 | 756 | ||
| 777 | 670 | 756 | ||||
|
| LSC | 124 | 690 | 228 | 739 | 155 |
| 124 | 697 | 230 | 739 | 153 | ||
|
| LSC | 6 | 754 | 658 | ||
| 6 | 745 | 642 | ||||
|
| LSC | 144 | 718 | 411 | ||
| 145 | 699 | 410 | ||||
|
| LSC | 71 | 812 | 291 | 614 | 229 |
| 71 | 800 | 291 | 623 | 229 | ||
|
| LSC | 9 | 645 | 526 | ||
| 9 | 724 | 474 |
Exon and intron lengths in genes with introns in the A. tataricus chloroplast genome (gray background), and in the A. spathulifolius chloroplast genome (normal background) * The rps12 gene is a trans-spliced gene with the 5′ end located in the LSC region and the duplicated 3′ ends located in the IR regions.
Figure 2Analysis of simple sequence repeats (SSRs) in the three Aster chloroplast genomes.
Figure 3Comparison of three chloroplast genomes using mVISTA. Gray arrows and thick black lines above the alignment indicate gene orientation. Purple bars represent exons, blue bars represent untranslated regions (UTRs), pink bars represent conserved non-coding sequences (CNS), and gray bars represent mRNA. The y-axis represents the percentage identity (shown: 50–100%).
Figure 4Comparison of border distance between adjacent genes and junctions of the LSC, SSC, and two IR regions among the chloroplast genomes of three Aster species. Boxes below the main line indicate the adjacent border genes. The figure is not to scale with respect to sequence length and only shows relative changes at or near the IR/SC borders.
Figure 5Maximum likelihood (ML) phylogenetic tree reconstruction including 17 species based on concatenated sequences from all chloroplast genomes. The position of A. tataricus is indicated in red text. Paeonia ostii was used as the outgroup.