| Literature DB >> 30740573 |
Changkun Liu1, Zhenyan Yang1, Lifang Yang1,2, Junbo Yang3, Yunheng Ji1.
Abstract
Panax stipuleanatus (Araliaceae) is an endangered and medicinally important plant endemic to China. However, phylogenetic relationships within the genus Panax have remained unclear. In this study, we sequenced the complete plastome of P. stipuleanatus and included previously reported Panax plastomes to better understand the relationships between species and plastome evolution within the genus Panax. The plastome of P. stipuleanatus is 156,069 base pairs (bp) in length, consisting of a pair of inverted repeats (IRs, each 25,887 bp) that divide the plastome into a large single copy region (LSC, 86,126 bp) and a small single copy region (SSC, 8169 bp). The plastome contains 114 unigenes (80 protein-coding genes, 30 tRNA genes, and 4 rRNA genes). Comparative analyses indicated that the plastome gene content and order, as well as the expansion/contraction of the IR regions, are all highly conserved within Panax. No significant positive selection in the plastid protein-coding genes was observed across the eight Panax species, suggesting the Panax plastomes may have undergone a strong purifying selection. Our phylogenomic analyses resulted in a phylogeny with high resolution and supports for Panax. Nine protein-coding genes and 10 non-coding regions presented high sequence divergence, which could be useful for identifying different Panax species.Entities:
Keywords: Araliaceae; Comparative genomics; Panax stipuleanatus; Phylogenomics; Plastome
Year: 2018 PMID: 30740573 PMCID: PMC6317490 DOI: 10.1016/j.pld.2018.11.001
Source DB: PubMed Journal: Plant Divers ISSN: 2468-2659
Fig. 1The geographic distribution and morphological features of Panax stipuleanatus H. T. Tsai & K. M. Feng. A, geographic distribution; B, aerial part; C, inflorescence; D, fruit; E, rootstock.
Fig. 2Plastome map of the Panax stipuleanatus. Genes shown outside of the outer layer circle are transcribed counterclockwise, whereas those genes inside of this circle are transcribed clockwise. The colored bars indicate the known protein-coding genes, tRNA, and rRNA. The dashed, darker gray area of the inner circle denotes the GC content, while the lighter gray area indicates the AT content of the genome. LSC, large single-copy; SSC, small single-copy; IR, inverted repeat.
Comparison of plastome features among Panax species.
| Species | Total | LSC | SSC | IRs | Coding sequence | Non-coding sequence |
|---|---|---|---|---|---|---|
| Length (bp) | Length (bp) | Length (bp) | Length (bp) | Length (bp) | Length (bp) | |
| 156,333 | 86,028 | 18,089 | 26,108 | 92,417 | 63,916 | |
| 156,063 | 86,111 | 18,174 | 25,889 | 91,632 | 64,431 | |
| 156,241–156,425 | 86,106–86,200 | 16,077–18,084 | 26,018–28,018 | 92,060–92,238 | 64,116–64,259 | |
| 156,188 | 86,199 | 18,013 | 25,988 | 91,822 | 64,365 | |
| 156,324–156,466 | 86,082–86,190 | 18,004–18,554 | 25,861–26,136 | 91,866–92,134 | 64,202–64,521 | |
| 156,088–156,364 | 86,095–86,124 | 17,993–18,080 | 26,000–26,080 | 88,009–92,012 | 64,076–68,355 | |
| 156, 069 | 86,126 | 18,169 | 25,887 | 91,632 | 64,437 | |
| 156,157 | 86,322 | 18,047 | 25,894 | 91,441 | 64,716 | |
| 155,992–155,993 | 86,177–86,178 | 17,935 | 25,940 | 91,634–91,643 | 64,350–64,358 |
List of genes identified in the plastome of Panax stipuleanatus.
| Category of Genes | Group of gene | Name of gene |
|---|---|---|
| Self-replication | Ribosomal RNA genes | |
| Transfer RNA genes | ||
| Ribosomal protein (small subunit) | ||
| Ribosomal protein (large subunit) | ||
| RNA polymerase | ||
| Translational initiation factor | ||
| Genes for photosynthesis | Subunits of photosystem I | |
| Subunits of photosystem II | ||
| Subunits of cytochrome | ||
| Subunits of ATP synthase | ||
| Large subunit of Rubisco | ||
| Subunits of NADH dehydrogenase | ||
| Other genes | Maturase | |
| Envelope membrane protein | ||
| Subunit of acetyl-CoA | ||
| Synthesis gene | ||
| ATP-dependent protease | ||
| Component of TIC complex | ||
| Genes of unknown function | Conserved open reading frames |
×2: Two gene copies in IR regions; *: With one intron; **: With two introns.
Fig. 3Comparison of the borders of the LSC, SSC, and IR regions among the eight Panax plastomes. LSC, large single-copy; SSC, small single-copy; IR, inverted repeat.
Fig. 4Visualized alignment of the eight Panax plastomes. The mVISTA-based identity plots show the sequence identity among the eight Panax plastome, for which P. ginseng serves as the reference. Gray arrows indicate the position and direction of each gene. Genome regions are color-coded as protein coding, rRNA, tRNA, or conserved non-coding regions. Black lines define the regions of sequence identity shared with P. ginseng (using a 50%-identity cutoff criterion).
Summary of SNPs found in the eight representative Panax plastomes.
| Data type | Number of SNPs | Characters (bp) | Divergence proportion (%) |
|---|---|---|---|
| Complete plastome | 1130 | 156,069 | 0.7240 |
| Protein-coding genes | 498 | 79,774 | 0.6243 |
| Non-coding regions | 620 | 64,437 | 0.9622 |
| LSC region | 743 | 86,126 | 0.8627 |
| SSC region | 291 | 18,169 | 1.6016 |
| IR regions | 48 | 25,887 | 0.1854 |
Substitution rates of 80 protein-coding genes in the eight Panax plastomes.
| Taxa | Non-synonymous (Ka) | Synonymous (Ks) | Ka/Ks |
|---|---|---|---|
| 0.0032 ± 0.0006 | 0.0166 ± 0.0018 | 0.1391 | |
| 0.0032 ± 0.0005 | 0.0185 ± 0.0020 | 0.1575 | |
| 0.0034 ± 0.0005 | 0.0186 ± 0.0020 | 0.1683 | |
| 0.0032 ± 0.0006 | 0.0187 ± 0.0022 | 0.1521 | |
| 0.0032 ± 0.0005 | 0.0177 ± 0.0020 | 0.1729 | |
| 0.0034 ± 0.0006 | 0.0169 ± 0.0018 | 0.1393 | |
| 0.0034 ± 0.0005 | 0.0218 ± 0.0022 | 0.1523 | |
| 0.0034 ± 0.0006 | 0.0188 ± 0.0021 | 0.1559 |
Aralia undulata was used as an outgroup. Data are presented as the means ± standard error.
Fig. 5Non-synonymous substitution (Ka), synonymous substitution (Ks), and the Ka/Ks values for the Panax plastid protein-coding genes.
Fig. 6Phylogenetic tree reconstruction of the genus Panax via maximum likelihood (ML), based on (A) the whole plastome; (B) the protein-coding exons; (C) the large single-copy (LSC) regions; (D) the small single-copy (SSC) regions; (E) the inverted repeated (IR) regions; and (F) the introns and intergenic spacers. The numbers above the line represent the ML-bootstrap values (1000 replicates).