| Literature DB >> 28687778 |
Kyunghee Kim1, Van Binh Nguyen1, Jingzhou Dong2, Ying Wang3, Jee Young Park1, Sang-Choon Lee1, Tae-Jin Yang4,5.
Abstract
We produced complete sequences and conducted comparative analysis of the maternally inherited chloroplast (cp) genomes and bi-parentally inherited 45S nuclear ribosomal RNA genes (nrDNA) from ten Araliaceae species to elucidate the genetic diversity and evolution in that family. The cp genomes ranged from 155,993 bp to 156,730 bp with 97.1-99.6% similarity. Complete 45S nrDNA units were about 11 kb including a 5.8-kb 45S cistron. Among 79 cp protein-coding genes, 74 showed nucleotide variations among ten species, of which infA, rpl22, rps19 and ndhE genes showed the highest Ks values and atpF, atpE, ycf2 and rps15 genes showed the highest Ka/Ks values. Four genes, petN, psaJ, psbF, and psbN, related to photosynthesis and one gene, rpl23, related to the ribosomal large subunit remain conserved in all 10 Araliaceae species. Phylogenetic analysis revealed that the ten species could be resolved into two monophyletic lineages, the Panax-Aralia and the Eleutherococcus-Dendropanax groups, which diverged approximately 8.81-10.59 million years ago (MYA). The Panax genus divided into two groups, with diploid species including P. notoginseng, P. vietnamensis, and P. japonicus surviving in Southern Asia and a tetraploid group including P. ginseng and P. quinquefolius Northern Asia and North America 2.89-3.20 MYA.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28687778 PMCID: PMC5501832 DOI: 10.1038/s41598-017-05218-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Cp genomes and 45S nrDNA sequences used for comparative analysis in this study.
| Genus | Species (Abbreviated name) | WGS reads used | Length (GenBank accession no.) | |||
|---|---|---|---|---|---|---|
| Amounts (Mb) | Cp Coverage (x) | 45S nrDNA Coverage (x) | Cp | 45S nrDNA (bp) | ||
|
|
| 505 | 97 | 659 | 156,248 (KM088019) | 11,091 (KM036295)b |
|
| 1,010 | 127 | 533 | 156,088 (KM088018) | 11,169 (KM036297)c | |
|
| 2,811 | 246 | 2555 | 156,466 (KP036468) | 6,306 (KT380921)b | |
|
| 2,870 | 237 | 4295 | 156,188 (KP036469) | 6,275 (KT380920)b | |
|
| 4,586 | 1,005 | 2267 | 155,993 (KP036470) | 7,280 (KT380922)b | |
|
|
| 505 | 90 | 267 | 156,220 (KT153023) | 6,073 (KT380919)b |
|
| NA | NA | NA | 156,333 (NC_022810)a | 610 (AF273540)d | |
|
|
| 468 | 57 | 426 | 156,730 (KT153019) | 10,109 (KT380924)c |
|
| NA | NA | NA | 156,768 (NC_016430)a | 610 (AB570259.1)d | |
|
|
| 3,453 | 222 | 1124 | 156,366 (KR136270) | 9,332 (KT380923)b |
aCp sequences retrieved from GenBank. b45S nrDNA including full 45S transcription sequence (5.8 kb) and partial IGS sequence. c45S nrDNA including full 45S transcription sequence (5.8 kb) and full IGS sequence. dnrITS (ITS1-5.8S-ITS2) sequences retrieved from GenBank. NA: not available.
Figure 1Gene map and nucleotide polymorphism of cp genomes in ten Araliaceae species. (a) Colored boxes are conserved chloroplast genes classified based on product function. The complete cp genome sequence was generated by the dnaLCW method and annotated using the DOGMA program (http://dogma.ccbb.utexas.edu/). The map was prepared using OGDRAW (http://ogdraw.mpimp-golm.mpg.de/). Genes transcribed clockwise and counterclockwise are indicated on the outside and inside of the large circle, respectively. (b) The depth of polymorphisms found among cp genomes of ten Araliaceae species. Cp genome (KM088019) of P. ginseng was used as a reference for comparison.
Figure 2Summary of Ks and Ka values among cp genomes of ten species. The rates of non-synonymous substitution (Ka) and synonymous substitution (Ks) of 79 conserved protein-coding sequences were calculated and averaged using KaKs calculator. The average Ks and Ka values are indicated by grey and dark blue bars, respectively. Black stars indicate the genes evolved under positive selection pressure with over 1 for the value of Ka/Ks ratios. Red stars indicate five conserved genes without variations and black triangles indicate two conserved genes without SNP but with 21 and 6 bp InDel variations in Araliaceae family.
Ks values and estimated divergence time of 10 Araliaceae species.
| Species | Divergence time (MYA)b | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| PG | PQ | PJ | PV | PN | AU | AE | Esen | ES | DM | ||
|
|
| 0.72 | 3.41 | 3.30 | 3.05 | 8.41 | 8.28 | 8.70 | 9.37 | 8.27 | |
|
| 0.0014 | 3.30 | 3.18 | 2.93 | 8.28 | 8.15 | 8.57 | 9.32 | 8.06 | ||
|
| 0.0068 | 0.0066 | 1.69 | 3.15 | 8.85 | 8.72 | 9.20 | 9.95 | 8.72 | ||
|
| 0.0066 | 0.0064 | 0.0034 | 3.01 | 8.62 | 8.53 | 8.98 | 9.74 | 8.48 | ||
|
| 0.0061 | 0.0059 | 0.0063 | 0.0060 | 8.46 | 8.26 | 8.82 | 9.58 | 8.38 | ||
|
| 0.0168 | 0.0166 | 0.0177 | 0.0172 | 0.0169 | 3.20 | 8.33 | 9.13 | 8.11 | ||
|
| 0.0166 | 0.0163 | 0.0174 | 0.0171 | 0.0165 | 0.0064 | 8.30 | 9.09 | 7.96 | ||
|
| 0.0174 | 0.0171 | 0.0184 | 0.0180 | 0.0176 | 0.0167 | 0.0166 | 2.98 | 4.10 | ||
|
| 0.0187 | 0.0186 | 0.0199 | 0.0195 | 0.0192 | 0.0183 | 0.0182 | 0.0060 | 4.85 | ||
|
| 0.0165 | 0.0161 | 0.0174 | 0.0170 | 0.0168 | 0.0162 | 0.0159 | 0.0082 | 0.0097 | ||
aAverage Ks values between common cp protein coding genes of each species calculated using KaKs calculator program. bDivergence time was estimated by Ks/2λ, where λ = 1.0 × 10−9. PG to DM indicate abbreviated species name (Table 1).
Figure 3Validation of polymorphic sites with TR CNV in cp genomes of Araliaceae species. (a) Schematic diagram of CNV of TR units found in ycf1 genes among nine species, and triangles indicate TR unit of 57 bp. (b) Validation of CNV of 57-bp TR in nine species using genomic DNA PCR analysis with the pgycf01 primer set (Supplementary Table S2). Abbreviated species names (Table 1) are shown above the lanes. M indicates 100-bp DNA ladder.
Figure 4Assembly and comparison of 45S nrDNA sequences. (a–c) Full-length 45S nrDNA sequence of P. quinquefolius assembled in this study. Average read depth was 376× (a) and average GC content was 53.15% (b). (c) Schematic diagram of 45S nrDNA structure. (d) Comparison of 45S nrDNA transcription sequences of eight Araliaceae species. P. ginseng 45S nrDNA transcription sequence (KM036295) was used as reference for sequence comparison.
Figure 5Phylogenetic tree and divergence time of 10 Araliaceae species. (a,b) Phylogenetic trees were generated based on cp protein-coding sequences (a) and nrDNA ITS sequences (ITS1-5.8S-ITS2) (b). Dashed lines connect the positions of each species in the two trees. Numbers next to nodes indicate average divergence time (*) based on Ks values (Table 2), and median divergence time using the BEAST program with 95% highest posterior density. The number above each branch refers to the Bayesian posterior probability (a) and ML bootstrap values (b).