| Literature DB >> 24205264 |
Rong Li1, Peng-Fei Ma, Jun Wen, Ting-Shuang Yi.
Abstract
BACKGROUND: The ginseng family (Araliaceae) includes a number of economically important plant species. Previously phylogenetic studies circumscribed three major clades within the core ginseng plant family, yet the internal relationships of each major group have been poorly resolved perhaps due to rapid radiation of these lineages. Recent studies have shown that phyogenomics based on chloroplast genomes provides a viable way to resolve complex relationships. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2013 PMID: 24205264 PMCID: PMC3799623 DOI: 10.1371/journal.pone.0078568
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Taxa included in phylogenomic analyses of the Araliaceae.
|
|
|
|
|
|---|---|---|---|
| The Asian Palmate group | |||
|
| KC456164 | Current study | |
|
| JN637765 | Yi et al., 2012 [29] | |
|
| KC456167 | Current study | |
|
| KC456165 | Current study | |
|
| KC456166 | Current study | |
| The | |||
|
| KC456163 | Current study | |
|
| AY582139 | Kim and Lee, 2004 [28] | |
| Outgroup | |||
|
| DQ898156 | Ruhlman et al., 2006 [42] |
Summary of the chloroplast genome features.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| Size (bp) | 156,333 | 156,459 | 156,413 | 156,343 | 156,341 |
| LSC length (bp) | 86,028 | 86,566 | 86,466 | 86,360 | 86,122 |
| SSC length (bp) | 18,089 | 18,021 | 18,119 | 18,131 | 19,117 |
| IR length (bp) | 26,108 | 25,936 | 25,914 | 25,926 | 25,551 |
| Number of genes | 114 | 114 | 114 | 114 | 114 |
| Protein-coding genes | 80 | 80 | 80 | 80 | 80 |
| Structure RNAs | 34 | 34 | 34 | 34 | 34 |
| AT content (%) | 61.9 | 62 | 62 | 62 | 62.2 |
| Coding regions (%) | 50.3 | 50.1 | 50.1 | 50.2 | 50.3 |
Genes contained in chloroplast genomes (114 genes in total).
|
|
|
|
|---|---|---|
| Self replication | Ribosomal RNA genes |
|
| Transfer RNA genes |
| |
| Small subunit of ribosome |
| |
| Large subunit of ribosome |
| |
| RNA polymerase subunits |
| |
| Photosynthesis | NADH dehydrogenase |
|
| Photosystem I |
| |
| Photosystem II |
| |
| Cytochrome b/f complex |
| |
| ATP synthase |
| |
| Large subunit of rubisco |
| |
| Other genes | Translational initiation factor |
|
| Maturase |
| |
| Protease |
| |
| Envelope membrane protein |
| |
| Subunit of acetyl-CoA-carboxylase |
| |
| c-type cytochrome synthesis gene |
| |
| Unknown function | Conserved open reading frames |
|
Note: one and two asterisks indicate one- and two-intron containing genes, respectively. Genes located in the IR region are indicated by (×2) after the gene name.
Figure 1Gene map of the five Araliaceae chloroplast genomes.
Genes on the outside of the outer circle are transcribed in the clockwise direction and genes on the inside of the outer circle are transcribed in the counterclockwise direction. Functional categories of genes are color-coded. Dashed area in the inner circle indicates the GC content of the chloroplast genome.
Figure 2A percent identity plot showing the overall sequence similarity of the seven Araliaceae chloroplast genomes.
Pair-wise comparison of chloroplast genomes within the Araliaceae using the mVISTA program with Panax ginseng as reference. The Y scale represents % identity ranging from 50–100%. Thick black lines show the inverted repeats (IRs) regions in the chloroplast genome. Genome regions are color-coded as protein coding, tRNA or rRNA genes, and conserved noncoding sequences.
Figure 3Comparison of border positions of LSC, SSC, and IR regions among seven Araliaceae chloroplast genomes.
Boxes above the main line indicate the genes while the pseudogenes at the borders are shown by Ψ. The figure is not the scale and just shows relative changes at or near the IR/SC borders.
Figure 4Repeat structure analysis in the five Araliaceae chloroplast genomes.
A: Histogram showing the frequency of repeats by length in the five Araliaceae chloroplast genomes; B: Composition of the 140 repeats from five Araliaceae species; C: Histogram showing the number of three repeat types in each Araliaceae chloroplast genome; D: Location of the 140 repeats from five Araliaceae species.
Statistics of seven data partitions used in phylogenomic analyses.
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| Maximum parsimony | Aligned length (bp) | 164,615 | 90,637 | 19,009 | 27,467 | 69,683 | 44,631 | 15,851 |
| Variable sites (%) | 12,360 (7.51%) | 8,294 (9.15%) | 2,533 (13.33%) | 769 (2.80%) | 4,376 (6.28%) | 5,767 (12.92%) | 1,373 (8.66%) | |
| Informative sites (%) | 1,164 (0.71%) | 776 (0.86%) | 309 (1.63%) | 39 (0.14%) | 464 (0.67%) | 497 (1.11%) | 156 (0.98%) | |
| Tree length | 13,503 | 9,027 | 2,889 | 791 | 4,801 | 6,292 | 1,528 | |
| Consistency index (CI) | 0.9511 | 0.9523 | 0.9339 | 0.9823 | 0.9488 | 0.9541 | 0.9359 | |
| Retention index (RI) | 0.5728 | 0.5864 | 0.5201 | 0.7308 | 0.6019 | 0.5533 | 0.5442 | |
| Maximum likelihood | -lnL | 299568.231123 | 171416.691615 | 39791.133899 | 42835.819761 | 122770.698809 | 90158.672476 | 29847.663050 |
| Bayesian inference | Model selected by AIC | TVM+I+G | GTR+I+G | TVM+I+G | TIM+I | TVM+I+G | TVM+G | TIM+I+G |
Figure 5Phylogenetic relationships of Araliaceae based on seven different data partitions in chloroplast genomes.
A: Complete chloroplast DNA sequences; B: The large single-copy (LSC) region; C: The small single-copy (SSC) region; D: The inverted repeat (IR) region; E: Eighty common protein coding genes; F: The intergenic spacers region; G: The introns region. Support values are shown for nodes as maximum parsimony bootstrap (PB)/maximum likelihood bootstrap (LB)/Bayesian inference posterior probability (PP). Dash shows that the PB value lower than 50% or the LB value lower than 70% or the PP value lower than 0.95. Branch lengths were calculated through Bayesian analysis, and scale bar denotes substitutions per site.
Figure 6Phylogeny reconstructed using the indels from exon of coding regions in seven Araliaceae species.
The numbers above branches indicate parsimony bootstrap values (PB) for maximum parsimony analysis.
Figure 7Percentage of variable characters in homologous regions between the chloroplast genomes of the Asian Palmate group and the Panax-Aralia group.
A: Coding region; B: Noncoding region. The homologous regions are oriented according to their locations in the chloroplast genome.