| Literature DB >> 23638113 |
Chuan Ku1, Jer-Ming Hu, Chih-Horng Kuo.
Abstract
Ardisia is a basal asterid genus well known for its medicinal values and has the potential for development of novel phytopharmaceuticals. In this genus of nearly 500 species, many ornamental species are commonly grown worldwide and some have become invasive species that caused ecological problems. As there is no completed plastid genome (plastome) sequence in related taxa, we sequenced and characterized the plastome of Ardisia polysticta to find plastid markers of potential utility for phylogenetic analyses at low taxonomic levels. The complete A. polysticta plastome is 156,506 bp in length and has gene content and organization typical of most asterids and other angiosperms. We identified seven intergenic regions as potentially informative markers with resolution for interspecific relationships. Additionally, we characterized the diversity of asterid plastomes with respect to GC content, plastome organization, gene content, and repetitive sequences through comparative analyses. The results demonstrated that the genome organizations near the boundaries between inverted repeats (IRs) and single-copy regions (SCs) are polymorphic. The boundary organization found in Ardisia appears to be the most common type among asterids, while six other types are also found in various asterid lineages. In general, the repetitive sequences in genic regions tend to be more conserved, whereas those in noncoding regions are usually lineage-specific. Finally, we inferred the whole-plastome phylogeny with the available asterid sequences. With the improvement in taxon sampling of asterid orders and families, our result highlights the uncertainty of the position of Gentianales within euasterids I.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23638113 PMCID: PMC3640096 DOI: 10.1371/journal.pone.0062548
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Plastome map of Ardisia polysticta.
Genes drawn inside the circle are transcribed clockwise, those outside counterclockwise. The within-plastome GC content variation is indicated in the middle circle. Pseudogenes (Ψ) and genes containing one (*) and two (**) introns are indicated. Regions of potential phylogenetic utility (Table 2) are indicated by hollow triangles outside the circle (numbered from more to less divergent). Numbers and locations of repetitive sequences (Table 4) are drawn on the four inner circles (from inside: dispersed direct repeats (forward triangles), inverted repeats (forward and reversed triangles), tandem repeats (tandem triangles), palindromic sequences and a sequence that matches its reversed sequence (hexagrams)).
Base composition of the Ardisia polysticta plastome.
| G/C (%) | A (%) | T (%) | C (%) | G (%) | Length (bp) | |
| Total | 37.07 | 31.17 | 31.75 | 18.87 | 18.19 | 156,506 |
| By chromosomal region | ||||||
| LSC | 34.94 | 32.01 | 33.04 | 17.92 | 17.01 | 86,078 |
| IRb | 43.01 | 28.54 | 28.44 | 20.76 | 22.24 | 26,050 |
| SSC | 30.17 | 34.87 | 34.95 | 15.82 | 14.34 | 18,328 |
| IRa | 43.01 | 28.44 | 28.54 | 22.24 | 20.76 | 26,050 |
| By codon position | ||||||
| Position 1 | 45.33 | 30.77 | 23.90 | 18.65 | 26.68 | 26,535 |
| Position 2 | 37.77 | 29.58 | 32.65 | 19.95 | 17.81 | 26,535 |
| Position 3 | 29.40 | 32.17 | 38.44 | 13.58 | 15.82 | 26,535 |
List of the most conserved and the most divergent intergenic regions in Ardisia-Sesamum and Ardisia-Panax comparisons.
| Most conserved | Most divergent | ||||
| Region | Pairwise distance | Region | Pairwise distance | Length (bp) in | Shaw et al. (2007) |
|
| 0.000 |
| 0.556 | 790 | ✓ |
|
| 0.000 |
| 0.456 | 341 | |
|
| 0.000 |
| 0.455 | 120 | |
|
| 0.000 |
| 0.430 | 249 | |
|
| 0.010 |
| 0.419 | 531 | ✓ |
|
| 0.016 |
| 0.394 | 425 | |
|
| 0.037 |
| 0.391 | 1,771 | ✓ |
|
| 0.039 |
| 0.376 | 100 | |
|
| 0.041 |
| 0.375 | 737 | |
|
| 0.045 |
| 0.374 | 305 | |
|
| 0.046 |
| 0.364 | 503 | |
|
| 0.047 |
| 0.354 | 967 | ✓ |
|
| 0.047 |
| 0.339 | 62 | |
|
| 0.057 |
| 0.334 | 902 | ✓ |
|
| 0.058 |
| 0.327 | 393 | |
|
| 0.059 | ||||
|
| 0.073 | ||||
Among the 20 most conserved/divergent regions in Ardisia-Sesamum and Ardisia-Panax pairwise comparisons, the shared regions are listed in the order of increasing/decreasing divergence in Ardisia-Sesamum comparison.
The most divergent regions in boldface are those of potential phylogenetic utility (>500 bp).
Regions highlighted in Shaw et al. (2007) are checked.
Distribution of repetitive sequences in the Ardisia polysticta plastome.
| No. | Length (bp) | Type | Start | Repeat sequence | Region |
| 1 | 42 | D(I) | 44,689 ( | YTACAGAACCGTACRTGAGATKTTCAYCTCATACGGCTCCTC | LSC; IR; SSC |
| 2 | 36 | T | 93,339 (149,211) ( | TAGTGACGAYATTGATGCTAGTGACGAYATTGATGC | IR |
| 3 | 35 | D | 39,754 ( | TGCAATAGCTAAATGATGRTGWGCAATATCRGTCA | LSC |
| 4 | 30 | R* | 33,462 ( | ATTATAWTATATATAATATATATWATATTA | LSC |
| 5 | 30 | P* | 14,106 ( |
| LSC |
| 6 | 29 | I | 58,774 ( |
| LSC |
| 7 | 28 | DI | 8,839 ( | GGARAGAGAGGGATTCGAACCCTCGRTA | LSC |
| 8 | 27 | I | 10,462 ( |
| LSC |
| 9 | 26 | P*(D) | 89,712 (152,848) ( |
| IR |
| 10 | 26 | T | 88,248 (154,312) ( | CTTTAGGAKAAATCAATGCAATTCAG | IR |
Repeats are numbered in the order of decreasing lengths.
D: dispersed direct repeat; T: tandem repeat; I: inverted repeat; P*: palindromic sequence. R*: reversed sequence matches the original. Types indicated parenthetically are due to the inverted-repeat nature of the IR regions.
For repeats in IR regions, the repeat sequences are shown for those in IRb with the start positions in IRa indicated in parentheses.
Distribution of simple sequence repeats in the Ardisia polysticta plastome.
| Repeat unit | Length (bp) | Number of SSRs | Start position |
| A | 10 | 6 | 5,337; 33,249; 43,327; 64,939; 83,012; 116,070 |
| 11 | 3 | 4,628; 14,067; 112,074 (Ψ | |
| 12 | 3 | 22,968; 82,976; 84,629 | |
| 13 | 1 | 137,791 | |
| 14 | 1 | 67,937 | |
| 15 | 2 | 9,380; 12,440 | |
| T | 10 | 12 | 5,313; 13,270; 26633 ( |
| 11 | 8 | 3,834; 18,901 ( | |
| 12 | 3 | 23,038; 36,512; 52,482 | |
| 13 | 2 | 10,782; 104,782 | |
| 14 | 1 | 59,954 | |
| 15 | 1 | 127,000 ( | |
| C | 10 | 1 | 22,958 |
| G | 12 | 1 | 73,448 |
| AT | 10 | 1 | 20,283 ( |
| 12 | 2 | 68,297; 81,280 | |
| TA | 10 | 1 | 127,419 ( |
| AATA | 12 | 1 | 117,842 ( |
| ATAA | 12 | 1 | 37,382 |
| ATTT | 12 | 1 | 68,040 |
| TAGT | 12 | 1 | 7,987 |
| TTCT | 16 | 1 | 65,070 |
| GAAA | 12 | 1 | 122,685 |
| GAAT | 12 | 1 | 6,966 |
| TAAAT | 15 | 1 | 79,640 |
SSR-containing coding regions are indicated in parentheses.
Figure 2Comparison of boundaries between inverted repeats (IR) and single-copy (SC) regions in representative asterid plastomes.
For a list of asterids found in different types, see Table S4.
Figure 3Maximum likelihood phylogeny of 78 plastome genes from 11 families (6 orders) of asterids.
All nodes, except the one uniting Gentianales and Lamiales, received 100% bootstrap support. Gene loss events are mapped onto the tree in the most parsimonious way.
Loss and gain of plastome protein-coding genes relative to Ardisia polysticta in nonparasitic euasterids.
| Taxa | Loss (−) and gain (+) |
|
| − |
|
| − |
|
| − |
|
| − |
|
| − |
|
| None |
Parthenium argentatum (Asteraceae) is not included. For reasons, see Materials and Methods.
All losses and gains were manually verified by BLAST searches. Differences in IR duplicates are not included.
Pseudogenization evidenced by an extended 3′ end, two frameshift mutations and an accelerated evolutionary rate (McNeal et al., 2007).