| Literature DB >> 29095905 |
Péter Poczai1, Jaakko Hyvönen1,2.
Abstract
Spanish moss (Tillandsia usneoides) is an epiphytic bromeliad widely distributed throughout tropical and warm temperate America. This plant is highly adapted to extreme environmental conditions. Striking features of this species include specialized trichomes (scales) covering the surface of its shoots aiding the absorption of water and nutrients directly from the atmosphere and a specific photosynthesis using crassulacean acid metabolism (CAM). Here we report the plastid genome of Spanish moss and present the comparison of genome organization and sequence evolution within Poales. The plastome of Spanish moss has a quadripartite structure consisting of a large single copy (LSC, 87,439 bp), two inverted regions (IRa and IRb, 26,803 bp) and short single copy (SSC, 18,612 bp) region. The plastid genome had 37.2% GC content and 134 genes with 88 being unique protein-coding genes and 20 of these are duplicated in the IR, similar to other reported bromeliads. Our study shows that early diverging lineages of Poales do not have high substitution rates as compared to grasses, and plastid genomes of bromeliads show structural features considered to be ancestral in graminids. These include the loss of the introns in the clpP and rpoC1 genes and the complete loss or partial degradation of accD and ycf genes in the Graminid clade. Further structural rearrangements appeared in the graminids lacking in Spanish moss, which include a 28-kb inversion between the trnG-UCC-rps14 region and 6-kb in the trnG-UCC-psbD, followed by a third <1kb inversion in the trnT sequence.Entities:
Mesh:
Year: 2017 PMID: 29095905 PMCID: PMC5667773 DOI: 10.1371/journal.pone.0187199
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Close-up microscope photo of Spanish moss (Tillandsia usneoides L.).
Peltate scales (trichomes) are displayed.
Fig 2Phylogenetic tree showing the major clades and families of Poales, simplified from APG IV [16].
Dashed lines indicate families with uncertain position. Numbers indicate the available plastid genome sequences in NCBI Organelle Genome Resources (searched on 8.11.2016). Asterisk in case of Cyperaceae indicate the presence of one available (Carex siderosticta, NC027250) but currently unpublished sequence.
Fig 3The circular representation of the plastid genome of Spanish moss (Tillandisa usneoides).
Structural organization of the gene content ring was color coded based on its functional category. The innermost circle denotes the GC content across the genome. The genes that were transcribed counter-clockwise and clockwise were at the outer and inner ring, respectively. The illustration was drawn using OGDRAW.
List of genes in the chloroplast genome of Spanish moss.
| Group of genes | Name of genes | |
|---|---|---|
| Protein synthesis and DNA-replication | Transfer RNAs | |
| Ribosomal RNAs | ||
| Ribosomal protein small subunit | ||
| Ribosomal protein large subunit | ||
| Subunits of of RNA polymerase | ||
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| Cythochrome b/f complex | ||
| ATP synthase | ||
| NADH-dehydrogenase | ||
| Large subunit Rubisco | ||
| Miscellaneous group | Translation initiation factor IF-1 | |
| Acetyl-CoA carboxylase | ||
| Cytochrome c biogenesis | ||
| Maturase | ||
| ATP-dependent protease | ||
| Inner membrane protein | ||
| Pseudogene unknown function | Conserved hypothetical chloroplast ORF |
aGene containing two introns;
bGene containing a single intron;
cGene divided into two independent transcription units
The genes having intron in the Tillandsia usneoides plastid genome and the length of the exons and introns.
| Gene | Location | Exon1 (bp) | Intron1 (bp) | Exon2 (bp) | Intron2 (bp) | Exon3 (bp) |
|---|---|---|---|---|---|---|
| LSC | 212 | 826 | 42 | |||
| LSC | 23 | 687 | 48 | |||
| LSC | 411 | 834 | 144 | |||
| LSC | 1626 | 706 | 435 | |||
| LSC | 153 | 747 | 228 | 726 | 132 | |
| LSC | 35 | 576 | 50 | |||
| LSC | 35 | 586 | 39 | |||
| LSC | 114 | 26 | 541 | 232 | ||
| LSC | 252 | 669 | 291 | 879 | 69 | |
| LSC | 6 | 783 | 642 | |||
| LSC | 8 | 745 | 474 | |||
| LSC | 402 | 1076 | 9 | |||
| IR | 432 | 666 | 393 | |||
| IR | 756 | 700 | 777 | |||
| IR | 37 | 945 | 35 | |||
| IR | 38 | 801 | 35 | |||
| SSC | 540 | 1061 | 549 |
*rps12 is a trans-spliced gene with 5' end exon located in the LSC region and the duplicated 3'end exon located in the IR regions.
Relative synonymous codon usage (RSCU) of Tillandsia usneoides is given in parentheses following the codon frequency.
| Codon | Count | RSCU | Codon | Count | RSCU | Codon | Count | RSCU | Codon | Count | RSCU |
|---|---|---|---|---|---|---|---|---|---|---|---|
| UUU(F) | 1205 | 1.22 | UCU(S) | 581 | 1.3 | UAU(Y) | 951 | 1.36 | UGU(C) | 456 | 1.12 |
| UUC(F) | 763 | 0.78 | UCC(S) | 422 | 0.95 | UAC(Y) | 443 | 0.64 | UGC(C) | 356 | 0.88 |
| UUA(L) | 751 | 1.43 | UCA(S) | 611 | 1.37 | UAA( | 434 | 0.91 | UGA( | 579 | 1.21 |
| UUG(L) | 675 | 1.29 | UCG(S) | 335 | 0.75 | UAG( | 425 | 0.89 | UGG(W) | 611 | 1 |
| CUU(L) | 615 | 1.17 | CCU(P) | 340 | 1.23 | CAU(H) | 529 | 1.37 | CGU(R) | 263 | 0.72 |
| CUC(L) | 337 | 0.64 | CCC(P) | 221 | 0.8 | CAC(H) | 243 | 0.63 | CGC(R) | 135 | 0.37 |
| CUA(L) | 479 | 0.91 | CCA(P) | 368 | 1.33 | CAA(Q) | 652 | 1.37 | CGA(R) | 356 | 0.97 |
| CUG(L) | 285 | 0.54 | CCG(P) | 180 | 0.65 | CAG(Q) | 302 | 0.63 | CGG(R) | 241 | 0.66 |
| AUU(I) | 1100 | 1.26 | ACU(T) | 385 | 1.15 | AAU(N) | 1052 | 1.33 | AGU(S) | 420 | 0.94 |
| AUC(I) | 704 | 0.81 | ACC(T) | 293 | 0.87 | AAC(N) | 527 | 0.67 | AGC(S) | 303 | 0.68 |
| AUA(I) | 805 | 0.93 | ACA(T) | 457 | 1.36 | AAA(K) | 1243 | 1.34 | AGA(R) | 752 | 2.05 |
| AUG(M) | 683 | 1 | ACG(T) | 209 | 0.62 | AAG(K) | 608 | 0.66 | AGG(R) | 449 | 1.23 |
| GUU(V) | 531 | 1.43 | GCU(A) | 252 | 1.26 | GAU(D) | 714 | 1.49 | GGU(G) | 388 | 1 |
| GUC(V) | 229 | 0.62 | GCC(A) | 160 | 0.8 | GAC(D) | 247 | 0.51 | GGC(G) | 188 | 0.48 |
| GUA(V) | 466 | 1.25 | GCA(A) | 276 | 1.39 | GAA(E) | 878 | 1.37 | GGA(G) | 598 | 1.54 |
| GUG(V) | 263 | 0.71 | GCG(A) | 109 | 0.55 | GAG(E) | 406 | 0.63 | GGG(G) | 382 | 0.98 |
*translation termination (stop codon)
Repeat sequences of Tillandsia usenoides chloroplast genome.
| No | Type | Location | Region | Repeat unit | Period size (bp) | Copy number | |
|---|---|---|---|---|---|---|---|
| 1 | T | intron | LSC | 17 | 4 | ||
| 2 | T | IGS | LSC | 20 | 2 | ||
| 3 | T | intron | LSC | 23 | 2 | ||
| 4 | T | IGS | LSC | 63 | 2 | ||
| 5 | D | IGS | LSC | 21 | 2 | ||
| 6 | D | IGS | LSC | 42 | 2 | ||
| 7 | D | IGS | LSC | 42 | 2 | ||
| 8 | C | IGS | LSC | 19 | 2 | ||
| 9 | C | IGS/intron | LSC | 19 | 2 | ||
| 10 | R | CDS | LSC | 10 | 2 | ||
| 11 | P | IGS | LSC | 11 | 2 | ||
| 12 | P | IGS | LSC | 13 | 2 | ||
| 13 | P | CDS/IGS | LSC | 18 | 2 | ||
| 14 | P | IGS | LSC | 18 | 2 | ||
| 15 | P | IGS | LSC | 24 | 2 | ||
| 16 | T | IGS | SSC | 22 | 2 | ||
| 17 | R | IGS | SSC | 14 | 2 | ||
| 18 | P | IGS | SSC | 12 | 2 | ||
| 19 | P | IGS | SSC | 13 | 2 | ||
| 20 | P | CDS | SSC | 14 | 2 | ||
| 21 | P | CDS | SSC | 17 | 2 | ||
| 22 | T | CDS | IRa/IRb | 28 | 2 | ||
| 23 | D/P | IGS | IRa/IRb | 17 | 2 | ||
| 24 | P/D | IGS | IRa/IRb | 18 | 2 | ||
| 25 | D/P | intron/IGS | LSC/IRA/IRB | 24 | 3 |
Abbreviations T = Tandem, D = direct, P = Palindromic, R = Reversed, C = Complementary, IGS = intergenic spacer, CDS = Coding Sequence, LSC = long single copy region, SSC = small single copy region, IR = inverted repeat
Estimates of average evolutionary divergence over 80 protein coding-gene sequences from Poales.
Standard error estimate(s) are shown in the fourth column and were obtained by a bootstrap procedure (1000 replicates). The ten most divergent genes are shown in part A, while the ten most conserved genes are listed in part B.
| Functional Group | Gene | Location | d | S.E. | |
|---|---|---|---|---|---|
| Acetyl-CoA carboxylase | LSC | 0.297 | 0.065 | ||
| ATP-dependent protease | LSC | 0.117 | 0.008 | ||
| Conserved hypothetical chloroplast ORF | SSC | 0.130 | 0.009 | ||
| IR | 0.204 | 0.068 | |||
| Maturase | LSC | 0.119 | 0.005 | ||
| NADH-dehydrogenase | SSC | 0.104 | 0.004 | ||
| Ribosomal protein small subunit | LSC | 0.105 | 0.011 | ||
| Ribosomal protein large subunit | LSC | 0.101 | 0.010 | ||
| IR | 0.119 | 0.015 | |||
| Translation initiation factor IF-1 | LSC | 0.115 | 0.016 | ||
| Cytochrome b/f complex | LSC | 0.033 | 0.010 | ||
| LSC | 0.036 | 0.010 | |||
| NADH-dehydrogenase | IR | 0.014 | 0.001 | ||
| Photosystem II | LSC | 0.038 | 0.003 | ||
| LSC | 0.028 | 0.005 | |||
| LSC | 0.014 | 0.005 | |||
| LSC | 0.029 | 0.008 | |||
| Ribosomal protein large subunit | IR | 0.024 | 0.003 | ||
| Ribosomal protein small subunit | IR | 0.028 | 0.004 | ||
| IR | 0.018 | 0.002 |
Comparison of major features of Tillandsia usneoides and nine commelinid plastid genomes.
| GenBank Nr. | KY293680 | NC026220 | NC013823 | NC020363 | NC022926 | NC002762 | NC023533 | NC026967 | NC026957 | NC001666 |
| Size (bp) | 159 657 | 159 636 | 161 572 | 155 890 | 161 347 | 134 545 | 135 362 | 136 354 | 142 772 | 140 384 |
| LSC length (bp) | 87 489 | 87 482 | 89 140 | 85 650 | 88 016 | 80 348 | 79 881 | 82 009 | 79 972 | 82 352 |
| SSC length (bp) | 18 612 | 18 622 | 19 652 | 18 444 | 18 989 | 12 791 | 12 519 | 12 587 | 12 868 | 12 536 |
| IR length (bp) | 26 803 | 26 766 | 26 390 | 25 898 | 27 171 | 20 703 | 21 481 | 20 879 | 24 966 | 22 748 |
| Total Nr of genes | 134 | 141 | 131 | 132 | 133 | 128 | 114 | 129 | 128 | 128 |
| Nr of genes duplicated in the IR | 20 | 24 | 18 | 21 | 20 | 18 | 12 | 18 | 19 | 18 |
| Nr genes with introns | 18 | 18 | 18 | 21 | 20 | 16 | 14 | 16 | 18 | 16 |
| % GC content | 37.2 | 37.4 | 33.8 | 36.3 | 35.9 | 37.2 | 38.3 | 39 | 37.2 | 37.4 |
Fig 4Multipip analysis showing overall sequence similarity of plastid genomes based on complete genome alignment.
Levels of sequence similarity are indicated by red (75–100%), green (50–75%) and white (<50%). Comparison was made for ten plastid genomes using Tillandsia usneoides as the reference genome. Arrows indicate gene/intron losses and deletions; partial duplication of the ycf1 is due to IR expansion.
Fig 5Percent identity plots.
A. Tillandsia usneoides compared to Triticum aestivum, showing the three major inversions specific for grasses. B. Ananas comosus compared to T. usneoides.
Fig 6Comparison of plastid borders of LSC, SSC and IR regions among selected species of Poales.
Selected genes or portions of genes are indicated by colored boxes. Gene and region lengths are not to scale (see Table 6).
Fig 7Maximum likelihood (ML) tree of 80 protein coding genes of Poales.
Tree of the parsimony analysis was congruent with the ML tree displayed. Topological differences are indicated by dashed lines restricted to the sister grouping of Typhaceae resolved only with parsimony. Bootstrap values less than 100 are shown at nodes for parsimony/ML. Genomic changes are indicated by black bars.