| Literature DB >> 29494552 |
Mira Park1,2, Hyun Park3,4, Hyoungseok Lee5,6, Byeong-Ha Lee7, Jungeun Lee8,9.
Abstract
Organellar genomes of bryophytes are poorly represented with chloroplast genomes of only four mosses, four liverworts and two hornworts having been sequenced and annotated. Moreover, while Antarctic vegetation is dominated by the bryophytes, there are few reports on the plastid genomes for the Antarctic bryophytes. Sanionia uncinata (Hedw.) Loeske is one of the most dominant moss species in the maritime Antarctic. It has been researched as an important marker for ecological studies and as an extremophile plant for studies on stress tolerance. Here, we report the complete plastome sequence of S. uncinata, which can be exploited in comparative studies to identify the lineage-specific divergence across different species. The complete plastome of S. uncinata is 124,374 bp in length with a typical quadripartite structure of 114 unique genes including 82 unique protein-coding genes, 37 tRNA genes and four rRNA genes. However, two genes encoding the α subunit of RNA polymerase (rpoA) and encoding the cytochrome b6/f complex subunit VIII (petN) were absent. We could identify nuclear genes homologous to those genes, which suggests that rpoA and petN might have been relocated from the chloroplast genome to the nuclear genome.Entities:
Keywords: Antarctic bryophyte; Sanionia uncinata; chloroplast genome; moss; plastome
Mesh:
Year: 2018 PMID: 29494552 PMCID: PMC5877570 DOI: 10.3390/ijms19030709
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Map of the Sanionia uncinata plastome. Complete plastome sequences were obtained from the de novo assembly of Illumina paired-end reads. Genes are color coded by functional group, which are located in the left box. The inner darker gray circle indicates the GC content while the lighter gray corresponds to AT content. IR, inverted repeat; LSC, large single copy region; SSC, small single copy region. Genes shown on the outside of the outer circle are transcribed clockwise and those on the inside counter clockwise. The map was made with OGDraw [31].
Genes present in the S. uncinata plastome.
| Gene Products | Genes |
|---|---|
| Photosystem I | |
| Photosystem II | |
| Cytochrome b6/f | |
| ATP synthase | |
| Translation factor | |
| Chlorophyll biosynthesis | |
| Rubisco | |
| NADH oxidoreductase | |
| Large subunit ribosomal proteins | |
| Small subunit ribosomal proteins | |
| RNAP | |
| Other proteins | |
| Proteins of unknown function | |
| Ribosomal | |
| Transfer RNAs |
Gene containing a single intron; Gene divided into two independent transcription units; Gene containing two introns; Two gene copies in the IRs.
The genes with introns in the S. uncinata plastome and the length of the exons and introns.
| Gene | Location | Length (bp) | ||||
|---|---|---|---|---|---|---|
| Exon I | Intron I | Exon II | Intron II | Exon III | ||
| LSC | 114 | - | 270 | |||
| LSC | 729 | 629 | 780 | |||
| LSC | 106 | 591 | 320 | |||
| LSC | 423 | 789 | 1614 | |||
| LSC | 411 | 654 | 135 | |||
| LSC | 126 | 684 | 228 | 739 | 153 | |
| LSC | 69 | 687 | 291 | 483 | 234 | |
| LSC | 396 | 637 | 438 | |||
| SSC | 556 | 731 | 551 | |||
| LSC | 37 | 2272 | 42 | |||
| LSC | 38 | 262 | 50 | |||
| LSC | 37 | 542 | 37 | |||
| IR | 42 | 769 | 35 | |||
| IR | 38 | 763 | 35 | |||
The codon-anticodon recognition pattern and codon usage for S. uncinata plastome.
| Amino Acid | Codon | No. * | tRNA | Amino Acid | Codon | No. * | tRNA |
|---|---|---|---|---|---|---|---|
| Phe | UUU | 2862 | Tyr | UAU | 1492 | ||
| Phe | UUC | 916 | Tyr | UAC | 578 | ||
| Leu | UUG | 688 | Stop | UAA | 1772 | ||
| Leu | UUA | 1766 | Stop | UAG | 577 | ||
| Leu | CUG | 252 | His | CAU | 550 | ||
| Leu | CUA | 591 | His | CAC | 224 | ||
| Leu | CUU | 741 | Gln | CAA | 685 | ||
| Leu | CUC | 305 | Gln | CAG | 273 | ||
| Ile | AUG | 1582 | Asn | AAU | 1878 | ||
| Ile | AUU | 1943 | Asn | AAC | 665 | ||
| Ile | AUC | 629 | Lys | AAA | 2942 | ||
| Met | AUG | 515 | Lys | AAG | 768 | ||
| Val | GUG | 244 | Asp | GAU | 619 | ||
| Val | GUA | 588 | Asp | GAC | 210 | ||
| Val | GUU | 652 | Glu | GAA | 845 | ||
| Val | GUC | 237 | Glu | GAG | 286 | ||
| Ser | AGU | 579 | Cys | UGU | 515 | ||
| Ser | AGC | 470 | Cys | UGC | 362 | ||
| Ser | UCG | 272 | Stop | UGA | 657 | ||
| Ser | UCA | 643 | Trp | UGG | 455 | ||
| Pro | CCG | 169 | Arg | AGG | 387 | ||
| Pro | CCA | 429 | Arg | AGA | 732 | ||
| Pro | CCU | 423 | Arg | CGG | 164 | ||
| Pro | CCC | 230 | Arg | CGA | 322 | ||
| Thr | ACG | 228 | Arg | CGU | 270 | ||
| Thr | ACA | 508 | Arg | CGC | 143 | ||
| Thr | ACU | 561 | Ser | UCU | 735 | ||
| Thr | ACC | 363 | Ser | UCC | 419 | ||
| Ala | GCG | 155 | Gly | GGG | 426 | ||
| Ala | GCA | 367 | Gly | GGA | 465 | ||
| Ala | GCU | 413 | Gly | GGU | 653 | ||
| Ala | GCC | 514 | Gly | GGC | 237 |
* Numerals indicate the frequency of usage of each codon in 40,330 in codons in 82 potential protein-coding genes.
Figure 2Alignment of complete plastome sequences from six species. Alignment and comparison were performed using mVISTA and the percentage of identity between the plastomes was visualized in the form of an mVISTA plot. The sequence similarity of the aligned regions between S. uncinata and other five species is shown as horizontal bars indicating average percent identity between 50–100% (shown on the y-axis of graph). The x-axis represents the coordinate in the plastome. Genome regions are color-coded for protein-coding (exon), rRNA, tRNA and conserved non-coding sequences (CNS) as the guide at the bottom-left.
Figure 3Phylogenetic tree reconstruction of 23 taxa using MEGA7 based on concatenated sequences of 40 protein-coding genes in the plastome. Maximum likelihood (ML) topology is shown with the bootstrap support values (MP/ML) given at nodes. Forty protein-coding sequences were extracted from annotated plastomes found in GenBank [21] (http://www.ncbi.nlm.nih.gov) (Table S2). The nucleotide sequences for each gene were translated into amino acids, aligned in MEGA7 and manually adjusted. Nucleotide sequences were aligned by constraining them to the amino acid sequence alignment. Individual gene alignments were then assembled into a single dataset.
Gene contents of plastomes from green alga, bryophytes and land plants.
| Plants * | Genome Size (bp) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Alga | 150,613 | - | + | + | + | + | - | - | - | - | |
| Hornwort | 161,162 | + | + | + | + | + | - | Ψ | Ψ | + | |
| Liverwort | 121,024 | + | + | + | + | + | + | + | + | Ψ | |
| Moss | 127,489 | - | - | - | - | - | + | + | + | - | |
| Moss | 122,890 | + | - | - | - | - | + | + | + | - | |
| Moss | 122,630 | - | - | - | - | - | + | + | + | + | |
| Moss | 124,374 | - | - | - | - | - | + | + | + | + | |
| Lycopodiophyta | 154,373 | + | + | + | - | - | + | + | + | + | |
| Moniliformopses | 133,309 | + | + | + | - | - | + | + | + | + | |
| Acrogymnospermae | 116,635 | + | + | + | - | - | - | + | + | + | |
| Magnoliophyta | 154,515 | + | + | + | - | - | - | + | + | - |
* The annotated plastomes are listed in Table S2. The presence ‘+’ or absence ‘-’ of each molecular character and pseudogenes are marked ‘Ψ’.
Figure 4Comparison of the large inversion in the LSC region among six bryophytes plastomes. In comparative LSC region alignment of rpoA, petN coding regions from M. polymorpha, S. uncinata, T. ruralis, T. pellucida and P. patens. The inverted-arrangement of 71 kb fragment was only detected for P. patens.
Figure 5Amino acid alignment of (A) nuc-rpoA and (B) nuc-petN genes of S. uncinata with the nuc-rpoA or cp-rpoA and cp-petN genes from other green plants. Identical amino acid residues are boxed in black, other residues are printed in grey. Signal peptide sequences were predicted using SignalP [45] and shown as double arrow lines and the asterisk.