| Literature DB >> 27503298 |
Monique Turmel1, Christian Otis1, Claude Lemieux2.
Abstract
To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G planctonica and 262,888-bp G sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming.Entities:
Keywords: inverted repeat; mitochondrial genome; plastid genome; promiscuous DNA; repeated sequences; retrohoming
Mesh:
Substances:
Year: 2016 PMID: 27503298 PMCID: PMC5630911 DOI: 10.1093/gbe/evw190
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FMitochondrial gene content and organization in Gloeotilopsis. (A) Gene map of the G. planctonica mitochondrial genome. Filled boxes on the gene map represent genes, with colors denoting gene categories as indicated in the legend. Genes on the outside are transcribed counterclockwise, whereas those on the inside are transcribed clockwise. Thick lines in the innermost ring represent the gene clusters shared with Pseudendoclonium; those in the second outermost inner ring represent the gene clusters shared with G. sarcinoidea. (B) Comparison of gene content among ulvophycean mtDNAs. The presence of a gene is denoted by a blue box.
General Features of Gloeotilopsis and Other Ulvophycean Mitochondrial Genomes
| Taxon | Accession | Size | A+T | Genes | Introns (no.) | Repeats | |
|---|---|---|---|---|---|---|---|
| (bp) | (%) | (no.) | GI | GII | (%) | ||
|
| |||||||
|
| NC_008256 | 56,761 | 66.6 | 54 | 2 | 1 | 8.9 |
|
| |||||||
|
| NC_028538 | 63,845 | 66.0 | 58 | 7 | 2 | 3.0 |
|
| NC_029701 | 70,858 | 65.4 | 58 | 9 | 4 | 3.5 |
|
| KP720617 | 73,493 | 67.8 | 58 | 8 | 2 | 2.3 |
|
| NC_028081 | 61,614 | 67.5 | 58 | 3 | 1 | 1.4 |
|
| |||||||
|
| NC_005926 | 95,880 | 60.7 | 57 | 7 | 0 | 13.0 |
|
| KX306823 | 105,236 | 65.3 | 58 | 10 | 1 | 6.6 |
|
| KX306822 | 85,108 | 66.2 | 58 | 5 | 0 | 3.9 |
Intronic genes and freestanding ORFs not usually found in green plant mitochondrial genomes are not included in these values. Duplicated genes were counted only once.
Number of group I (GI) and group II (GII) introns is given.
Non-overlapping repeat elements were mapped on each genome with RepeatMasker using as input sequences the repeats of at least 30 bp identified with REPuter.
FGroup I and group II introns in Gloeotilopsis and other ulvophycean mtDNAs. A grey box represents an intron lacking an ORF; a green box indicates that the intron encodes a LAGLIDADG homing endonuclease, whereas a blue box indicates that the intron encodes a reverse transcriptase (RT) with or without H-N-H endonuclease and/or intron maturase domains. Arrows denote the intron insertion sites that have not been observed in other classes of the Chlorophyta. Intron insertion sites in protein-coding and tRNA genes are given relative to the corresponding genes in Mesostigma viride mtDNA (Turmel et al. 2002); insertion sites in rrs and rrl are given relative to the Escherichia coli 16S and 23S rRNAs, respectively. For each insertion site, the position corresponding to the nucleotide immediately preceding the intron is reported.
General Features of Gloeotilopsis and Other Ulvophycean Chloroplast Genomes
| Taxon | Accession | Size (bp) | A+T | Genes | Introns (no.) | Repeats | ||
|---|---|---|---|---|---|---|---|---|
| Genome | IR | (%) | (no.) | GI | GII | (%) | ||
|
| ||||||||
|
| NC_013359 | 153,429 | – | 66.9 | 108 | 6 | 6 | 9.9 |
|
| NC_026795 | 106,859 | – | 69.2 | 108 | 7 | 6 | 2.4 |
|
| NC_026796 | 105,200 | – | 67.2 | 109 | 8 | 3 | 0.4 |
|
| ||||||||
|
| NC_008099 | 151,933 | 18,510 | 59.5 | 104 | 5 | 0 | 11.1 |
|
| ||||||||
|
| KP720616 | 99,983 | – | 74.7 | 100 | 4 | 1 | 0.5 |
|
| NC_029040 | 96,005 | – | 75.1 | 100 | 4 | 1 | 0.5 |
|
| ||||||||
|
| NC_008114 | 195,867 | 6,039 | 68.5 | 105 | 27 | 0 | 5.3 |
|
| KX306824 | 221,431 | – | 68.5 | 104 | 14 | 17 | 3.7 |
|
| KX306821 | 262,888 | – | 68.5 | 104 | 15 | 12 | 11.6 |
Intronic genes and freestanding ORFs not usually found in green plant chloroplast genomes are not included in these values. Duplicated genes were counted only once.
Number of group I (GI) and group II (GII) introns is given.
Non-overlapping repeat elements were mapped on each genome with RepeatMasker using as input sequences the repeats of at least 30 bp identified with REPuter.
This value includes four pseudogenes (rpoB, rpoC1, ycf20 and ycf47).
FChloroplast gene content and organization in Gloeotilopsis. (A) Gene map of the G. sarcinoidea chloroplast genome. Filled boxes on the gene map represent genes, with colors denoting gene categories as indicated in the legend. Duplicated gene sequences are denoted with filled circles and sequences of mitochondrial origin are highlighted in yellow. Genes on the outside are transcribed counterclockwise, whereas those on the inside are transcribed clockwise. Thick lines in the innermost ring represent the gene clusters shared with Pseudendoclonium; those in the second outermost inner ring represent the gene clusters shared with G. sarcinoidea. (B) Comparison of gene content in ulvophycean cpDNAs. Genes and pseudogenes are shown in dark and light blue, respectively. Only the conserved genes showing a variable distribution are indicated. All compared genomes share the following set of 95 genes: accD, atpA, B, E, F, H, I, ccsA, cemA, chlI, clpP, ftsH, infA, petA, B, D, G, L, psaA, B, C, I, J, M, psbA, B, C, D, E, F, H, I, J, K, L, M, N, T, Z, rbcL, rpl2, 5, 14, 16, 19, 20, 23, 32, 36, rpoA, C2, rps2, 3,4, 7, 8, 9, 11, 12, 14, 18, 19, rrf, rrl, rrs, tufA, ycf1, 3, 4, 12, trnA(ugc), C(gca), D(guc), E(uuc), F(gaa), G(gcc), G(ucc), H(gug), I(gau), K(uuu), L(uaa), L(uag), Me(cau), Mf(cau), N(guu), P(ugg), Q(uug), R(acg), R(ucu), S(gcu), S(uga), T(ugu), V(uac), W(cca), Y(gua). (C) Alignment of the G. sarcinoidea and G. planctonica cpDNA regions extending from rrs to petD. Sequence identity is denoted by the grey scale.
FModel of IR loss from the chloroplast genome based on the comparison of the Pseudendoclonium and G. planctonica cpDNAs. (A) Comparison of gene order and gene content in the regions surrounding the genes making up the rRNA operon. Shown in red are the genes from the rRNA operon that are missing in Gloeotilopsis. (B) Model of IR loss proposed for the Ulotrichales. Genes differentially lost from the two copies of the rRNA operon are shown in red.
FPhylogenetic relationships between the mitochondrial-like sequences in G. sarcinoidea cpDNA and the corresponding genes of green plant mtDNAs. The RAxML trees were inferred using the GTR + Γ4 model. Branches that received ≥ 80% bootstrap support are denoted by thick lines. The names of the G. sarcinoidea mitochondrial-like cpDNA sequences are highlighted in red.
FGroup I introns in Gloeotilopsis and other ulvophycean cpDNAs. A grey box represents an intron lacking an ORF, whereas a colored box represents an intron containing an ORF (see the color code for the type of intron-encoded protein). Arrows denote the intron insertion sites that have not been observed in other groups of chlorophytes. Intron insertion sites in protein-coding and tRNA genes are given relative to the corresponding genes in M. viride cpDNA (Lemieux et al. 2000); insertion sites in rrs and rrl are given relative to E. coli 16S and 23S rRNAs, respectively.
FAnalysis of group II introns in Gloeotilopsis cpDNAs. (A) Comparison of their insertion sites and ORFs with those of group II introns in other ulvophycean cpDNAs. A grey box represents an intron lacking an ORF, whereas a colored box represents an intron containing an ORF (see the color code for the domains present in the encoded protein: RT, reverse transcriptase; X, intron maturase; EN, H-N-H endonuclease). Arrows denote the intron insertion sites that have not been observed in other groups of chlorophytes. Intron insertion sites in protein-coding and tRNA genes are given relative to the corresponding genes in M. viride cpDNA (Lemieux et al. 2000); insertion sites in rrs and rrl are given relative to E. coli 16S and 23S rRNAs, respectively. For each insertion site, the position corresponding to the nucleotide immediately preceding the intron is reported. Note that a few introns were not correctly annotated in the GenBank accessions of Bryopsidales and Ulvales and that changes related to intron types and insertion positions were introduced during this study. (B) Phylogenetic relationships among Gloeotilopsis group II introns. This tree was inferred by RAxML analysis of an alignment of 286 nucleotides corresponding to the core secondary structures of the group II introns. Branches that received ≥ 78% bootstrap support are denoted by thick lines (the actual bootstrap values are shown). Gp, G. planctonica; Gs, G. sarcinoidea.
FConsensus secondary structure model of the nine clade-I group II introns and insertion sites of introns from clades I, II, and III. The consensus secondary structure model is displayed according to Toor et al. (2001). Highly conserved (in seven or more introns) and less conserved (in at least five introns) nucleotide positions are shown in uppercase and lowercase characters respectively; the other residues are denoted by dots. Highly conserved (in seven or more introns) and less conserved base pairings (in at least five introns) are denoted by thick and thin dashes respectively. Roman numbers specify the major structural domains, with long-range tertiary interactions being denoted by Greek letters. Variations in size of peripheral regions are indicated by numbers inside the loops. The 5' and 3' exon sequences flanking the introns from clades I, II, and III are aligned along with the intron sequences spanning the exon-binding sites EBS1, EBS2, and EBS3. Colors highlight EBS sequences and complementary nucleotide residues in the IBS exon sequences.