| Literature DB >> 24386369 |
Jean-François Pombert1, Christian Otis2, Monique Turmel2, Claude Lemieux2.
Abstract
Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications of this interesting observation for trans-splicing of group I introns.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24386369 PMCID: PMC3873408 DOI: 10.1371/journal.pone.0084325
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Physical map of the Prasinoderma mitochondrial genome.
Genes and introns are represented by filled boxes. Genes on the outside are transcribed clockwise, whereas genes on the inside are transcribed counterclockwise. Colors are attributed according to the categories in the inner legend; TS group I introns refers to trans-spliced group I introns. Outermost inner ring: Inverted repeats and single copy regions are displayed by thick and thin black arcs, respectively. Color-coded middle rings: Syntenies between Prasinoderma and other prasinophyte mtDNAs. From the inside to the outside are shown the comparisons with Ostreococcus, Micromonas, Nephroselmis and Pycnococcus. Highly conserved clusters are shown in cyan and magenta, whereas clusters conserved only between Prasinoderma and one other prasinophyte are shown in green, purple or orange. Regions featuring no conserved clusters are shown in light gray. Inner ring: G+C percentages calculated with OGDRAW [45]; light gray, A+T; dark gray, G+C.
Main features of the prasinophyte mitochondrial genomes compared in this study.
|
|
|
|
|
| |
| Genome feature | II | II | III | V | VI |
| Size (bp) | 44,237 | 47,425 | 45,223 | 24,321 | 54,546 |
| % A+T | 61.8 | 65.4 | 67.2 | 62.2 | 45.8 |
| Inverted repeat | + | + | − | − | + |
| Size (bp) | 9,772 | 12,234 | 14,364 | ||
| % Genome | 44.2 | 51.6 | 54.2 | ||
| Genes | |||||
| Number | 63 (78) | 63 (79) | 66 (70) | 36 (36) | 55 (74) |
| % Genome | 89.9 | 84.8 | 80.2 | 87.7 | 73.8 |
| Introns | |||||
| Group I | 0 (0) | 0 (0) | 4 (0) | 0 (0) | 0 (2) |
| Group II | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) |
| Repeats | |||||
| Number | 34 | 0 | 42 | 242 | 2141 |
| % Genome | 0.1 | 0 | 0.1 | 1.0 | 3.9 |
| Accession Number | NC_008290 | NC_012643 | NC_008239 | NC_013935 | KF387569 |
The Roman number refers to the phylogenetic clade identified by Guillou et al [18].
The total number of unique genes is indicated outside the parentheses. The number inside the parentheses includes duplicated genes and intronic ORFs.
Conserved genes (unique and duplicated), introns and intronic ORFs were taken into account.
The total numbers of cis-spliced and trans-spliced introns are indicated outside and inside the parentheses, respectively.
Nonoverlapping repeat elements were mapped on each genome with RepeatMasker using the repeats ≥30 bp identified with REPuter.
Gene repertoires of prasinophyte mtDNAs.
|
|
|
|
|
| |
| Gene | II | II | III | V | VI |
|
| + | − | + | − | − |
|
| + | + | + | − | − |
|
| + | + | + | − | + |
|
| + | + | + | − | − |
|
| + | + | + | − | + |
|
| + | + | + | − | − |
|
| − | − | + | − | − |
|
| + | + | + | − | + |
|
| + | + | + | − | + |
|
| − | − | + | − | − |
|
| + | + | + | − | + |
|
| + | + | + | − | + |
|
| + | + | + | − | − |
|
| + | + | + | − | − |
|
| + | + | + | − | + |
|
| + | + | + | − | − |
|
| + | + | + | − | + |
|
| + | + | + | − | + |
|
| + | + | + | − | + |
|
| + | + | + | − | + |
|
| − | − | + | − | − |
|
| − | − | − | − | + |
|
| + | + | + | − | + |
|
| + | + | + | − | + |
|
| + | + | − | − | − |
|
| + | + | + | − | + |
|
| − | + | + | − | − |
|
| + | + | + | − | + |
|
| − | − | − | − | + |
|
| + | + | − | − | − |
|
| + | + | + | − | − |
|
| + | + | + | − | − |
|
| − | − | + | − | + |
|
| + | + | + | − | − |
|
| + | + | + | − | + |
Only the genes that are missing in one or more genomes are indicated. A total of 35 genes are shared by all compared mtDNAs: atp6, 8, 9, cob, cox1, 2, 3, nad1, 2, 3, 4, 4L, 5, 6, rnl, rns, rps3, 4, 12, trnC(gca), D(guc), E(uuc), H(gug), I(gau), L(uag), Me(cau), Mf(cau), N(guu), P(ugg), Q(uug), S(gcu), S(uga), V(uac), W(cca), Y(gua).
The Roman number refers to the phylogenetic clade identified by Guillou et al [18].
Figure 2Small repeats identified in three of the four Prasinoderma mitochondrial intergenic regions bordered by rnl intron fragments.
The figure shows the repeated sequences ≥15 bp located in each of these intergenic regions (top sequence) as well as elsewhere in the Prasinoderma mtDNA. The latter repeats are aligned against the intergenic sequence found in copy A of the large inverted repeat (positions 17,080 to 31,443), with the numbers indicating the coordinates of the genome sequence corresponding to the 5′ ends of the repeats. A plus or minus sign is used to indicate the DNA strand containing each repeat, with the plus sign denoting the strand whose sequence is reported in GenBank accession KF387569. To simplify the figure, the repeats located in copy B of the inverted repeat (positions 40,182 to 54,545) were not shown. Numbered arrows indicate the repeats (8 to 18 bp) present in multiple copies within each intergenic region as well as the repeats shared between the intergenic regions bordered by Pr.rnl.2 fragments. Repeats ≥15 bp were detected using the REPuter 2.74 program [44] with the options -f -p -l 15 -allmax; no repeats were found in the intergenic region delimited by 5′ Pr.rnl.1 and rns.
Figure 3Prasinoderma Pr.rnl.1 intron and its relatives.
Comparison of the predicted secondary structure of Pr.rnl.1 (A) with the consensus structure derived from organellar cis-spliced introns at the same cognate site (B). This consensus was generated using the structures of the 21 site-1931 mitochondrial and plastid introns in the Group I Intron Sequence and Structure Database [33]. Introns are displayed according to Burke et al [46]. Highly conserved residues (in all 21 introns) and less conserved residues (in 15 to 20 introns) are shown in uppercase and lowercase characters, respectively; the other residues are represented by dots. Conserved base-pairings in all introns and in 15 to 20 introns are denoted by thick and thin dashes, respectively. The P9.0 base-pairing is represented according to Cech [15]. Numbers inside the loops indicate the size variations of these loops. Splice sites between intron and exon junctions are indicated by arrows.
Figure 4Prasinoderma Pr.rnl.2 intron and its relatives.
Comparison of the predicted secondary structure of Pr.rnl.2 (A) with the consensus structure derived from organellar cis-spliced introns at the same cognate site (B). This consensus was generated using the structures of the 22 site-2500 mitochondrial and plastid introns in the Group I Intron Sequence and Structure Database [33]. Introns are displayed according to Burke et al [46]. Highly conserved residues (in all 22 introns) and slightly less conserved residues (in 16 to 21 introns) are shown in uppercase and lowercase characters, respectively; the other residues are represented by dots. Conserved base-pairings in all introns and in 16 to 21 introns are denoted by thick and thin dashes, respectively; the others are represented by dots. The P9.0 base-pairing is represented according to Cech [15]. Arrowheads point to sites of insertions/deletions. Numbers inside the loops indicate the size variations of these loops. Splice sites between intron and exon junctions are indicated by arrows.
Figure 5Confirmation of group I intron trans-splicing by RT-PCR analysis.
(A) Genomic configuration of the rnl exons in Prasinoderma mtDNA. Trans-spliced group I intron sequences are shown as black-to-gray gradient boxes. Primer locations are indicated by numbered arrows (see methods for primer sequences); the numbers in parentheses denote the nucleotide positions corresponding to the 5’ ends of the primers on the predicted rnl gene product, i.e. the RNA species derived from the three rnl exon sequences. Coding regions shown above or below the horizontal line are transcribed to the right or to the left, respectively. (B) Electrophoretic analysis of PCR products. PCR assays were carried out on cDNA or genomic DNA (gDNA), with the numbers above the gel lanes indicating the combinations of primers used. The sizes of the amplicons derived from the PCR assays on cDNA are entirely consistent with the hypothesis that two events of trans-splicing must occur to produce the large subunit RNA sequence. The results obtained for the two PCR assays on gDNA are also those expected: the assay using primers 3 and 4 yielded an amplicon with the size predicted by the genome map, whereas the assay using primers 1 and 5 produced no amplicon because both primers point toward the same direction. The identities of all amplicons were confirmed by DNA sequencing.
Figure 6Intron insertion sites in the mitochondrial rnl genes of selected green algae.
Cis- and trans-spliced introns are shown by triangles and broken triangles, respectively. When present, intronic ORFs are shown by filled triangles. Intron insertion sites are given relative to the E. coli 23S rRNA; for each site, the position corresponding to the nucleotide immediately preceding the intron is reported. Accession numbers for the rnl sequences represented are as follows: Prasinoderma coloniale, KF387569; Monomastix sp, KF060939; Nephroselmis olivacea, NC_008239; Oltmannsiellopsis viridis, NC_008256; Pseudendoclonium akinetum, NC_005926; Chlamydomonas eugametos, NC_001872; Dunaliella salina, NC_012930; Mesostigma viride, NC_008240; and Chara vulgaris, NC_005255.
Previously reported trans-spliced group I introns in cox1 that have cis-spliced relatives containing an ORF at cognate sites.
|
|
| |||||
| Taxon | Accession | Breakpoint | Taxon | Accession | ORF location | ORF type |
|
| ||||||
|
| NC_017841 | L8 |
| NC_001613 | L8 | LAGLIDADG |
|
| NC_009630 | L8 | LAGLIDADG | |||
|
| NC_017598 | L8 | LAGLIDADG | |||
|
| ||||||
|
| NC_008151 | L8 |
| KF060942 | L8 | LAGLIDADG |
|
| NC_016684 | L8 |
| X00790 | L8 | LAGLIDADG |
|
| NC_016985 | L8 |
| NC_001326 | L8 | LAGLIDADG |
|
| ||||||
|
| FJ390841 | L9 |
| Y13644 | L9 | GIY-YIG |
|
| JF338145 | L9 |
When multiple introns are present, a number following the taxon name designates the intron number.
Each of the three intron insertion sites is given relative to the cox1 gene of Mesostigma viride; for each site, the position immediately preceding the intron is given.