| Literature DB >> 21478163 |
R Bundschuh1, J Altmüller, C Becker, P Nürnberg, J M Gott.
Abstract
RNAs transcribed from the mitochondrial genome of Physarum polycephalum are heavily edited. The most prevalent editing event is the insertion of single Cs, with Us and dinucleotides also added at specific sites. The existence of insertional editing makes gene identification difficult and localization of editing sites has relied upon characterization of individual cDNAs. We have now determined the complete mitochondrial transcriptome of Physarum using Illumina deep sequencing of purified mitochondrial RNA. We report the first instances of A and G insertions and sites of partial and extragenic editing in Physarum mitochondrial RNAs, as well as an additional 772 C, U and dinucleotide insertions. The notable lack of antisense RNAs in our non-size selected, directional library argues strongly against an RNA-guided editing mechanism. Also of interest are our findings that sites of C to U changes are unedited at a significantly higher frequency than insertional editing sites and that substitutional editing of neighboring sites appears to be coupled. Finally, in addition to the characterization of RNAs from 17 predicted genes, our data identified nine new mitochondrial genes, four of which encode proteins that do not resemble other proteins in the database. Curiously, one of the latter mRNAs contains no editing sites.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21478163 PMCID: PMC3152335 DOI: 10.1093/nar/gkr180
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Transcript map of the mitochondrial genome of P. polycephalum. Red arrows indicate the transcripts detected in our experiment. RNAs described prior to this study are shown as gray boxes; colored boxes indicate mRNAs newly characterized in this work. Transcripts derived from newly identified genes are shaded in pink, while those from previously annotated or predicted genes are shaded in blue and green, respectively. The red asterisks show the approximate positions of extragenic C insertion sites while the red arrow indicates the position of partial editing.
Number of editing sites known before our study, found in our study and total number, separated by editing type
| Type | Known before | This study | Total |
|---|---|---|---|
| I. Mononucleotide insertions | |||
| C | 511 | 744 | 1255 |
| U | 19 | 24 | 43 |
| G | 0 | 2 | 2 |
| A | 0 | 1 | 1 |
| II. Dinucleotide insertions | |||
| AA | 4 | 0 | 4 |
| UU | 2 | 0 | 2 |
| UG/GU | 3 | 1 | 4 |
| UC/CU | 8 | 1 | 9 |
| UA | 1 | 1 | 2 |
| GC/CG | 1 | 1 | 2 |
| III. Nucleotide deletions | |||
| A deletion | 3 | 0 | 3 |
| IV. Nucleotide conversions | |||
| U→G | 1 | 0 | 1 |
| C→G | 1 | 0 | 1 |
| C→U | 4 | 0 | 4 |
| Total | 558 | 775 | 1333 |
Note that the sequence context of the UG/GU, UC/CU and GC/CG dinucleotide insertions makes the insertion order ambiguous. For instance, all of the UC/CU editing sites occur next to an encoded C. Thus, the resulting RNA sequence could be due to either a CU insertion (CUC) or a UC insertion (CUC). The sequence context of dinucleotide editing sites is discussed further later in the text.
Figure 2.Acceptor stem of tRNA-Lys (a) according to (28) and (b) taking into account the two additional C insertions (arrows) and the 5′- and 3′-ends identified in this study.
Figure 3.(a) Sequencing trace of bulk RT–PCR product encompassing the intergenic region between tRNA-Met2 and tRNA-Lys. Sequences at the 3′-end of tRNA-Met2 are indicated by a green line; the 5′ portion of tRNA-Lys is indicated by a red line. Genomic position 53352 where the high-throughput sequencing reads identify a C insertion site with partial editing is indicated by a blue asterisk. Note that the sequencing trace doubles starting at this position, reflecting the mixture of edited and unedited sequences at this site. (b) Sequencing trace of bulk PCR product encompassing the same region as in (a).
Figure 4.Deviations from expected nucleotide frequencies in the vicinity of unambiguous C insertion sites. (a), (b) and (c) correspond to editing sites in protein coding genes at the first, second and third codon position, respectively; (d) corresponds to editing sites in the structural RNAs. The relative size of the letters in one position indicates how much enhanced or suppressed the nucleotide is compared to the background distribution. The total height of the stack at a position is the total amount of information encoded by the nucleotide distribution at that position in bits. The dashed lines indicate the border of statistical significance given the total numbers of observed sites of each kind.
Read counts for all eight combinations of three neighboring C to U editing sites in the cox1 mRNA
| 26826 | 26824 | 26823 | Read count | Percentage |
|---|---|---|---|---|
| U | U | U | 851 | 90.8 |
| U | U | C | 2 | 0.2 |
| U | C | U | 8 | 0.9 |
| U | C | C | 30 | 3.2 |
| C | U | U | 31 | 3.3 |
| C | U | C | 0 | 0.0 |
| C | C | U | 0 | 0.0 |
| C | C | C | 15 | 1.6 |