| Literature DB >> 27399202 |
Laura E Kirby1, Yanni Sun2, David Judah3, Scooter Nowak4, Donna Koslowsky1.
Abstract
The mitochondrial genome of Trypanosoma brucei contains many cryptogenes that must be extensively edited following transcription. The RNA editing process is directed by guide RNAs (gRNAs) that encode the information for the specific insertion and deletion of uridylates required to generate translatable mRNAs. We have deep sequenced the gRNA transcriptome from the bloodstream form of the EATRO 164 cell line. Using conventionally accepted fully edited mRNA sequences, ~1 million gRNAs were identified. In contrast, over 3 million reads were identified in our insect stage gRNA transcriptome. A comparison of the two life cycle transcriptomes show an overall ratio of procyclic to bloodstream gRNA reads of 3.5:1. This ratio varies significantly by gene and by gRNA populations within genes. The variation in the abundance of the initiating gRNAs for each gene, however, displays a trend that correlates with the developmental pattern of edited gene expression. A comparison of related major classes from each transcriptome revealed a median value of ten single nucleotide variations per gRNA. Nucleotide variations were much less likely to occur in the consecutive Watson-Crick anchor region, indicating a very strong bias against G:U base pairs in this region. This work indicates that gRNAs are expressed during both life cycle stages, and that differential editing patterns observed for the different mitochondrial mRNA transcripts are not due to the presence or absence of gRNAs. However, the abundance of certain gRNAs may be important in the developmental regulation of RNA editing.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27399202 PMCID: PMC4939953 DOI: 10.1371/journal.pntd.0004793
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Differences in mitochondrial transcript abundance, polyadenylation and the extent of RNA editing in two life cycle stages of T. brucei.
| No. of uridines | Relative level of mature RNA | PolyA tail length | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Added | Deleted | Edited size (nt) | Stage Edited | Long Slender | Short Stumpy | Procyclic | Bloodstream | Procyclic | Number of PC major classes | References |
| 12S | 2–17 (tail) | 0 | 1149 | N.D. | 0.04 | 1.3 | 1.0 | N.A. | N.A. | N.A. | [ |
| 9S | 7 (tail) | 0 | 611 | N.D. | 0.07 | 1.4 | 1.0 | N.A. | N.A. | N.A. | [ |
| CYb | 34 | 0 | 1,151 | P | ~0 | 0.5 | 1.0 | Short (UE | Short (E) & Long (E) | 11 | [ |
| A6 | 448 | 28 | 821 | P/BS | 1.0 | N.D. | 1.0 | Short (E) & Long (E) | Short (E) & Long (E) | 81 | [ |
| COI | 0 | 0 | 1,647 | NE | 0.07 | 0.4 | 1.0 | Short | Short & Long | N.A. | [ |
| COII | 4 | 0 | 663 | P | ~0 | 0.5 | 1.0 | Short | Short (UE) & Long (E) | N.A. | [ |
| COIII | 547 | 41 | 969 | P/BS | 1.0 | N.D. | 1.0 | Short (E) | Short(E) & Long(E) | 151 | [ |
| ND1 | 0 | 0 | 960 | NE | ~1 | N.D. | ~1 | Short & Long | Short & Long | N.A. | [ |
| ND2 | 0 | 0 | 1343 | NE | >1.0 | N.D. | 1.0 | Short & Long | Short & Long | N.A. | [ |
| ND3 | 210 | 13 | 452 | P/BS | >1.0 | N.D. | 1.0 | Short (E) | Short (E) | 34 | [ |
| ND4 | 0 | 0 | 1314 | NE | ~1.0 | N.D. | ~1.0 | Short & Long | Short & Long | N.A. | [ |
| ND5 | 0 | 0 | 1,779 | NE | 0.5 | 0.8 | 1.0 | N.D. | N.D. | N.A. | [ |
| ND7 | 553 | 89 | 1,238 | 5’P/BS, 3’BS | ~10 | N.D. | 1.0 | Short(E) | Short(UE) | 129 | [ |
| ND8 | 259 | 46 | 574 | BS | ~20 | N.D. | 1.0 | Short (E) & Long (E) | Short (E) | 70 | [ |
| ND9 | 345 | 20 | 649 | BS | >1.0 | N.D. | 1.0 | Short (PE) & Long (E) | Short (PE) | 39 | [ |
| RPS12 | 132 | 28 | 325 | BS | >1.0 | N.D. | 1.0 | Short(UE & E) & Long (E) | Short(UE & E) & Long (E) | 50 | [ |
| Murf 2 | 26 | 4 | 1,111 | P/BS | ~1 | ~1 | ~1 | Short & Long | Short & Long | 1 | [ |
| Murf 5 | N.D. | N.D. | N.D. | N.D. | N.D. | N.D. | N.D. | N.D. | N.D. | N.A. | N.D. |
| CR3 | 148 | 13 | 299 | BS | >1.0 | N.D. | 1.0 | N.D. | N.D. | 37 | [ |
| CR4 | 325 | 40 | 567 | BS | 1.0 | N.D. | ~0 | Short (E) & Long (E) | Short (UE) | 41 | [ |
aP, transcript is edited only in the procyclic (insect) developmental stage.
bP/BS, transcript is edited in both bloodstream and procyclic stages.
cNE, never edited, editing of these transcripts has not been reported.
dThe ND7 transcript is differentially edited in the procyclic and bloodstream stages.
eBS These transcripts are only fully edited in the bloodstream developmental stage [36].
fN.D. Values have not yet been determined.
gAll data comes from EATRO 164 procyclic gRNA transcriptome previously published [20].
hUE, unedited, the transcripts which carried these tails were typically unedited.
iE, edited, the transcripts carrying these tails were typically edited.
jN.A., Not applicable.
kCOII is a cis-edited transcript. Poly-A tails listed as short are between 10 and 50 nts long and tails listed as long are between 150 and 200 nts long.
Number of gRNA transcripts in procyclic and bloodstream major classes and ratio of procyclic transcripts to bloodstream transcripts for each gene.
| Gene | Bloodstream gRNA Reads | Procyclic gRNA Reads | Ratio of PC to BS Reads |
|---|---|---|---|
| A6 | 41,628 | 266,532 | 6.40 |
| COIII | 371,139 | 948,845 | 2.56 |
| CR3 | 13,316 | 236,808 | 17.78 |
| CR4 | 25,753 | 51,979 | 2.02 |
| CYb | 11,022 | 31,622 | 2.87 |
| MurfII | 157 | 2,605 | 16.59 |
| ND3 | 13,567 | 75,739 | 5.58 |
| ND7 | 291,927 | 702,061 | 2.40 |
| ND8 | 112,868 | 584,639 | 5.18 |
| ND9 | 83,924 | 72,027 | 0.86 |
| RPS12 | 17,191 | 403,131 | 23.45 |
| Total | 982,492 | 3,375,988 | 3.44 |
Fig 1The abundance of the initiating gRNA of all edited mRNAs in each stage.
mRNAs to the left of the dashed line are constitutively edited or are edited only in the procyclic stage [16,17,27,33]. mRNAs to the right of the dashed line are only fully edited or more abundant fully edited in the bloodstream stage [11–15].
Summary of the gRNA data coverage for each gene.
| Populations | Unique Populations | Average | Gaps | Weak Overlaps | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Gene | BS | PC | BS | PC | BS | PC | BS | PC | BS | PC |
| A6 | 29 | 28 | 1 | 0 | 18 | 20 | 1 | 0 | 0 | 0 |
| COIII | 42 | 39 | 4 | 1 | 19 | 22 | 0 | 0 | 0 | 0 |
| CR3 | 9 | 9 | 0 | 0 | 19 | 14 | 1 | 0 | 0 | 2 |
| CR4 | 16 | 18 | 0 | 2 | 17 | 18 | 2 | 0 | 0 | 0 |
| CYb | 2 | 2 | 0 | 0 | 12 | 14 | 0 | 0 | 0 | 0 |
| MurfII | 1 | 1 | 0 | 0 | N.A. | N.A. | 1 | 1 | 0 | 0 |
| ND3 | 12 | 12 | 0 | 0 | 15 | 15 | 1 | 1 | 0 | 0 |
| ND7 | 45 | 48 | 2 | 5 | 17 | 21 | 7 | 2 | 4 | 1 |
| ND8 | 20 | 21 | 2 | 3 | 17 | 21 | 2 | 1 | 1 | 0 |
| ND9 | 24 | 23 | 1 | 0 | 16 | 16 | 1 | 0 | 0 | 2 |
| RPS12 | 11 | 13 | 0 | 2 | 17 | 21 | 2 | 0 | 1 | 0 |
| Total | 211 | 214 | 10 | 13 | 17 | 19 | 18 | 5 | 6 | 5 |
aThe average gRNA overlaps were determined excluding any regions where neighboring gRNAs shared no overlap.
Fig 2The frequency of nt variations versus nucleotide position in the gRNA.
Gray bars indicate the number of gRNA sequence classes with an identified nt difference between related procyclic and bloodstream gRNAs. Nucleotide numbering for each gRNA was normalized by setting the start of the Watson-Crick anchor region to zero. Black bars indicate the number of gRNA sequence classes whose contiguous Watson-Crick anchors end at that position (start of Watson-Crick = zero, so this is an indication of the length of the contiguous Watson-Crick region).
Fig 3Comparing the number of non-complementary nucleotides 5’ of the anchoring region (A) or 3’ of the guiding region (excluding the U-tail) (B) in procyclic and bloodstream gRNAs.
Fig 4Length of gRNA complementarity (including anchors) to fully edited mRNAs for both bloodstream and procyclic gRNAs.
Most common gRNA transcription start sites in procyclic and bloodstream data.
| Percentage of Sequence Classes | Percentage of Transcripts | |||
|---|---|---|---|---|
| Initiating Sequence | Bloodstream | Procyclic | Bloodstream | Procyclic |
| ATATAT | 32.20% | 35.20% | 33.60% | 37.40% |
| ATATAA | 20.00% | 21.10% | 17.90% | 24.70% |
| AAAAAA | 3.60% | 4.60% | 1.60% | 1.30% |
| ATATAC | 3.80% | 4.30% | 1.50% | 3.80% |
| ATACAA | 2.40% | 2.80% | 1.30% | 1.70% |
| ATATTA | 4.90% | 2.60% | 13.40% | 0.90% |
| ATATAG | 2.20% | 2.60% | 0.70% | 7.60% |
| ATAAAT | 2.50% | 2.60% | 1.00% | 0.70% |
| ATACAT | 2.70% | 2.20% | 3.20% | 2.20% |
| ATAAAA | 1.70% | 2.10% | 0.20% | 1.10% |
Fig 5The percentage of different nucleotide overlaps found between adjacent gRNAs.
gRNAs were aligned to their fully edited mRNA sequence and the number of mRNA nts with complementarity to both adjacent gRNAs determined.
Identified gaps or weak overlaps (less than 6 nucleotides) between populations of gRNAs observed in both data sets.
| Gene | Stage Missing Coverage | Range | Gap or Overlap | Abundance of Equivalent gRNA |
|---|---|---|---|---|
| A6 | Bloodstream | 669–670 | 2 nt Gap | 39,063 |
| CR3 | BS/P | 233/226-230 | 1 nt G/5 nt O | Missing in Both Stages |
| CR3 | Procyclic | 278–292 | 15 nt Gap | 125 |
| CR4 | Bloodstream | 143–165 | 23 nt Gap | 7,175 |
| CR4 | Bloodstream | 302–306 | 5 nt Gap | 643 |
| MurfII | BS/P | 80–85 | 6 nt Gap | Missing in Both Stages |
| ND3 | BS/P | 389–401 | 13 nt Gap | Missing in Both Stages |
| ND7 | BS/P | 92–94 | 3 nt Gap | Missing in Both Stages |
| ND7 | Bloodstream | 95–120 | 26 nt Gap | 1 |
| ND7 | Bloodstream | 292–293 | 2 nt Overlap | 3,259 |
| ND7 | Bloodstream | 325–326 | 2 nt Gap | 888 |
| ND7 | BS/P | 485–486 | 0 nt Overlap | Missing in Both Stages |
| ND7 | Bloodstream | 1000–1000 | 1 nt Overlap | 101 |
| ND7 | Bloodstream | 1079–1085 | 7 nt Gap | 44 |
| ND7 | BS/P | 1086/1086-1088 | 1 nt G/3 nt G | Missing in Both Stages |
| ND7 | Bloodstream | 1225–1232 | 8 nt Gap | 123 |
| ND7 | Bloodstream | 1269–1270 | 2 nt Overlap | 63 |
| ND8 | Bloodstream | 54–56 | 3 nt Overlap | 1 |
| ND8 | Bloodstream | 153–159 | 7 nt Gap | 4 |
| ND8 | Bloodstream | 386–389 | 4 nt Gap | 2 |
| ND8 | Procyclic | 541–553 | 13 nt Gap | 413 |
| ND9 | Procyclic | 238–242 | 5 nt Overlap | 652 |
| ND9 | Bloodstream | 340–342 | 3 nt Gap | 36 |
| ND9 | Procyclic | 609–612 | 4 nt Overlap | 7 |
| RPS12 | Bloodstream | 122–132 | 11 nt Gap | 3 |
| RPS12 | Bloodstream | 156–158 | 3 nt Overlap | 62 |
| RPS12 | Bloodstream | 337–349 | 13 nt Gap | 128 |
aIndicates that the region of poor coverage was identified in both data sets.
Range is respective nucleotides in the fully edited mRNA. Both nucleotides and deletion sites in the fully edited mRNA were numbered starting from the 5’ end (see S2 Fig for numbered sequences).
Summary of populations found in both data sets that have more reads in the bloodstream data set than in the procyclic data set.
| Gene | PC and BS Shared Populations | Populations more abundant in BS | Percentage of populations more abundant in BS |
|---|---|---|---|
| A6 | 28 | 6 | 21% |
| COIII | 38 | 12 | 32% |
| CR3 | 9 | 5 | 56% |
| CR4 | 16 | 10 | 63% |
| CYb | 2 | 0 | 0% |
| MurfII | 1 | 0 | 0% |
| ND3 | 12 | 6 | 50% |
| ND7 | 43 | 20 | 47% |
| ND8 | 18 | 8 | 44% |
| ND9 | 23 | 16 | 70% |
| RPS12 | 11 | 4 | 36% |
| Total | 201 | 87 | 43% |
Fig 6Alignment of conventional ATPase 6 protein sequence to hypothetical proteins generated by the 11U alternative edited mRNA and the 4U alternatively edited mRNA.
Double underlined residues show where the alternative sequences differ from the conventional sequence. The shaded residues in the 4U sequence show where it differs from the 11U sequence.
Fig 7Editing sites 420–489 of COIII aligned with the gRNAs identified for that region in the procyclic (grey) and bloodstream (black) data sets.
The gRNA covering 443–474 was only found in the bloodstream data set.