| Literature DB >> 28180286 |
Katherine James1, Pamela Gamba1, Simon J Cockell2, Nikolay Zenkin1.
Abstract
The transcription error rate estimated from mistakes in end product RNAs is 10−3–10−5. We analyzed the fidelity of nascent RNAs from all actively transcribing elongation complexes (ECs) in Escherichia coli and Saccharomyces cerevisiae and found that 1–3% of all ECs in wild-type cells, and 5–7% of all ECs in cells lacking proofreading factors are, in fact, misincorporated complexes. With the exception of a number of sequence-dependent hotspots, most misincorporations are distributed relatively randomly. Misincorporation at hotspots does not appear to be stimulated by pausing. Since misincorporation leads to a strong pause of transcription due to backtracking, our findings indicate that misincorporation could be a major source of transcriptional pausing and lead to conflicts with other RNA polymerases and replication in bacteria and eukaryotes. This observation implies that physical resolution of misincorporated complexes may be the main function of the proofreading factors Gre and TFIIS. Although misincorporation mechanisms between bacteria and eukaryotes appear to be conserved, the results suggest the existence of a bacteria-specific mechanism(s) for reducing misincorporation in protein-coding regions. The links between transcription fidelity, human disease, and phenotypic variability in genetically-identical cells can be explained by the accumulation of misincorporated complexes, rather than mistakes in mature RNA.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28180286 PMCID: PMC5388426 DOI: 10.1093/nar/gkw969
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) Upon misincorporation, the elongation complex (EC) backtracks by 1 base pair, which then leads to further backtracking (7,8). Misincorporated and deeply backtracked ECs result in long-living pauses of transcription until resolved by intrinsic or factor-dependent cleavage. The paused ECs may cause collisions with replication, and cause RNAP traffic jams. (B) Native Elongating Transcripts sequencing (NET-seq) is a technique that involves sequencing of the 3΄ proximal parts of transcripts that are bound to transcribing RNAP. Shown is the scheme of the transcription EC, with positions in the transcript RNA (red) numbered from the 3΄ end. (C) The error rates at the 3΄ to –10 positions of the nascent RNAs of all active ECs with no filtering from S. cerevisiae (Sc; wild-type and ΔTFIIS mutant strains) and E. coli (Ec; independent data set for wild-type and ΔGre mutant strains). (D) The specific misincorporation rates at the 3΄, −1 and −2 positions for all ECs with no filtering from wild-type and mutant E. coli and S. cerevisiae strains.
Figure 2.Sequence logos for the specific misincorporations (T of the read corresponds to U in the RNA). (A) The sequences surrounding the G>A misincorporations in the EcΔGre and ScΔTFIIS strains. (B) The sequences surrounding the C>A and U>A hotspots for the ScΔTFIIS strain. C. The sequences surrounding the U>C misincorporations at the 3΄, −1 and −2 positions in ScΔTFIIS.
Distribution of G>A misincorporations and hotspots
| Dataset | Type | # locations | #ECs | Translated | Transcribed non-translated | ||
|---|---|---|---|---|---|---|---|
| Length (bp) | mm/100 000 bp | Length (bp) | mm/100 000 bp | ||||
| All | 197947 | 361319 | 7173492 | 1848.14 | 1 241 209 | 1326.21 | |
| Hotspot | 40 | 4939 | 0.22 | 0.24 | |||
| All | 199307 | 519122 | 3871814 | 4356.2 | 140 405 | 4352.41 | |
| Hotspot | 223 | 35023 | 1.34 | 10.68 | |||
The number of G>A misincorporation (mm) positions and hotspots (G>A hotspots were defined as having >50 misincorporations) in the deletion mutants, and the misincorporation rates in the translated regions in comparison to the transcribed non-translated regions. Transcribed translated regions were defined as aligned locations within protein coding sequences, while transcribed non-translated regions as aligned locations within the untranslated regions. In S. cerevisiae introns were also included in the transcribed non-translated regions.
Figure 3.The number of misincorporated ECs (A) and misincorporation positions (B) as the threshold for misincorporations per position is increased. The vast majority of misincorporations occur at positions with a single misincorporation event.