Literature DB >> 28973470

Specific reverse transcriptase slippage at the HIV ribosomal frameshift sequence: potential implications for modulation of GagPol synthesis.

Christophe Penno1,2,3, Romika Kumari1, Pavel V Baranov1, Douwe van Sinderen2,3, John F Atkins1,2,4.   

Abstract

Synthesis of HIV GagPol involves a proportion of ribosomes translating a U6A shift site at the distal end of the gag gene performing a programmed -1 ribosomal frameshift event to enter the overlapping pol gene. In vitro studies here show that at the same shift motif HIV reverse transcriptase generates -1 and +1 indels with their ratio being sensitive to the relative concentration ratio of dNTPs specified by the RNA template slippage-prone sequence and its 5' adjacent base. The GGG sequence 3' adjacent to the U6A shift/slippage site, which is important for ribosomal frameshifting, is shown here to limit reverse transcriptase base substitution and indel 'errors' in the run of A's in the product. The indels characterized here have either 1 more or less A, than the corresponding number of template U's. cDNA with 5 A's may yield novel Gag product(s), while cDNA with an extra base, 7 A's, may only be a minor contributor to GagPol polyprotein. Synthesis of a proportion of non-ribosomal frameshift derived GagPol would be relevant in efforts to identify therapeutically useful compounds that perturb the ratio of GagPol to Gag, and pertinent to the extent in which specific polymerase slippage is utilized in gene expression.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28973470      PMCID: PMC5737442          DOI: 10.1093/nar/gkx690

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Moloney Murine Leukemia retrovirus gag gene is 5′ adjacent to, and in the same frame as its pol gene, whereas the Rous Sarcoma and HIV counterpart pol genes are in the –1 frame with respect to their gag genes. Following the evidence that synthesis of Moloney Murine Leukemia Virus GagPol involves in-frame readthrough of the gag termination codon (1,2), attention focused on Rous Sarcoma and HIV GagPol synthesis. A ribosomal –1 frameshift event was shown to occur in the decoding of a sequence near the 3′ end of their gag genes that overlaps their pol genes (3,4). The demonstration for such frameshifting was a major discovery, and much subsequent work has confirmed the involvement of a –1 ribosomal frameshift event in GagPol synthesis (5,6). The HIV-1 gag-pol shift site, U6A, is AAU-UUU-UUA-GGG with the codons shown in the gag reading frame and the underlining highlighting the first two codons following realignment to the -1 frame, the pol reading frame. The initial evidence for the frameshifting occurring at the translation level involved cell-free synthesis experiments. The argument against a small proportion of the mRNA used in the mammalian cell-free protein synthesis experiments having gag and pol in the same reading frame was the reported finding that Escherichia coli cell-free translation did not yield the fusion product (3). However, later work showed that E. coli ribosomes perform counterpart –1 ribosomal frameshifting at the same sequence with 40% efficiency of mammalian ribosomes (5,7). This ruled out the argument against some GagPol potentially being derived from reverse transcriptase slippage followed by standard in-frame decoding. During the subsequent 28 years, the possibility of reverse transcriptase slippage at the retroviral gag-pol frameshift sites has not been investigated, though deep sequencing has revealed heterogeneity at several sites in retroviral sequences (8). Numerous studies, some ongoing, seek compounds that specifically modulate HIV ribosomal frameshifting efficiency (9–15). The therapeutic goal of this work is expected to be dependent not only on their specificity and effectiveness in modulating ribosomal frameshifting, but also on whether a proportion of GagPol is derived from the alternative process of specific polymerase slippage. The present study is confined to in vitro experiments, but it points to the need for further work to assess the origin(s) of GagPol for retroviruses whose pol is in the –1 frame with respect to their gag coding sequence. In addition to practical considerations, historical aspects of the issues faced by one of the two groups that initially studied retroviral ‘frameshifting’ (16), the wider issue is the extent of functional utilization of specific polymerase slippage in gene expression. Early studies on mechanistically distinct utilization in the expression of paramyxoviruses (17,18), the filovirus Ebola (19), and a bacterial DNA polymerase gene (20) have been broadened by several studies including those on bacterial IS elements (21–23), Ebola virus (23–26), several secretion apparatus genes of Shigella flexneri (27–29), Citrobacter rodentium and Yersinia pseudotuberculosis (30), certain high genomic AT content endosymbiotic bacteria of insects (31,32) and a cell movement gene of the major plant virus family of Potyviruses (33–36). Despite these discoveries, the present work points to the need for further work on the relatively understudied phenomenon of specific transcriptional slippage in cellular and viral gene expression. One aspect relevant to searches for unknown cases of slippage utilization is context effects that may inhibit the propensity for polymerase slippage. This is investigated here by analysis of the GGG sequence 3′ adjacent to the ribosomal frameshift site and previously shown to be important for the ribosomal frameshifting (37). Another relevant aspect is potential stimulatory effects of ‘partial roadblocking’ RNA structures ahead of polymerase that could stimulate specific slippage at sites including those not recognizable as likely shift-prone. Such structures can stimulate reverse transcriptase slippage at a nearby 5′ sequence at which it would not otherwise detectably occur (38). Retroviral gag-pol ribosomal frameshifting is stimulated by RNA structural recoding elements 3′ of the shift site (16). This raises the possibility that the same structures, or components thereof, may also act to stimulate reverse transcriptase slippage. Studies addressing potential, or demonstrably, functionally utilized specific polymerase slippage build on earlier fidelity investigations of different polymerases, including of reverse transcriptases. There have been a number of such studies with HIV-1 RT. One of the notable findings was several fold higher fidelity with RNA than DNA templates (39), and another discerned the effects of dNTP pool imbalances on frameshift fidelity with DNA templates (40). This was also studied in a comparative analysis of slippage-mediated product indel formation by several different reverse transcriptases (38). Furthermore, substrate imbalance is also well known to influence E. coli and S. cerevisiae DNA-dependent RNA polymerase slippage (23). The degree of influence of the ratio of the cognate substrate for the slippage site to that of the substrate base for the 5′ adjacent base, also features in the present study of specific HIV RT slippage at a site region that is also responsible for the ribosomal frameshifting that mediates GagPol synthesis. In distinction to these earlier studies, the present study involves the natural context of the relevant sequence, since it permits potential limiting or stimulatory features to be discerned. Also the deep sequencing employed permits a more refined analysis.

MATERIALS AND METHODS

RNA template constructs

Double-stranded DNA templates, for T7 RNA polymerase, were generated by an overlapping DNA oligonucleotide strategy (Supplementary Table S1) and 3′ extended with Vent DNA polymerase. Preparation of RNA template (200 nt HIV) was generated by T7 RNA polymerase in vitro in 20 μl reaction volumes with a Promega kit (T7 RiboMAX™ large scale RNA production system) according to the manufacturer's instructions. Reaction products were treated with 2 units of DNase I turbo (Ambion) in 50 μL reaction volumes at 37°C for 30 min. 500 μl Trizol (Invitrogen) was added to the reaction and incubated for 2 min. Then, 100 μl chloroform was added and incubated for 5 min at room temperature, before centrifugation, 12 000 × g for 15 min at 4°C. The upper aqueous phase was extracted, and an equal volume of 100% ethanol was mixed with it before it was loaded on a silica column (Anachem PCR purification kit). After centrifugation, 12 000 × g for 30 s, the column was washed with 600 μl of a low salt wash buffer (Anachem's ‘PCR clean up’ kit wash buffer to which ethanol was added to 80% concentration). The column was then treated with DNase I turbo, 10 units in a 100 μl volume for 1 h incubation at 37°C. This was followed by 600 μl of a high salt wash buffer (Anachem's ‘PCR clean up’ kit wash buffer to which ethanol was added to 40% concentration) and centrifugation. It was then washed twice, via centrifugation, with the low salt buffer, 600 and 300 μL. The final centrifugation was at 12 000 × g for 2 min to dry the column before elution with 100 μl of RNase free water. Chemically synthesized RNA templates and DNA oligonucleotides were from IDT-DNA (Supplementary Table S1).

Reverse transcription

HIV-1 reverse transcriptase enzyme was purchased from Abcam. RT reactions involved a pre-annealing step of the RNA template (100 ng): DNA Primer (2 pmol) (Supplementary Table S1), in 1× SuperScript™ III buffer (50 mM Tris–HCl, pH 8.3 at 25°C, 75 mM KCl, 3 mM MgCl2, 5 mM DTT), in the presence of the dNTP substrate (with the specific concentrations of each indicated in the main text) and, where present, antisense (100 pmol), in a 10 μl reaction volume. With the HIV-1 200 nt cassette, the annealing step was at 95°C for 30 s with a 10% temperature decrease (from 95 to 16°C) every 30 s. With RT reactions involving the HIV-1 chemically synthesized RNA cassette combinations (Figure 3), the annealing step was at 65°C for 5 min before chilling on ice. Then, a 10 μl mix (1× SuperScript™ III buffer, 4 units HIV-1 RT enzyme, 20 mM DTT) was added and incubated at 37°C for 50 min, followed by 85°C for 5 min.
Figure 3.

HIV gag-pol RT slippage stimulators. Strategy to assess possible stimulatory or inhibitory effects on slippage by potential structure formation in the RNA template and/or in its derived nascent cDNA, during reverse transcription of the U6A motif. Sequence set I shows the chemically synthesized RNA (‘long RNA’) and a potential for stem-loop formation (red and green). When the RT transcribes as far as the 5′ U of the U6 tract (to generate ‘long cDNA’), the parts of the RNA or cDNA free behind the 18 bp hybrid, and potentially free to participate in stem-loop formation, are indicated by inverted arrows. Sequence set II shows the same ‘long RNA’ template reverse transcribed from a primer whose sequence is in italics at the 5′ end of the ‘short cDNA’. Sequence set III has ‘short RNA’ without the potential for structure formation involving the sequence underlined by inverted arrows in sets I and II. The RNA triplet GGG present in both ‘long and short RNA’ templates, which was substituted by the sequence CCC (designated ‘mut’), is highlighted in yellow.

Polymerization chain reaction

Each specific cDNA was amplified using the corresponding set of forward and reverse primers (Supplementary Table S1). Standard PCR reactions were 50 μL volume and contained: 1× Thermo buffer (Biolabs), 2 μl cDNA or 4 nM DNA template oligo, 200 μM each dNTP (Biolabs), 500 nM each specific primer, and 0.8 unit Taq DNA polymerase (Biolabs). The PCR cycle was: denaturation at 94°C for 5 min, then 25 cycles of denaturation at 94°C for 30 s, annealing at 52°C for 30 s and elongation at 72°C for 30 s. This was followed by a final elongation at 72°C for 1 min.

Sample preparation for deep sequencing

A first round of amplification of the cDNA product was with a set of forward and reverse primers with an NGS adapter sequence at their 5′ ends and sequence complementary to the cDNA product at their 3′ ends. This set of primers included the common forward primer, HIV_NGS_F1697, whose 3′ sequence is complementary to the 3′ end of the cDNA. The reverse primer used for set I (‘long RNA: long cDNA’) was HIV_NGS_R1695, and for sets 2 and 3 (‘long RNA: short cDNA’ and ‘short RNA: short cDNA’, respectively) it was HIV_NGS_R1696. PCR products were then purified using Zymo-SpinTM I column (Zymo Research) with buffers from the PCR clean-up kit (Roche). Subsequently, a limited cycle PCR using Nextera XT indices (Illumina) was performed to incorporate sequencing adaptors and dual-index barcodes to each PCR product. Products were then purified with Ampure XP beads (Beckman Coulter), quantified using the Qubit fluorimeter (Thermo Fisher) and pooled at an equimolar concentration. The mix of 80 samples was sequenced on a MiSEq (Illumina) using a MiSeq Reagent Nano Kit (300 cycles) with V2 chemistry (Illumina).

Deep sequencing analysis

The paired-end cDNA libraries were obtained for 80 samples with approximate yield of ∼10 000/reads per sample. The reads were trimmed using Cutadapt (41). Adapter sequences were clipped from forward (5′-CTGTCTCTTATACACATCTCCGAGCCCACGAGAC-3′) and reverse (5′-CTGTCTCTTATACACATCTGACGCTGCCGACGA-3′) reads. Trimmed reads were aligned to the corresponding reference gene sequence using bowtie2 short read alignment program (42). Local alignments of paired end reads were carried out with default parameters. Sorted BAM alignment files were obtained using SAMtools version 1.3.1 (43) which were used for prediction of SNPs and INDELs. Variant calling was performed using SAMtools mpileup. Parameters used for mplileup predictions were (-g -uf –output-tags DP, AD -L 1000000 -d 1000000) which generated output in BCF format. BCFtools (44) view option was used to convert variant calls to the Variant Call Format (VCF). For graphical analysis of substitution rates in the slippery sites custom script based on seqLogo (45) R package was used to plot sequence logos. Heatmaps were generated using MeV (Multiple Experiment Viewer) (46).

RESULTS

Strategy for testing the effect of a 200 nt HIV1 gag-pol cassette

The sequence 3′ of the 5′-U6A-3′ HIV gag-pol frameshift site can form alternative structures. Though most work on ribosomal frameshifting has been done with cassettes with 65 nt 3′ of the shift site (47), additional flanking sequences may form other relevant structures (48–51). This was tested using a 140 nt cassette with 71 nt 3′, and 63 nt 5′, of the shift site. Structures similar to those in template RNA may also form in the nascent cDNA and are potentially relevant to reverse transcriptase slippage. Conceptual precedent comes from the finding that formation of a nascent RNA stem-loop structure can be a strong stimulator for DNA-dependent RNA polymerase slippage (23). To assess potential HIV RT enzyme slippage on the gag-pol U6A frameshift motif with putative stimulation by the flanking sequence context, the 140 nt RNA used in the ribosomal frameshifting studies plus 60 nt for primer annealing (total 200 nt), sequence was generated in vitro by T7 RNAP (Materials and Methods). This RNA had 70 nt 5′ of the shift site, the U6A shift site and 124 nt 3′ of it (Figure 1).
Figure 1.

The HIV gag-pol ribosomal frameshift site also mediates HIV reverse transcriptase slippage. The gag-pol frameshift site, U_UUU_UUA (underlined) is shown with 30 nt 5′ of the shift site and 48 nt 3′ (for which one of the potential pairing possibilities is indicated in red and green). The 5′ and 3′ borders of an RNA sequence involved in a potential and relevant large alternative structure revealed in a prior study (48), are indicated by red brackets. RNA three frame translation is shown with the 0 and –1 reading frames being those of gag and pol respectively. The highlighted grey box indicates the amino-acid sequence of the ribosomal frameshift products and the corresponding sequence specified only by the original zero frame. The 5′ and 3′ nt sequences in bold indicate the nt annealing region of the primers used in the (RT)-PCR reactions.

The HIV gag-pol ribosomal frameshift site also mediates HIV reverse transcriptase slippage. The gag-pol frameshift site, U_UUU_UUA (underlined) is shown with 30 nt 5′ of the shift site and 48 nt 3′ (for which one of the potential pairing possibilities is indicated in red and green). The 5′ and 3′ borders of an RNA sequence involved in a potential and relevant large alternative structure revealed in a prior study (48), are indicated by red brackets. RNA three frame translation is shown with the 0 and –1 reading frames being those of gag and pol respectively. The highlighted grey box indicates the amino-acid sequence of the ribosomal frameshift products and the corresponding sequence specified only by the original zero frame. The 5′ and 3′ nt sequences in bold indicate the nt annealing region of the primers used in the (RT)-PCR reactions. The cDNA products were amplified by polymerase chain reaction (PCR) using specific primers. As a control for DNA polymerase fidelity, a counterpart PCR experiment was performed with a chemically synthesized DNA whose sequence was the reverse complement of the 200 nt RNA template used above. The resulting RT-PCR and PCR products were quantitatively analyzed by next-generation sequencing (NGS) (Materials and Methods). For each sample, single nucleotide polymorphism (SNP) and nucleotide insertion/deletion (indel) frequency were determined at each nt location by comparison to their respective cDNA reference sequence. For indel analysis at the 5′-U6A-3′ gag-pol derived sequence, NGS reads were selected only when the sequence derived from the RNA template run of U’s was free of substitutions.

Effect of the HIV1 gag-pol 140 nt frameshifting element RNA on RT slippage

With an A6 slippage motif HIV RT slippage directionality is sensitive to an imbalance of the concentrations of the cognate substrate dNTPs specified by a template slippage motif and the base 5′ adjacent to the motif (38). To ascertain if counterpart effects occur with 5′-AU6A, an imbalanced ratio of dATP, cognate for the U6 tract, and of dTTP, cognate for the template base 5′ adjacent to the U6, was tested. RT reactions were performed with 3 dNTP ratio conditions for the substrates: (i) all 4 dNTPs at 500 μM, (ii) dTTP 5 μM, other 3 at 500 μM, (iii) dATP at 5 μM other 3 at 500 μM. At the 5′-U6A-3′ gag-pol derived sequence, the NGS data shows a correlation of indel frequency for the cDNA base A with the [dATP]:[dTTP] ratio in the RT reactions. With equimolar dNTP, base omission is 2.1% and base addition 1.4%; with the [dATP]500μM:[dTTP]5μM ratio, base omission is 1.7% and addition 2.0%; with the [dATP]5μM:[dTTP]500μM ratio, base omission is 2.6% and addition is 1.1%. The control for DNA polymerase slippage shows a lower frequency (1.2%) of base omission, and a background level (0.1%) for base addition (Figure 2). These NGS results show that HIV RT undergoes specific slippage-mediated base addition (especially with [dATP] > [dTTP]) and omission (especially with [dATP] < [dTTP]). Importantly, the natural gag-pol nucleotide context 5′ and 3′ to the U6 motif does not abolish the slippage propensity of the HIV-1 RT enzyme.
Figure 2.

NGS analysis of indels in the run of A’s derived from the 200 nt HIV sequence. Frequency of insertion, or omission of base A, derived from cDNA generated at three different relative dNTP concentration conditions. The standard deviation is for 4 independent reverse transcription reactions. The control reflects only DNA polymerase slippage.

NGS analysis of indels in the run of A’s derived from the 200 nt HIV sequence. Frequency of insertion, or omission of base A, derived from cDNA generated at three different relative dNTP concentration conditions. The standard deviation is for 4 independent reverse transcription reactions. The control reflects only DNA polymerase slippage. The other type of fidelity error, base mis-incorporation was next analyzed to enable its comparison with indel formation. The SNP frequency is less than 0.5%. Its frequency at locations corresponding to each base in the cDNA run of A’s (5′-A1A2A3A4A5A6–3′, with the number in subscript indicating the position of the base in the homopolymeric run) is similar whether all dNTPs are equimolar, or with the [dATP]500μM:[dTTP]5μM ratio. In contrast, with the [dATP]5μM:[dTTP]500μM ratio, the SNP frequency strongly correlates with the cDNA A-tract base position. It increases from 1.3% corresponding to A1 to 7.1% at A6 with a preference for mis-incorporation of the base G (from ≈ 1.1% at A1 to 4.6% at A6) (Supplementary Results). These results show that depending on the relative dNTP concentrations, SNP errors can be less significant than indel errors for the 5′-U6A-3′ derived sequence. [A graphical comparison of the frequency of indels and SNPs in the cDNA reference sequence 5′-TCCCTAAAAAATTAGCCTC-3′ for 4 repeated experiments is shown in Supplementary Dataset S1. A heatmap representation of the SNP frequency is also shown for 25 nt 5′ to the A-tract of the reference cDNA (Supplementary Figure S3). Supplementary Dataset S1 and Supplementary Figure S3 also show the repeat experiments for the constructs described below]. Responsiveness to alterations of the ratios of the different dNTPs is relevant to discrimination against effects of potential infidelity of the T7 RNA polymerase used to generate the template for this work. The same derived RT-PCR and PCR products used in the NGS analysis were also analyzed by Limited Primer Extension (LPE) (Supplementary Information; Supplementary Figure S1, panel A). This has the advantage of allowing more conditions to be quickly and cheaply tested. Though it permits analysis of only a specific part of the template sequence, the segment analyzed here is the most relevant. LPE analysis yielded qualitatively similar results to NGS for both base indel and SNP, for products derived from the 5′-U6A-3′ containing sequence (Supplementary Results and Supplementary Figure S1, panel B). To explore a possible template road-blocking effect on HIV RT reverse transcriptase, an experiment was performed with a 10 nt antisense RNA complementary to the HIV RNA sequence 5′ adjacent to the U6-tract. LPE analysis revealed that presence of the antisense favors slippage-mediated base addition (Supplementary Results, Supplementary Figure S1, panel B) (LPE analysis was used for this characterization since it was also used for multiple reaction conditions with the templates described below).

Strategy for testing nucleic acid scaffolds for RT reactions with chemically synthetized RNA templates

Chemically synthesized RNA avoids potential complications due to T7 RNA polymerase infidelity. We explored slippage with synthetic RNAs close to the maximum practical length of single synthesis RNA (84 nt). The two RNAs contain 30 HIV nt 5′ of the shift site, the U6 tract, and, in RNA_1654, 48 HIV nt, and in RNA_1655, 25 HIV nt. 1654 is referred to below as ‘WT long RNA’ and 1655 as ‘WT short RNA’. In ‘WT long RNA’ at least 35 nt 3′ of the U6 tract have potential to be involved in stem–loop structure formation. In ‘WT short RNA’ the potential for stem–loop structure is limited (Figure 3). HIV gag-pol RT slippage stimulators. Strategy to assess possible stimulatory or inhibitory effects on slippage by potential structure formation in the RNA template and/or in its derived nascent cDNA, during reverse transcription of the U6A motif. Sequence set I shows the chemically synthesized RNA (‘long RNA’) and a potential for stem-loop formation (red and green). When the RT transcribes as far as the 5′ U of the U6 tract (to generate ‘long cDNA’), the parts of the RNA or cDNA free behind the 18 bp hybrid, and potentially free to participate in stem-loop formation, are indicated by inverted arrows. Sequence set II shows the same ‘long RNA’ template reverse transcribed from a primer whose sequence is in italics at the 5′ end of the ‘short cDNA’. Sequence set III has ‘short RNA’ without the potential for structure formation involving the sequence underlined by inverted arrows in sets I and II. The RNA triplet GGG present in both ‘long and short RNA’ templates, which was substituted by the sequence CCC (designated ‘mut’), is highlighted in yellow. HIV RT slippage may be stimulated by a structure in the nascent cDNA that is a counterpart of an RNA template structure. To investigate this possibility two different sites were used for priming, only one of which is in ‘WT short RNA’. The primer for this site, R_1648, yields a 25 HIV nt nascent cDNA sequence 5′ of the A-tract, ‘short cDNA’, with limiting stem–loop structure formation (Figure 3). The additional primer for ‘WT long RNA’, (R_1645) yields a 48 HIV nt nascent cDNA sequence 5′ of the A-tract, ‘long cDNA’, with the potential for structure formation (Figure 3). Three different ‘RNA template: Primer’ combinations were tested (Figure 3). The first combination involved ‘WT long RNA: long cDNA’ with both having structure potential, (Figure 3, combination I). The second combination involved ‘WT long RNA: short cDNA’ with only the template having relevant structure potential (Figure 3, combination II). The third combination involved ‘short RNA: short cDNA’ with neither having structure potential (Figure 3, combination III). Due to the effect of dNTP imbalance on RT slippage directionality (resulting in base addition or omission), RT reactions were performed with 9 dNTP concentration conditions. In all conditions dCTP and dGTP were constant at 500 μM. The concentrations of dATP (the substrate specified by the U6 motif) and dTTP (the substrate specified by the RNA template base 5′ adjacent to the U6) were either at 5, 50 or 500 μM.

Run of U’s derived indels studied with several template:primer combinations

For each sample, NGS reads were selected for slippage analysis only when sequence derived from the RNA template run of U’s was free of substitutions. For the 3 ‘RNA template: Primer’ combinations (Figure 3), sequence derived from the run of U’s exhibits both types of indel due to slippage in both directions, and with a higher frequency of base omission than base addition (Figure 4). This means that potential to form a small cDNA structure adjacent, or almost adjacent, to the hybrid does not influence product indel formation. Also, potential by the ‘long RNA’ template containing the GGG to form structure does not, at least, significantly influence the efficiency of indel generation. However, specific dNTP ratios are seen to influence the slippage. With cognate dATP at any of 5, 50 or 500 μM concentrations and the dTTP concentration increasing from 5 to 50 to 500 μM, the frequency of specific base omission increased from 0.5% up to 2.5%. With the same substrate concentrations, the efficiency of base addition is less than 0.5% except with the highest dATP:dTTP ratios (Figure 4). Qualitatively similar results were also obtained by LPE analysis (Supplementary Results and Supplementary Figure S2A and C).
Figure 4.

NGS analysis of ‘WT’ HIV cDNA derived sequence: indel distribution. Frequency of base addition, or omission, in the cDNA run of A’s. This is shown for the 3 ‘RNA template: Primer’ combinations indicated above each graph. Sample identity and dNTP concentration ratio conditions are indicated at the bottom. Indel results are from representative samples 1–27 (SI Results & SI Dataset 1).

NGS analysis of ‘WT’ HIV cDNA derived sequence: indel distribution. Frequency of base addition, or omission, in the cDNA run of A’s. This is shown for the 3 ‘RNA template: Primer’ combinations indicated above each graph. Sample identity and dNTP concentration ratio conditions are indicated at the bottom. Indel results are from representative samples 1–27 (SI Results & SI Dataset 1).

Does the highly conserved GGG 3′ adjacent to the U6A motif influence RT slippage?

The highly conserved GGG triplet 3′ adjacent to the U6A motif (37) is in the RNA:DNA hybrid when HIV RT reverse transcribes the U-tract. This GGG is potentially relevant to hybrid stability and realignment. We used a similar strategy as in the previous section to test for possible effects of substituting GGG with CCC (Figure 3, highlighted in yellow). The three combinations, ‘mut long RNA: long cDNA’, ‘mut long RNA: short cDNA’ and ‘mut short RNA: short cDNA’ were tested with nine dNTP concentration conditions. In all conditions dCTP and dGTP were at 500 μM. The concentrations of dATP (specified by U6) and dTTP (specified by the template base 5′ adjacent to U6) were either at 5, 50 or 500 μM. The 27 different RT-PCR products derived from cDNA generated by the nine dNTP concentration conditions for each of the 3 ‘RNA template: Primer’ combinations, were analyzed by LPE (Supplementary Results and Supplementary Figure S2B and D). Nine of these different RT-PCR products were also analyzed by NGS; they correspond to the 3 ‘RNA template: Primer’ combinations tested with the three following dNTP ratio conditions (i) all four dNTPs at 500 μM, (ii) dTTP 5 μM, other three at 500 μM, (iii) dATP at 5 μM other three at 500 μM. With the two combinations, ‘mut long RNA: short cDNA’ and ‘mut short RNA: short cDNA’, slippage-mediated base addition occurs at a higher frequency than with their WT counterpart (Figure 5A). With equimolar dNTP concentration, base addition is ∼3 times more efficient with the mut combination compared with the WT equivalent. Interestingly, the efficiency of base addition of the mut combinations at equimolar dNTPs is even higher than the efficiency of base addition for the corresponding WT combination at the dNTP concentration ratio which favors base addition, i.e. [dATP]500μM:[dTTP]5μM. In contrast, mut combinations exhibit a similar frequency of product base omission as their WT counterparts (Figure 5A).
Figure 5.

Effect of the nt triplet one nt 3′ to the run of U’s on HIV RT fidelity. (A) Frequency of slippage-mediated base addition, or omission, for the WT and mut combinations tested with RT reactions involving three different dNTP concentrations conditions. SEM error bars are indicated for two independent experiments. (B) Sequence logo of the substituted base in the run of A’s. The cognate substrate was omitted to enhance visualization of the substituted bases in the run of A’s in the cDNA. The identity of the ‘RNA template: Primer’ combination and the dNTP concentration conditions, are indicated above the WT (top row), and below the corresponding mut combination (bottom row). The name of the representative sample is indicated; its corresponding dataset is in SI Dataset S1.

Effect of the nt triplet one nt 3′ to the run of U’s on HIV RT fidelity. (A) Frequency of slippage-mediated base addition, or omission, for the WT and mut combinations tested with RT reactions involving three different dNTP concentrations conditions. SEM error bars are indicated for two independent experiments. (B) Sequence logo of the substituted base in the run of A’s. The cognate substrate was omitted to enhance visualization of the substituted bases in the run of A’s in the cDNA. The identity of the ‘RNA template: Primer’ combination and the dNTP concentration conditions, are indicated above the WT (top row), and below the corresponding mut combination (bottom row). The name of the representative sample is indicated; its corresponding dataset is in SI Dataset S1. With the mut sequence, base substitutions are more prominent at each position of the cDNA run of A’s (5′-A1A2A3A4A5A6–3′), especially with 5 μM dATP and the other three dNTPs at 500 μM. This SNP difference between mut and WT can be seen with sequence logo illustrations (Figure 5, panel B and Supplementary Dataset 1). Compared to the ‘WT RNA: cDNA’ combinations, the corresponding mut combinations, there is a marked SNP increase at cDNA base position A2 and this high proportion is maintained at a similar level for positions A3–A6. In contrast, with the corresponding WT combination, there is a gradual increase of substitutions from positions A1 to A6 in the cDNA. This result suggests that mutating the conserved RNA template GGG triplet to CCC, influences RT polymerase fidelity at least at the template base position U2. As judged only from mutating the conserved RNA template GGG to CCC, the identity of the bases at this position in WT serves to constrain the frequency of base A errors in the run of A’s in the product cDNA.

DISCUSSION

Reverse transcriptase slippage is shown to mediate generation of DNA which when transcribed would yield RNA with the sequence AAU-UUU-UUU-AGG. The underlined U is absent in the product, AAU-UUU-UUA-GG, generated without slippage. Standard translation of the slippage derived RNA would yield Asn-Phe-Phe-Arg, identical to the minor product from -1 ribosomal frameshifting (Figure 6) [two GagPol products are generated by –1 ribosomal frameshifting, 80% with Asn-Phe-Leu-Arg and 20% with Asn-Phe-Phe-Arg specified by the frame junction sequence (4,5,52)]. Despite all the work on retroviral ribosomal frameshifting, it is only relatively recently that a product from ribosome shifting into the alternative +1 frame has been detected. The frameshifting involved was detected with reporter cassettes in reticulocyte lysate translation experiments, and the shift involved is –2 rather than +1 (Figure 6). It was considered likely insignificant as its occurrence during HIV translation would lead to a Gag product with the C-terminal sequence Asn-Phe-Leu-stop (termination being specified by the +1 frame UAG above) (53). Reverse transcriptase slippage is here shown to yield DNA sequence whose derived RNA with the sequence AAU-UUU-UAG would be translated Asn-Phe-stop, i.e. the same as the –2 ribosomal frameshift derived product except for the absence of one amino acid, Leu (Figure 6). WT HIV Gag is proteolytically cleaved after its amino acid Asn encoded by the AAU whose U is the 5′ U of the homopolymeric component of the ribosomal frameshift site. In WT Gag multiple amino acids follow Asn and presumably cleavage does not occur when only two AA are C-terminal to Asn. If so, and if the novel termination product is synthesized in virus-infected cells, its generation could be significant. There are five sequential cleavage sites (54–59) and alteration of the 3rd cleavage site may have an effect on the fourth and fifth proteolytic cleavages of the Gag precursor protein. Irrespective, its potential functional role merits investigation, in part because of possible relevance to resistance mechanisms against drugs targeting the protease involved.
Figure 6.

Expression of HIV gag-pol. (A) Genetic organization of the gag and pol open reading frames. With the nomenclature in common use, standard translation of RNA with an extra base is the counterpart of –1 ribosomal frameshifting on WT mRNA (and from the protein product perspective, base omission is the counterpart of +1 ribosomal frameshifting). The gag-pol ribosomal frameshift sequence is indicated in bold with the corresponding amino-acid sequences indicated for the gag reading 0 frame (black), the pol reading –1 frame (blue) and the +1 reading frame (green). (B) Ribosomal frameshifting yield to Gag’ and GagPol products. The alternative amino-acid sequences in prior analyses of –1 ribosomal frameshift products with the major transframe encoded Phe-Leu containing GagPol product being 80% of the total and its Phe–Phe counterpart constituting the remaining 20% (16). Brackets with dashed lines under codons indicate initial tRNA anticodon pairing prior to realignment. Potential transcription slippage counterparts due to RT slippage-mediated one base addition (C) or one base omission (D) are considered in the Discussion.

Expression of HIV gag-pol. (A) Genetic organization of the gag and pol open reading frames. With the nomenclature in common use, standard translation of RNA with an extra base is the counterpart of –1 ribosomal frameshifting on WT mRNA (and from the protein product perspective, base omission is the counterpart of +1 ribosomal frameshifting). The gag-pol ribosomal frameshift sequence is indicated in bold with the corresponding amino-acid sequences indicated for the gag reading 0 frame (black), the pol reading –1 frame (blue) and the +1 reading frame (green). (B) Ribosomal frameshifting yield to Gag’ and GagPol products. The alternative amino-acid sequences in prior analyses of –1 ribosomal frameshift products with the major transframe encoded Phe-Leu containing GagPol product being 80% of the total and its PhePhe counterpart constituting the remaining 20% (16). Brackets with dashed lines under codons indicate initial tRNA anticodon pairing prior to realignment. Potential transcription slippage counterparts due to RT slippage-mediated one base addition (C) or one base omission (D) are considered in the Discussion.

Protease mutants

In some protease resistance mutants there is a compensatory C-terminal amino acid substitution of the NC-SP2-P6 Gag-derivative product (60). The mutations involved may enhance reverse transcriptase slippage potential. For instance, at the proteolytic cleavage site of NC/SP2, in the mutation that causes an A431V substitution, the codon GCU, 5′ adjacent to the gag sequence AAU_UUU_UUA_GGG that contains the ribosomal frameshifting site, is changed to GUU (60). This mutation may increase potential formation of a weak GNRA capped stem-loop 5′ adjacent to the U-tract (Supplementary Figure S6). At the proteolytic cleavage site for SP2/P6, in the mutation that causes the L449F substitution, the codon CUU (bold) in the gag sequence AAU_UUU_CUU_CAG is replaced by one of the Phe codons, which are UUU and UUC (61,62). Either would create the potential for an additional slippage motif with a 5′ sequence with potential to form an RNA structure that could act as a ‘partial roadblock’ RT slippage stimulator.

Substrate ratios

dUTP incorporation by RT HIV has a protective affect against host chromosome integration (63). The effect of dUTP on reverse transcriptional slippage was not explored in the present manuscript. However, this could also be relevant with the natural target of HIV, as CD4 T cells and macrophages contain a 1:4 ratio of dUTP:dTTP in contrast to other cells (1:100 ratio) (63 and refs therein). dNTP depletion is part of the host's response to viral infection. The deoxynucleotide triphosphohydrolase SAMHD1 depletes dNTP pools which interferes with RT function in myeloid cell (64). Lentiviruses, such as HIV-2, have developed a vpx gene to counteract this effect (65–67). [Contrastingly, in many colon cancer cells SAMHD1 is directly debilitated by mutants in its coding sequence with consequences for dNTP levels and ratios (68); mutator phenotypes of colon cancer cells due to DNA polymerase mutants also affects dNTP levels and ratios (69).] Future work is needed to explore the potential effect on RT HIV slippage of the relative intracellular dNTP concentration in virus-infected cells. The potential selective advantage of base insertion or omission in the gag-pol region of chromosomally integrated HIV virus sequence may also be relevant. Such an investigation could involve gag-pol sequencing of specific tissues of latent phase infected patients.

Slippage versus ribosomal frameshifting: hybrid lengths

A big distinction between ribosomal frameshifting and polymerase slippage is that with frameshifting at each position a much shorter template sequence is involved in base pairing. The RNA:cDNA hybrid in reverse transcriptase is ∼18 bp (70,71). It extends from the RNase H catalytic center to the polymerase catalytic center. The length of the RT slippage U6 motif investigated here is much shorter than the length of the hybrid, suggesting that only a portion of the 18 bp are involved in the re-alignment. RT modelling to identify locations that could accommodate potential hybrid bulge formation (due to an extra helical base), pointed to such potential only corresponding to a hybrid region whose cDNA component is 5–6 bp from its 3′ end (72). However, it is possible that the portion of the hybrid between the slippage motif and the RNase H catalytic center may also affect slippage, perhaps due to its helical topology. A constraint on the latitude of sequence variation relevant to both frameshifting and slippage is retention of an Asn codon (overlapping the 5′ end of the shift motif) since Asn is important for proteolytic cleavage (55,73). While this restricts the number of possible heptanucleotide frameshift sequences that could have been utilized at this site, there are still some that are unlikely to mediate reverse transcriptase slippage. Other features of comparative importance of the shift/slippage cassette are also noteworthy. The highly conserved triplet, GGG, separated by 1 base from the 3′ end of the U-tract, is significant for frameshifting. This sequence has generally been considered not to have reached the ribosomal A-site when frameshifting occurs. However, a recent study has proposed that it is in a (perhaps distorted) A-site when the ribosomal frameshifting occurs (37). In the present work when GGG was substituted with CCC, efficient RT slippage still occurred but it was qualitatively different. There was also an effect on product base substitutions and further work may reveal relevance of the GGG for shift/slippage site preservation. Formally structural RNA recoding signals can potentially stimulate frameshifting or slippage due to effects on their initial unwinding/unpairing or when the sequence involved (re)forms structure on or after, exit from the ribosome/polymerase. With the WT GGG just discussed, structure involving at least most of this ribosomal frameshift relevant sequence only minimally affects slippage. However, when the GGG is substituted with CCC, the further 3′ ribosomal frameshift relevant sequence has also a marked effect on slippage. However, to avail of the advantages of using homogeneous chemically synthesized—though necessarily shorter - template RNA, slippage studies with the full extent of the 3′ recoding signal important for ribosomal frameshifting, were limited. Accordingly, deductions from the part of the work involving the 3′ RNA structure need to be treated with caution. Future experiments could employ RNA construct variants with increased pairing potential 3′ of the U-tract and at different distances from it, or a similar strategy involving complementary antisense oligonucleotides.

RNA structure effects

The potential for RNA structure formation ahead a slippage motif can strongly influence the directionality of HIV reverse transcriptase slippage (38). Interestingly, an RNA antisense specific to the sequence 5′ adjacent to the HIV gag-pol U6 tract also influences, though moderately, HIV-1 RT polymerase slippage. Testing whether native gag RNA template structure formation 5′ to the run of U’s influence HIV GagPol synthesis via reverse transcriptase slippage is outside the scope of the present work. During synthesis of HIV double-stranded DNA from single-stranded cDNA, before the polymerase reaches 3′-TA6TCCC-5′, it encounters the complement of the sequence that forms the ribosomal frameshift stimulatory structure. Studies with a model stem-loop 5′ adjacent to an A6 RNA motif have shown the potential for ‘road-blocking’ RNA structure to enhance slippage-mediated base addition (38) raising the possibility that HIV RT undergoes structure stimulated slippage during DNA second strand synthesis. In contrast to RT slippage at 5′-AU6AGGG-3′ in the generation of single-stranded cDNA, a higher ratio of dATP compared to dTTP is expected to positively influence slippage-mediated base omission on second strand DNA synthesis. While any conclusion about whether the presence of a cassette that can mediate both ribosomal frameshifting and reverse transcriptase slippage is fortuitous or has been selected to facilitate both processes, is premature, dual function may not be coincidental.

Perspective

One type of recoding and splicing are alternatively used in different occurrences of one set of othologs (74–76), in several instances one type of recoding has been replaced by a different type (16). One example of the latter is dnaX where in E. coli expression of a second protein product involves a ribosomal frameshift event and in other bacteria including Thermus thermophilus, synthesis of the counterpart product involves transcription slippage (20,77–79). The present work raises the possibility that several retroviruses use a single cassette for productive ribosomal frameshifting and polymerase slippage. Such dual function could be relevant to studies aiming to ameliorate retroviral infection with compounds that alter only the efficiency of one of the two mechanisms involved. Click here for additional data file.
  77 in total

1.  Transcriptional slippage in mxiE controls transcription and translation of the downstream mxiD gene, which encodes a component of the Shigella flexneri type III secretion apparatus.

Authors:  Christophe Penno; Claude Parsot
Journal:  J Bacteriol       Date:  2006-02       Impact factor: 3.490

2.  A [Cu]rious Ribosomal Profiling Pattern Leads to the Discovery of Ribosomal Frameshifting in the Synthesis of a Copper Chaperone.

Authors:  John F Atkins; Gary Loughran; Pavel V Baranov
Journal:  Mol Cell       Date:  2017-01-19       Impact factor: 17.970

3.  The effects of dNTP pool imbalances on frameshift fidelity during DNA replication.

Authors:  K Bebenek; J D Roberts; T A Kunkel
Journal:  J Biol Chem       Date:  1992-02-25       Impact factor: 5.157

4.  Murine leukemia virus protease is encoded by the gag-pol gene and is synthesized through suppression of an amber termination codon.

Authors:  Y Yoshinaka; I Katoh; T D Copeland; S Oroszlan
Journal:  Proc Natl Acad Sci U S A       Date:  1985-03       Impact factor: 11.205

5.  Heterozygous colon cancer-associated mutations of SAMHD1 have functional significance.

Authors:  Matilda Rentoft; Kristoffer Lindell; Phong Tran; Anna Lena Chabes; Robert J Buckland; Danielle L Watt; Lisette Marjavaara; Anna Karin Nilsson; Beatrice Melin; Johan Trygg; Erik Johansson; Andrei Chabes
Journal:  Proc Natl Acad Sci U S A       Date:  2016-04-11       Impact factor: 11.205

6.  High-affinity recognition of HIV-1 frameshift-stimulating RNA alters frameshifting in vitro and interferes with HIV-1 infectivity.

Authors:  Leslie O Ofori; Thomas A Hilimire; Ryan P Bennett; Nathaniel W Brown; Harold C Smith; Benjamin L Miller
Journal:  J Med Chem       Date:  2014-01-15       Impact factor: 7.446

7.  Highly conserved RNA pseudoknots at the Gag-Pol junction of HIV-1 suggest a novel mechanism of -1 ribosomal frameshifting.

Authors:  Xiaolan Huang; Yang Yang; Guan Wang; Qiang Cheng; Zhihua Du
Journal:  RNA       Date:  2014-03-26       Impact factor: 4.942

8.  Transcriptional frameshifting rescues Citrobacter rodentium type VI secretion by the production of two length variants from the prematurely interrupted tssM gene.

Authors:  Erwan Gueguen; Norma M Wills; John F Atkins; Eric Cascales
Journal:  PLoS Genet       Date:  2014-12-04       Impact factor: 5.917

9.  Paramyxovirus mRNA editing leads to G deletions as well as insertions.

Authors:  J P Jacques; S Hausmann; D Kolakofsky
Journal:  EMBO J       Date:  1994-11-15       Impact factor: 11.598

10.  The function of a ribosomal frameshifting signal from human immunodeficiency virus-1 in Escherichia coli.

Authors:  E Yelverton; D Lindsley; P Yamauchi; J A Gallant
Journal:  Mol Microbiol       Date:  1994-01       Impact factor: 3.501

View more
  8 in total

1.  GC-Content Dependence of Elastic and Overstretching Properties of DNA:RNA Hybrid Duplexes.

Authors:  Dongni Yang; Wenzhao Liu; Xiangyu Deng; Wei Xie; Hu Chen; Zhensheng Zhong; Jie Ma
Journal:  Biophys J       Date:  2020-07-16       Impact factor: 4.033

2.  Maximum depth sequencing reveals an ON/OFF replication slippage switch and apparent in vivo selection for bifidobacterial pilus expression.

Authors:  Christophe Penno; Mary O'Connell Motherway; Yuan Fu; Virag Sharma; Fiona Crispie; Paul D Cotter; Benoit Houeix; Lokesh Joshi; Francesca Bottacini; Aoife O'Dwyer; Gary Loughran; John F Atkins; Douwe van Sinderen
Journal:  Sci Rep       Date:  2022-06-10       Impact factor: 4.996

Review 3.  Unconventional viral gene expression mechanisms as therapeutic targets.

Authors:  Jessica Sook Yuin Ho; Zeyu Zhu; Ivan Marazzi
Journal:  Nature       Date:  2021-05-19       Impact factor: 49.962

Review 4.  Contribution of yeast models to virus research.

Authors:  R Sahaya Glingston; Jyoti Yadav; Jitika Rajpoot; Neha Joshi; Shirisha Nagotu
Journal:  Appl Microbiol Biotechnol       Date:  2021-06-04       Impact factor: 4.813

Review 5.  From Recoding to Peptides for MHC Class I Immune Display: Enriching Viral Expression, Virus Vulnerability and Virus Evasion.

Authors:  John F Atkins; Kate M O'Connor; Pramod R Bhatt; Gary Loughran
Journal:  Viruses       Date:  2021-06-27       Impact factor: 5.048

6.  Modulation of HIV-1 Gag/Gag-Pol frameshifting by tRNA abundance.

Authors:  Natalia Korniy; Akanksha Goyal; Markus Hoffmann; Ekaterina Samatova; Frank Peske; Stefan Pöhlmann; Marina V Rodnina
Journal:  Nucleic Acids Res       Date:  2019-06-04       Impact factor: 16.971

Review 7.  Mechanisms and biomedical implications of -1 programmed ribosome frameshifting on viral and bacterial mRNAs.

Authors:  Natalia Korniy; Ekaterina Samatova; Maria M Anokhina; Frank Peske; Marina V Rodnina
Journal:  FEBS Lett       Date:  2019-06-20       Impact factor: 4.124

8.  Stimulation of reverse transcriptase generated cDNAs with specific indels by template RNA structure: retrotransposon, dNTP balance, RT-reagent usage.

Authors:  Christophe Penno; Romika Kumari; Pavel V Baranov; Douwe van Sinderen; John F Atkins
Journal:  Nucleic Acids Res       Date:  2017-09-29       Impact factor: 16.971

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.