| Literature DB >> 30375330 |
Natalie Jing Ma1,2, Colin F Hemez2,3, Karl W Barber2,4, Jesse Rinehart2,4, Farren J Isaacs1,2.
Abstract
Organisms possessing genetic codes with unassigned codons raise the question of how cellular machinery resolves such codons and how this could impact horizontal gene transfer. Here, we use a genomically recoded Escherichia coli to examine how organisms address translation at unassigned UAG codons, which obstruct propagation of UAG-containing viruses and plasmids. Using mass spectrometry, we show that recoded organisms resolve translation at unassigned UAG codons via near-cognate suppression, dramatic frameshifting from at least -3 to +19 nucleotides, and rescue by ssrA-encoded tmRNA, ArfA, and ArfB. We then demonstrate that deleting tmRNA restores expression of UAG-ending proteins and propagation of UAG-containing viruses and plasmids in the recoded strain, indicating that tmRNA rescue and nascent peptide degradation is the cause of impaired virus and plasmid propagation. The ubiquity of tmRNA homologs suggests that genomic recoding is a promising path for impairing horizontal gene transfer and conferring genetic isolation in diverse organisms.Entities:
Keywords: E. coli; genetic code; genetics; genomics; horizontal gene transfer; infectious disease; microbiology; ribosomal stalling; ribosome rescue; translation; viral resistance; virus
Mesh:
Substances:
Year: 2018 PMID: 30375330 PMCID: PMC6207430 DOI: 10.7554/eLife.34878
Source DB: PubMed Journal: Elife ISSN: 2050-084X Impact factor: 8.140
Figure 1.A UAG-ending transcript in the genomically recoded organism (GRO) may produce proteins with multiple differing C-termini.
(A) Unassigned codons arise when either the cognate tRNA or release factor recognizing a codon are removed. (B) Since the GRO lacks Release Factor 1 (RF1), ribosomal stalling at the UAG codons results in three possible fates for the nascent protein (blue): (1) suppression of the codon by a near-cognate or suppressor tRNA (yellow) and continued translation, (2) frameshifting of bases along the mRNA transcript into a new reading frame and continued translation (purple), or (3) ribosomal rescue by the ssrA-encoded tmRNA, ArfA, or ArfB proteins. If ribosomal rescue occurs via tmRNA, the resulting protein is tagged with a peptide sequence (red) for degradation, while rescue via ArfA or ArfB results in release of peptide without C-terminal modification.
Figure 2.UAG codons in the genomically recoded organism elicit suppression, frameshifting, and tagging for degradation by the tmRNA.
(A) Schematic of the GFP construct with a C-terminal 6x-His tag and a UAG stop codon, showing 102 nucleotides downstream of the UAG codon and the positions of other stop codons in the downstream tail. (B) Peptides identified from the C-terminus of a UAG-ending GFP construct expressed in the GRO (using libraries detailed in Supplementary file 3 and 4). Purified GFP protein was digested with trypsin, processed via MS/MS, and the resulting data were computationally searched using libraries encoding all possible suppressors and all possible subsequent reading frames. Peptides are mapped to the C-terminus of the original GFP construct and grouped by reading frame, with the number of bases skipped listed in the left column. Green text represents GFP, blue text represents the C-terminal 6xHis tag and unframeshifted readthrough, orange text represents the position of a UAG stop codon, purple text represents frameshifted readthrough, and red text represents the tmRNA tag. Black dashes represent ribosomal frameshifts (Figure 2—source datas 1 and 2). (C) MS-MS spectra for two peptides: the C-terminus of GFP with the appended degradation tag (LEHHHHHHAANDENYALDD) and the C-terminus of GFP demonstrating a + 10 base skip in translation (LEHHHHHHGDPMVR). The other spectra validated from UAG-GFP expressing GRO.AA are shown in Supplementary file 2.
Strains used in this study.
| Strain Abbreviation* | Ancestor (source)† | Genotype | # UAG Codons‡ | RF1 Status§ | Ribosomal rescue gene deletion | ssrA tag Status# | Investigated in |
|---|---|---|---|---|---|---|---|
| GRO.DD. | GRO.AA (this study) | Δ | 0 | +RF1 | n/a | DD | GFP expression for mass spectrometry ( |
| GRO.DD | GRO.AA (this study) | Δ | 0 | ∆RF1 | n/a | DD | GFP expression for mass spectrometry ( |
| ECNR2.AA | MG1655 Δ | 321 | +RF1 | n/a | AA | Fitness, conjugation, and viral infection ( | |
| GRO.AA | ECNR2.AA ( | Δ | 0 | ∆RF1 | n/a | AA | Fitness, conjugation, and viral infection ( |
| GRO.AA.∆ | GRO.AA (this study) | Δ | 0 | ∆RF1 | ∆ | AA | Fitness, conjugation, and viral infection ( |
| GRO.AA.∆ | GRO.AA (this study) | Δ | 0 | ∆RF1 | ∆ | AA | Fitness, conjugation, and viral infection ( |
| GRO.AA.∆ | GRO.AA (this study) | Δ | 0 | ∆RF1 | ∆ | AA | Fitness, conjugation, and viral infection ( |
| GRO.AA.∆ | GRO.AA (this study) | Δ | 0 | ∆RF1 | ∆ | AA | Fitness, conjugation, and viral infection ( |
| GRO.AA.∆ | GRO.AA (this study) | Δ | 0 | ∆RF1 | ∆ | AA | Fitness, conjugation, and viral infection ( |
*All strains derived from ECNR2, as described in Wang et al. (2009).
†See Key Resources Table for additional information on strains and sources. The GenBank accession number for E. coli MG1655 is U00096, and the GenBank accession number for GRO.AA is CP006698.
‡ Out of a total of 321 in the original ECNR2 strain.
§RF1 terminates translation at UAG and UAA. Deletion of RF1 eliminates recognition of UAG during translation; translational termination continues through RF2, which recognizes UAA and UGA.
#The ssrA gene encodes the tmRNA, which appends the ssrA degradation tag to stalled ribosomes. The wild-type sequence is AANDENYAL; mutation of the C-terminus to AANDENYAL slows degradation of peptides to enable detection by mass spectrometry.
Components of peptide library constructed to search and analyze tandem mass spectrometry data.
The LEHHHHHHXXX library was separate from the library that contained the entries of the first three rows of the table (see Supplementary file 3 and 4).
| Library component | Example peptides (from | Enables detection of… | Complete peptide list |
|---|---|---|---|
| Any one of 20 canonical amino acids inserted at the UAG codon | LEHHHHHH | Near-cognate suppression | |
| Any length of C-tail following UAG codon to the next non-UAG stop codon or to 38 amino acids downstream of the UAG codon, whichever came first | ALGDPMVR | Readthrough, frameshifting, and rescue by ArfA or ArfB | |
| AANDENYALDD degradation tag | LEHHHHHHGD | Rescue by tmRNA-SmpB | |
| All peptides of form LEHHHHHHXXX, where X is any amino acid | LEHHHHHH | Loss of translational fidelity |
Figure 3.Deletion of both ssrA and arfB restores protein production in the genomically recoded organism.
(A) Comparison of doubling times for WT and GRO strains carrying listed deletions with and without GFP induction. Error bars show standard deviation centered at mean, n = 3; data were analyzed using Source code 1 (Figure 3—source datas 1 and 2). (B) Change in maximum optical density at 600 nm (OD600) due to expression of UAG-GFP or UAA-GFP in wild-type (WT) and GRO strains carrying listed deletions. Error bars show standard deviation centered at mean, n = 3 (Figure 3—source datas 1 and 2). (C) Quantification of GFP abundance per 1 mL of cells at OD600 of 2.5 via western blot from biological replicates of indicated strains (Figure 3—source datas 3–6). Error bars show standard deviation centered at mean, n = 3 (Figure 3—source datas 3–5). See Figure 3—figure supplement 1 for linear calibration curves used to quantify GFP abundance for each replicate experiment. Image of representative western blot is below the graph. p-values are calculated in relation to the GRO containing the UAG-ending GFP (GRO – UAG) and are as follows: * is p≤0.05, ** is p≤0.01, *** is p≤0.001, and **** is p≤0.0001.
File contains doubling times and maximum OD600’s for three separate experiments conducted on different plate reader machines. Each experiment tested each sample in biological triplicate. Only the biological triplicate data from Plate 3 is represented in Figure 3A and B.
Replicate 1 corresponds to the western blot shown in Figure 3—source data 3; Replicate 2 corresponds to the western blot shown in Figure 3—source data 4; Replicate 3 corresponds to the western blot shown in Figure 3—source data 5.
Figure 3—figure supplement 1.Calibration curves used for quantification of GFP yields, as represented in Figure 3C, using GFP samples of known concentration.
Replicate 1 corresponds to the western blot shown in Figure 3—source data 3; Replicate 2 corresponds to the western blot shown in Figure 3—source data 4; Replicate 3 corresponds to the western blot shown in Figure 3—source data 5.
Figure 4.Deleting ssrA restores propagation of both viruses and conjugative plasmids in the genomically recoded organism.
(A) Percent transfer of conjugative plasmid RK2 from a wild-type donor into wild-type (WT), GRO, or GRO with designated deletions (KO) as recipients (Figure 4—source data 1). Data are obtained from technical triplicates generated from a single biological sample. (B) Percent increase in doubling time for strains carrying plasmid RK2 compared to strains lacking RK2 (Figure 4—source datas 2 and 3). (C) Number of conjugation events for conjugative plasmid F from wild-type, GRO, or GRO with designated gene deletions as donors to a wild-type recipient (Figure 4—source data 4). Data are obtained from technical triplicates generated from a single biological sample. (D) Relative titer on wild-type, GRO, and GRO with designated deletions of phage λ (Figure 4—source data 5). Error bars show standard deviation centered at mean, n = 3. p-values are calculated in relation to the GRO condition and are as follows: * is p≤0.05, ** is p≤0.01, *** is p≤0.001, and **** is p≤0.0001. (E) Effects of sequential deletions of ribosomal rescue mechanisms on conjugative plasmid transfer efficiency. (F) Effects of sequential deletions of ribosomal rescue mechanisms on viral susceptibility.
Note: These data represent technical triplicates generated from the same biological sample.
Note: These data represent technical triplicates generated from the same biological sample.
Genetic reagents, bacterial strains, antibodies, and software used in this study.
| Reagent | Designation | Source or | Identifiers | Additional | Isaacs | Full | # UAG | RF1 | Ribosomal | ssrA |
|---|---|---|---|---|---|---|---|---|---|---|
| Gene | pUAG-GFP | this paper | eGFP-6xHis | eGFP protein | Plasmid | N/A | N/A | N/A | N/A | N/A |
| Gene | pUAA-GFP | this paper | eGFP-6xHis | eGFP protein with | Plasmid | N/A | N/A | N/A | N/A | N/A |
| Genetic | RK24 | 10.1126/science | Conjugative RK2 | Strain | N/A | N/A | N/A | N/A | N/A | |
| Genetic | F | Yale University | Conjugative F | Strain | N/A | N/A | N/A | N/A | N/A | |
| Genetic | pZE21_ | this paper | pZEtR-eGFP | pZE21 plasmid | Plasmid | N/A | N/A | N/A | N/A | N/A |
| Genetic | pZE21_ | this paper | pZEtR-eGFP | pZE21 plasmid | Plasmid | N/A | N/A | N/A | N/A | N/A |
| Genetic | λ.CI857 | Coli Genetic | λ.CI857; λ | Phage λ with | Phage | N/A | N/A | N/A | N/A | N/A |
| Cell line | GRO.DD | this paper | C31GIB. | MG1655-derived | Strain | 0 | +RF1 | n/a | DD | |
| Cell line | GRO. | this paper | C31GIB. | MG1655-derived | Strain | 0 | ∆RF1 | n/a | DD | |
| Cell line | ECNR2 | 10.1016/j.cels | ECNR2.Δmut | MG1655-derived | Strain | 321 | +RF1 | n/a | AA | |
| Cell line | GRO.AA | 10.1016/j.cels | C31.final. | MG1655-derived | Strain | 0 | ∆RF1 | n/a | AA | |
| Cell line | GRO. | this paper | C31GIB.arfB: | MG1655-derived | Strain | 0 | ∆RF1 | ∆ssrA | AA | |
| Cell line | GRO. | this paper | C31GIB.ssrA | MG1655-derived | Strain | 0 | ∆RF1 | ∆arfA | AA | |
| Cell line | GRO. | this paper | C31GIB.arfA | MG1655-derived | Strain | 0 | ∆RF1 | ∆arfB | AA | |
| Cell line | GRO.AA | this paper | C31GIB.ΔarfB | MG1655-derived | Strain | 0 | ∆RF1 | ∆ssrA. | AA | |
| Cell line | GRO.AA | this paper | C31GIB.Δarf | MG1655-derived | Strain | 0 | ∆RF1 | ∆arfA. | AA | |
| Antibody | mouse | other | Invitrogen | Invitrogen | N/A | N/A | N/A | N/A | N/A | N/A |
| Antibody | goat | other | AbCam (Ref#: | AbCam (Ref#: | N/A | N/A | N/A | N/A | N/A | N/A |
| Recombinant | ssrA:tolC | this paper; for | dsDNA | The E. coli native | dsDNA | N/A | N/A | N/A | N/A | N/A |
| Recombinant | arfA:tolC | this paper; for | dsDNA | The E. coli native | dsDNA | N/A | N/A | N/A | N/A | N/A |
| Recombinant | arfB:tolC | this paper; for | dsDNA | The E. coli native | dsDNA | N/A | N/A | N/A | N/A | N/A |
| Software, | Doubling | 10.1126/ | Growth_ | Doubling time | N/A | N/A | N/A | N/A | N/A | N/A |
| Software, | MaxQuant | other | N/A | Commercial | N/A | N/A | N/A | N/A | N/A | N/A |
| Software, | Graphpad | other | N/A | Commercial | N/A | N/A | N/A | N/A | N/A | N/A |