Literature DB >> 25341086

Genetically encoded optochemical probes for simultaneous fluorescence reporting and light activation of protein function with two-photon excitation.

Ji Luo¹, Rajendra Uprety, Yuta Naro, Chungjung Chou, Duy P Nguyen, Jason W Chin, Alexander Deiters.

Abstract

The site-specific incorporation of three new coumarin lysine analogues into proteins was achieved in bacterial and mammalian cells using an engineered pyrrolysyl-tRNA synthetase system. The genetically encoded coumarin lysines were successfully applied as fluorescent cellular probes for protein localization and for the optical activation of protein function. As a proof-of-principle, photoregulation of firefly luciferase was achieved in live cells by caging a key lysine residue, and excellent OFF to ON light-switching ratios were observed. Furthermore, two-photon and single-photon optochemical control of EGFP maturation was demonstrated, enabling the use of different, potentially orthogonal excitation wavelengths (365, 405, and 760 nm) for the sequential activation of protein function in live cells. These results demonstrate that coumarin lysines are a new and valuable class of optical probes that can be used for the investigation and regulation of protein structure, dynamics, function, and localization in live cells. The small size of coumarin, the site-specific incorporation, the application as both a light-activated caging group and as a fluorescent probe, and the broad range of excitation wavelengths are advantageous over other genetically encoded photocontrol systems and provide a precise and multifunctional tool for cellular biology.

Entities: CellLine Chemical Disease Gene Mutation Species

Mesh：

Substances：
Molecular Probes
Proteins

Year: 2014 PMID： 25341086 PMCID： PMC4333581 DOI： 10.1021/ja5055862

Source DB: PubMed Journal: J Am Chem Soc ISSN： 0002-7863 Impact factor: 15.419

Introduction

Good photochemical properties, chemical stability, and ease of synthesis make coumarins an important class of fluorescent probes for biological studies.[1−3] In addition to being versatile fluorophores, coumarin chromophores can be used as light-removable protecting groups, so-called “caging groups”, that are photolyzed through one- and two-photon irradiation.[4] Caged molecules have been extensively applied in the optical control of cellular processes.[5−9] In particular, the 6-bromo-7-hydroxycoumarinmethyl caging group undergoes fast two-photon photolysis at 740 nm and has been used to optically control neurotransmitters, secondary messengers, and oligonucleotides.[10−12] Two-photon irradiation enables optical activation of biological processes with enhanced tissue penetration of up to 1 mm. Moreover, two-photon caging groups can be released with greater precision in three-dimensional space than simple one-photon caging groups.[4,13] Here we report the site-specific incorporation of three coumarin amino acids into proteins via genetic code expansion with unnatural amino acids (UAAs)[14−16] to integrate the optical properties of coumarin probes into cellular systems. Genetic code expansion requires the addition of orthogonal translational machinery to achieve site-specific UAA incorporation into proteins. Recent advances in engineering pyrrolysyl-tRNA synthetase/tRNA pairs for the incorporation of sterically demanding amino acids[17−20] prompted us to synthesize coumarin lysines 1–3 (Figure 1A) and to test their incorporation into proteins. The photochemical characteristics of these UAAs complement and enhance the properties of caged and fluorescent amino acids that have been genetically encoded in bacterial and mammalian cells.[19−25] Lysines 1–3 were assembled in three steps from their corresponding coumarin alcohols (Supporting Information, Scheme S1). Briefly, the coumarin alcohols were activated with nitrophenyl chloroformate and coupled to commercially available Boc-lysine. A global deprotection under acidic conditions furnished the corresponding coumarin derivatives 1–3 in good yields.

Figure 1

(A) Structures of the genetically encoded coumarin amino acids for fluorescence reporting and light activation of protein function. (B) Crystal structure of PylRS (2Q7H) with the pyrrolysine substrate (yellow) in the active site. (C) Structure of BhcKRS with 1 (green) docked into the active site. Dashed blue lines represent H-bond interactions. (D) SDS-PAGE analysis of sfGFP-Y151TAG containing 1–3 through incorporation in E. coli. The gel was stained with Coomassie blue (top), and coumarin fluorescence was imaged via excitation at 365 nm (bottom). (E) Fluorescence micrographs of HEK 293T cells expressing the BhcKRS/tRNACUA pair and mCherry-TAG-EGFP-HA in the presence or absence of 1–3. (F) Western blot analysis of cell lysates using an anti-HA antibody and a GAPDH antibody as a loading control. Full-length protein expression is only observed in the presence of 1–3, and incorporation efficiency with all three amino acids is similar in mammalian cells.

All three coumarin lysines 1–3 contain identical benzopyrone cores as fluorescent probes. However, subtle substitutions result in a set of coumarin derivatives with unique photochemical properties. Introduction of a bromine at the 6-position enables decaging not only with UV (single photon) light (in case of 1), but also near IR (two-photon) excitation (in case of 2).[11] In contrast, extension of the coumarin–carbamate linker by a single carbon atom results in coumarin lysine 3, which does not undergo photolysis and thus represents a stable coumarin amino acid probe. Thus, coumarin lysines 1 and 2 can be used as both fluorescent and light-activated probes for optochemical control of protein function using UV or near-IR light, while coumarin lysine 3 may serve as a stable fluorescent probe that does not decage under UV excitation.

Results and Discussion

The Methanosarcina barkeri pyrrolysyl tRNA synthetase/tRNACUA (MbPylRS/tRNACUA) is functional and orthogonal in a wide range of organisms, such as Escherichia coli, yeast, mammalian cells, and animals such as Caenorhabditis elegans and Drosophila melanogaster.[24,26−31] Furthermore, wild-type PylRS recognizes several unnatural amino acids without accepting any of the 20 common amino acids as a substrate.[32] The active site of the PylRS can be further engineered through directed evolution to enable the incorporation of additional unnatural amino acids with new functions, including post-translational modifications, bioconjugation handles, photo-cross-linkers, photocaging groups, and others.[14] Thus, we generated and screened a panel of MbPylRS mutants, guided by mutants that were previously reported,[14] to direct the incorporation of 2 in response to a TAG amber codon in mammalian cells using a mCherry-TAG-EGFP reporter. Cells containing a MbPylRS mutant with only two amino acid mutations Y271A and L274M showed UAA-dependent expression of full-length mCherry-EGFP-HA. The Y271A mutation has previously been reported to direct the incorporation of Nε-carbamate-linked lysines,[33] while the L274M mutation[34,35] was discovered to facilitate higher amber suppression activities with 2 in vivo, because it allows greater flexibility of the side chain and imposes less steric bulk at the back of the hydrophobic pocket. This synthetase, termed BhcKRS, enabled the site-specific incorporation of not only 2 but also 1 and 3 in response to the amber codon TAG within sfGFP-Y151TAG-His6 in E. coli (Figure 1). This is not surprising, considering the very similar structures of 1–3 and previous observations of the high promiscuity of PylRS.[36,37] To further rationalize the ability of BhcKRS to incorporate 1–3, molecular modeling was employed. The wild-type PylRS structure (PDB: 2Q7H) was used as a starting template for which the Y271A and L274M mutations were introduced using Modeller.[38] The mutant structure was energy minimized in Amber molecular dynamics[39] before docking 1–3 into the active site pocket using AutoDock4.[40] As expected, 1–3 adopt very similar poses, reflecting their similarity in structure (see Supporting Information, Figure S1). The mutated synthetase model reveals that the Y271A and L274M mutations greatly enlarge the binding pocket to accommodate the bulky bicyclic caging group, while also orienting it in a favorable π-stacking interaction with W382. This orientation also benefits from a favorable H-bond interaction between the coumarin hydroxyl group and D373. Similar to published crystal structures, the amino group’s positioning is maintained by interactions with a structural water and Y349.[41] It has been previously shown that interactions with N311 and R295 play an important role in amino acid recognition by the PylRS system.[31,41,42] The docked structure maintains these key interactions with the carbamate carbonyl forming a H-bond with N311, while the carboxylic acid forms a H-bond with R295 (Figure 1B,C). SDS-PAGE analysis reveals coumarin fluorescence of the expressed proteins containing the coumarin lysines 1–3. No fluorescence is observed for wild-type sfGFP because its excitation wavelength (488 nm) does not match that of 1–3 (365 nm) and because of the denaturing conditions of the gel. The dependence of protein expression on the presence of 1–3 demonstrates that the engineered BhcKRS synthetase has a high specificity for coumarin lysines and does not significantly incorporate any of the common 20 amino acids. Similar results were obtained for the incorporation of 1–3 into ubiquitin and myoglobin in E. coli. Electrospray ionization mass spectrometry (ESI-MS, Supporting Information, Figures S2–S4) showed that recombinantly expressed sfGFP-1 and -3 have a mass of 28446.22 and 28460.60 Da, in agreement with the expected masses of 28446.03 and 28460.04 Da, respectively. ESI-MS analysis of sfGFP-2 showed a mass of 28445.97 Da, indicating a partial loss of bromine during E. coli expression, possibly due to reductive dehalogenation.[43] Overall, these results demonstrate that 1–3 can be incorporated into proteins in E. coli in good yields (8.0 mg/L, 1.6 mg/L, and 2.5 mg/L, respectively, for sfGFP) and with high specificity. (A) Structures of the genetically encoded coumarin amino acids for fluorescence reporting and light activation of protein function. (B) Crystal structure of PylRS (2Q7H) with the pyrrolysine substrate (yellow) in the active site. (C) Structure of BhcKRS with 1 (green) docked into the active site. Dashed blue lines represent H-bond interactions. (D) SDS-PAGE analysis of sfGFP-Y151TAG containing 1–3 through incorporation in E. coli. The gel was stained with Coomassie blue (top), and coumarin fluorescence was imaged via excitation at 365 nm (bottom). (E) Fluorescence micrographs of HEK 293T cells expressing the BhcKRS/tRNACUA pair and mCherry-TAG-EGFP-HA in the presence or absence of 1–3. (F) Western blot analysis of cell lysates using an anti-HA antibody and a GAPDH antibody as a loading control. Full-length protein expression is only observed in the presence of 1–3, and incorporation efficiency with all three amino acids is similar in mammalian cells. To demonstrate that the coumarin lysines 1–3 can also be genetically incorporated into proteins in mammalian cells, pBhcKRS-mCherry-TAG-EGFP-HA and p4CMVE-U6-PylT were cotransfected into human embryonic kidney (HEK) 293T cells. Cells were incubated for 24 h in the absence of any unnatural amino acid and in the presence of 1–3 (0.25 mM). Fluorescence imaging revealed EGFP expression only in the presence of 1–3, indicating specific incorporation of the coumarin lysines in response to the TAG codon, without measurable incorporation of endogenous amino acids (Figure 1E). This was further confirmed by an anti-HA Western blot on cell lysates from the same experiment (Figure 1F). Furthermore, full-length mCherry-EGFP protein was immunoprecipitated from HEK 293T cells using an immobilized antibody against the HA-tag and mass spectrometry sequencing confirmed that 1–3 are site-specifically incorporated into protein in mammalian cells (Supporting Information, Figure S5). Importantly, the presence of bromine was verified for protein containing 2, confirming the genetic encoding of the Bhc-caged lysine. Because the coumarin groups on 1 and 2 are caging groups that can be removed via light exposure, loss of their intrinsic fluorescence can be used as an indicator of protein decaging through UV irradiation, as shown in Figure 2A,B. This was demonstrated through a UV exposure time-course of purified sfGFP-1, followed by SDS-PAGE analysis. The coumarin fluorescence intensity of sfGFP-1 gradually decreases with extended UV exposure as more of the coumarin caging group is removed from the protein, while the continued presence of the Coomassie-stained protein band indicates stability of the protein. In a cellular context, this may enable experiments that allow for the determination of protein expression, protein localization, and protein decaging using a single optochemical probe in a single experiment. In contrast, insertion of an extra methylene unit between the lysine and the fluorophore fully abrogates photocleavage and thus establishes 3 as a stable amino acid for the site-specific fluorescent labeling of proteins. No change in coumarin fluorescence is observed after UV exposure of sfGFP-3 for 20 min (Figure 2C,D). Due to the identical fluorophores in 1 and 3 and the stability of 3 to the UV irradiation conditions, the loss of protein fluorescence in sfGFP-1 is due to decaging and not due to photobleaching. This is further supported by mass spectroscopic analysis of the proteins before and after UV exposure (see Supporting Information, Figures S2 and S4).

Figure 2

SDS-PAGE fluorescence analysis shows photodecaging of sfGFP-1 while sfGFP-3 is stable to UV exposure. (A) Loss of coumarin fluorescence after extended sfGFP-1 in-gel decaging for 0–50 min (365 nm, transilluminator). (B) Coomassie staining reveals identical sfGFP-1 protein amounts in all lanes. (C) No loss of coumarin fluorescence is observed, since sfGFP-3 does not decage. (D) Coomassie staining reveals identical sfGFP-3 protein amounts in all lanes. (E) Nuclear colocalization of coumarin and EGFP fluorescence in CHO K1 cells cotransfected with pNLS-TAG-EGFP-HA and the BhcKRS/PylT pair (pBhcKRS-4PylT) in the presence of 1 (0.25 mM). A DIC image and a merged image of all three channels are shown as well.

To demonstrate the ability of the genetically encoded coumarin lysines to act as reporters for protein localization in live cells, we investigated their utility as a protein nuclear localization marker. A plasmid was constructed to express EGFP-HA with an N-terminal NLS (nuclear localization signal, pNLS-linker-EGFP-HA),[44] which reliably localizes EGFP to the nucleus (Supporting Information, Figure S6). A TAG amber codon was introduced in the linker between the NLS and EGFP, allowing for site-specific unnatural amino acid incorporation without affecting EGFP formation or nuclear translocation. Cells cotransfected with the pNLS-KTAG-EGFP and BhcKRS/PyltRNACUA plasmid pair in the presence of 1 (0.25 mM) were analyzed for coumarin fluorescence (405 nm excitation, 450–480 nm emission) and EGFP fluorescence (488 nm excitation, 490–520 nm emission) by confocal microscopy. The observation of complete colocalization of both fluorophores in the nucleus (merged micrographs) demonstrates the ability to use 1 as a reporter of protein localization (Figure 2E and Supporting Information, movie S1 and Figure S7). SDS-PAGE fluorescence analysis shows photodecaging of sfGFP-1 while sfGFP-3 is stable to UV exposure. (A) Loss of coumarin fluorescence after extended sfGFP-1 in-gel decaging for 0–50 min (365 nm, transilluminator). (B) Coomassie staining reveals identical sfGFP-1 protein amounts in all lanes. (C) No loss of coumarin fluorescence is observed, since sfGFP-3 does not decage. (D) Coomassie staining reveals identical sfGFP-3 protein amounts in all lanes. (E) Nuclear colocalization of coumarin and EGFP fluorescence in CHO K1 cells cotransfected with pNLS-TAG-EGFP-HA and the BhcKRS/PylT pair (pBhcKRS-4PylT) in the presence of 1 (0.25 mM). A DIC image and a merged image of all three channels are shown as well. To apply the coumarin lysines 1–3 in the optical control of protein function in live cells, firefly luciferase (Fluc) was selected as an initial target because bioluminescence measurements afford low background, high sensitivity, and easy quantification. On the basis of the Fluc crystal structure, a critical lysine residue, K206, was identified, which is positioned at the edge of the substrate-binding pocket (Figure 3B). It has been proposed that this residue stabilizes and orients ATP in the active site.[45,46] The ε-amino group on K206 provides a hydrogen-bond interaction with the γ-phosphate of ATP and promotes the adenylation reaction with luciferin, thus being essential for catalytic activity as shown by the dramatic decrease in enzymatic activity displayed by the K206R mutant.[45] Therefore, we hypothesized that a sterically demanding coumarin caging group placed on K206 would prevent the interaction with ATP and limit the overall access of the substrates to the active site (Figure 3A). Photolysis of the coumarin lysine would remove the caging group and produce a native lysine residue, restoring the catalytic activity of the enzyme (Figure 3B). A genetically encoded photocaged lysine at K206 would enable the enhanced regulation of the catalytic activity of firefly luciferase via light activation.

Figure 3

Engineering of an optochemically controlled Photinus pyralis firefly luciferase through unnatural amino acid mutagenesis. (A) Caging groups at position K206 are blocking access to the binding pocket by luciferin and ATP and are disrupting a required hydrogen bonding network. (B) After decaging, wild-type Fluc is generated and the substrates can now enter the active site. PDB: 2D1S. (C) Bright-Glo luciferase assay of cells that were either kept in the dark or irradiated (365 nm, 4 min). Chemiluminescence units were normalized to the −UAA/–UV control. No enzymatic activity was observed for the caged proteins, and significant increases in luminescence were observed after photolysis of luciferase containing 1 or 2, while the K206 → 3 mutant was permanently deactivated, as expected. Error bars represent standard deviations from three independent experiments.

Site-directed mutagenesis of the corresponding K206 residue to the amber codon (TAG) enabled incorporation of 1–3 into firefly luciferase in mammalian cells. HEK 293T cells were cotransfected with the mutated firefly luciferase plasmid (pGL3-K206TAG) and the MbBhcKRS/PyltRNACUA pair (pBhcKRS-4PylT) in the presence of 1–3 (0.25 mM). After 24 h incubation, the cells were either irradiated for 4 min (365 nm, 25 W) or kept in the dark. The incorporation of 1–3 into Fluc caused complete inhibition of luciferase activity before UV irradiation, as determined by a Bright-Glo luciferase assay, comparable to the negative control (no unnatural amino acid). After UV irradiation, 1 and 2 were decaged to produce native lysine, resulting in the activation of firefly luciferase by 34-fold and 31-fold, respectively (Figure 3C). As expected, 3 did not show any activation of luciferase enzymatic activity upon illumination, as it does not undergo decaging. Therefore, the activity of firefly luciferase can be tightly optochemically regulated by incorporation of a coumarin lysine residue into the active site of the luciferase protein. Interestingly, attempts to apply 1 and 2 at position K529, another site that can be used for optical control of luciferase function,[47] led to greatly diminished luciferase activity, while introduction of our previously reported o-nitrobenzyl-caged lysine[24,48,49] worked at both positions K206 and K529. Western blots confirmed that both Fluc-K206 → 1 and Fluc-K529 → 1 were expressed at similar levels in mammalian cells (Supporting Information, Figures S8 and S9). Engineering of an optochemically controlled Photinus pyralis firefly luciferase through unnatural amino acid mutagenesis. (A) Caging groups at position K206 are blocking access to the binding pocket by luciferin and ATP and are disrupting a required hydrogen bonding network. (B) After decaging, wild-type Fluc is generated and the substrates can now enter the active site. PDB: 2D1S. (C) Bright-Glo luciferase assay of cells that were either kept in the dark or irradiated (365 nm, 4 min). Chemiluminescence units were normalized to the −UAA/–UV control. No enzymatic activity was observed for the caged proteins, and significant increases in luminescence were observed after photolysis of luciferase containing 1 or 2, while the K206 → 3 mutant was permanently deactivated, as expected. Error bars represent standard deviations from three independent experiments. To observe the optical triggering of protein function via decaging of 1 and 2 in real time, enhanced green fluorescent protein (EGFP) was selected as a second target protein for caging. EGFP consists of an 11-stranded β-barrel and a central α-helix with the Thr65-Tyr66-Gly67 chromophore.[50] The chromophore plays a crucial role in EGFP fluorescence and stability.[51] Correctly folded EGFP is a prerequisite for mature chromophore formation, with a number of lysine residues being essential to its successful folding.[52] Most notable is that only 1 lysine (K85) out of 20 is buried within the protein.[52] K85 forms a salt bridge with D82 and H-bonding interactions with the backbone of C70 and S72,[52] all of which are in close proximity to the chromophore (Figure 4A). It has been shown that C70, S72, and D82 are key residues for control of chromophore formation and oxidation.[53,54] We hypothesized that introduction of coumarin-caged lysines 1 and 2 at K85 would affect D82, C70, and S72, interrupting the α-helix bending and thus indirectly inhibiting chromophore maturation. To this end, we envisioned that UV activation would yield native EGFP that rapidly undergoes maturation. An EGFP mutant with an amber codon at position K85 (pEGFP-K85TAG) was generated as a fusion construct with mCherry, to provide a second reporter for successful plasmid transfection and incorporation of 1 and 2. HEK 293T cells were cotransfected with pEGFP-K85TAG-mCherry and the BhcKRS/PyltRNA pair in the presence of 1 and 2 (0.25 mM). After 24 h, the cells were washed and incubated in fresh media for 1 h. Cells expressing mCherry were observed by fluorescence imaging to confirm that EGFP-1/2-mCherry is generated in the presence of 1 or 2. Cells were irradiated for 30 s at 365 nm, and fluorescence was imaged by time-lapse microscopy. After photolysis of EGFP-1, green fluorescence started to appear around 10 min, and over time the fluorescence intensity gradually increased, reaching a plateau at 120 min (Figure 4D and Supporting Information, movie S2). A half-life of 49 min was observed, matching reports of EGFP chromophore maturation as the rate-limiting step.[55] Previous measurements of EGFP folding and maturation have been exclusively performed in test tubes.[56] No cellular studies have been conducted, as a precise starting point for kinetic analysis could not be provided.

Figure 4

(A) Location of K85 (yellow) and interactions with D82, C70, and S72 in EGFP. The chromophore is shown in magenta. (B) Schematic of the pEGFP-K85TAG-mCherry construct and its application in light activation studies. (C) Fluorescence imaging of HEK 293T cells expressing EGFP-K85TAG-mCherry, 90 min after irradiation at 365 nm (30 s, DAPI filter, 358–365 nm) in the presence of 1 (Nikon A1R confocal microscope, 20× objective, 2-fold zoom). (D) Normalized EGFP fluorescence as a function of time after 365 nm light activation (error bars represent standard deviations from the measurement of three independent cells, t1/2 = 49 min). Given that the coumarin lysines have relatively broad absorption bands in the 300–420 nm range that enable decaging at longer wavelengths, we speculated that irradiation at 405 nm[1,57] may efficiently activate 1 and 2. Thus, activation through blue light irradiation using a standard laser-scanning confocal microscope was tested. As expected, exposure at 405 nm induced fluorophore formation of EGFP-1 and -2 (Figure 5A,B). Because attempts to decage a previously incorporated nitrobenzyloxycarbonyl lysine[24,48,49] and nitrobenzyl tyrosine[19,58,59] at 405 nm were not successful on comparable time scales and at comparable illumination power (data not shown), the caged lysines 1 and 2 may enable multiwavelength activation of proteins caged with the two different optical probes.

Figure 5

Fluorescence confocal imaging of COS-7 cells expressing EGFP-KTAG-mCherry, before and after irradiation at 405 nm (30 mW diode laser, 20% laser power, 12.6 μs dwell time, 8 cycles) in the presence of 2 (A) or 1 (B) (Zeiss confocal LSM710 microscope, 40× water objective). Similar light-activation experiments before and after irradiation of HEK 293T cells at 760 nm (130 mW, 2 μm/s dwell time, 30 cycles, Olympus Fluoview FV1000 MPE, MaiTai DSBB-OL IR pulsed laser), in the presence of 2 (C) or 1 (D), imaged with a Olympus Fluoview1000, 40× oil objective.

Taking advantage of the two-photon decaging feature of 2,[11] photocontrol of EGFP folding by two-photon activation of EGFP-2 was performed. HEK 293T cells were cotransfected with pEGPF-K85TAG-mCherry and pBhcKRS-4PylT in the absence or presence of 1 and 2 (0.25 mM). After a 24 h incubation, the cells were washed and incubated in fresh media for 1 h and irradiated with a multiphoton laser (760 nm, 130 mW, 2 μm/s dwell time, 30 cycles, Olympus Fluoview FV1000 MPE Multiphoton laser scanning microscope FV10-ASW, MaiTai DSBB-OL IR pulsed laser). Images were acquired before and after two-photon irradiation using both EGFP (488 nm) and mCherry (561 nm) excitation. Gratifyingly, an EGFP fluorescent signal was observed after photolysis of 2 at 760 nm (Figure 5C). The cells expressing EGFP containing 1, as a control, were also exposed to two-photon excitation (760 nm) and imaged in the same fashion (Figure 5D); no EGFP activation was observed. In addition to the increased three-dimensional resolution that is provided through two-photon excitation, effectively shifting the activation wavelength to the near-IR will enable multiwavelength activation in conjunction with other optically triggered biological processes, while also preventing any overlap with established fluorescent reporter proteins. Fluorescence confocal imaging of COS-7 cells expressing EGFP-KTAG-mCherry, before and after irradiation at 405 nm (30 mW diode laser, 20% laser power, 12.6 μs dwell time, 8 cycles) in the presence of 2 (A) or 1 (B) (Zeiss confocal LSM710 microscope, 40× water objective). Similar light-activation experiments before and after irradiation of HEK 293T cells at 760 nm (130 mW, 2 μm/s dwell time, 30 cycles, Olympus Fluoview FV1000 MPE, MaiTai DSBB-OL IR pulsed laser), in the presence of 2 (C) or 1 (D), imaged with a Olympus Fluoview1000, 40× oil objective.

Summary

The site-specific genetic incorporation of three new coumarin lysine analogues 1–3 into proteins was achieved in bacterial and mammalian cells using an engineered BhcKRS synthetase system. The genetically encoded coumarin lysines were successfully applied as fluorescent cellular probes for protein localization, and the small size of these coumarin lysines is expected to minimally perturb protein structure and function, unless they are placed at critical sites. In addition to their small size, the spectral properties of 1–3 do not interfere with common fluorescent proteins (e.g., EGFP). While the amino acid 3 showed stability under irradiation conditions, the coumarins 1 and 2 were readily decaged, generating wild-type lysine residues. As a proof-of-principle, photoregulation of firefly luciferase was achieved in live cells by caging a key lysine residue, and excellent OFF to ON light-switching ratios were observed for 1 and 2. As expected, the stable fluorescent amino acid 3 did not undergo photolysis. Furthermore, two-photon and single-photon optochemical control of EGFP maturation was demonstrated, enabling the use of different, potentially orthogonal, excitation wavelengths (365, 405, and 760 nm) for the sequential activation of protein function in live cells. While the caged lysine 2 could be activated using two-photon irradiation at 760 nm, the lysine 1 was stable under these conditions. However, decaging of 1 was readily achieved with blue light of 405 nm, while a previously encoded o-nitrobenzyl-caged lysine requires UV activation.[24,48,49] These results demonstrate that coumarin lysines are a new and valuable class of optical probes that can potentially be used for the investigation and regulation of protein structure, dynamics, function, and localization in live cells. The small size of coumarin, the application as both a light-activated caging group and a fluorescent probe, and the broad range of excitation wavelengths are advantageous over other genetically encoded photocontrol systems and provide a unique and multifunctional tool for cellular biology. The ability to incorporate all three coumarin lysines with the same PylRS/tRNACUA pair further facilitates their application.

Experimental Section

Cloning

(1) Construction of pNLS-TAG-EGFP-HA: The pTAG-EGFP-HA fragment was amplified from pmCherry-TAG-EGFP-HA using the PCR primers G1/G2, digested with HindIII and BglII, and ligated into pEGFP-N1 (Clontech), generating the pTAG-EGFP-HA plamid. The pNLS PCR fragment was obtained by using primers N1/N2 and then ligated into the HindIII and XbaI sites of pTAG-EGFP-HA to generate the pNLS-TAG-EGFP-HA plasmid. (2) Construction of pNLS-WT-EGFP-HA: Plasmids were obtained by converting the TAG codon of pNLS-TAG-EGFP-HA into an AAG (Lys) codon using primers QC1/QC2 and a QuikChange site-directed mutagenesis kit (Agilent). (3) Construction of pBhcKRS-4PylT: The plasmid was obtained by ligating the p4CMVE-U6-PylT fragment from pMbPylT between the restriction sites NheI and MfeI sites of pMbBhcKRS.

Expression and Purification of Proteins in E. coli

The plasmid, pBAD-sfGFP-Y151TAG-pylT was cotransformed with pBK-BhcKRS[24] into E. coli Top10 cells. A single colony was grown in LB media overnight, and 500 μL of the overnight culture was added to 25 mL of LB media, supplemented with 1 mM of the designated unnatural amino acid and 25 μg/mL of tetracycline and 50 μg/mL of kanamycin. Cells were grown at 37 °C, 250 rpm, and protein expression was induced with 0.1% arabinose when the OD600 reached ∼0.6. After overnight expression at 37 °C, cells were harvested and washed by PBS. The cell pellets were resuspended in 6 mL of phosphate lysis buffer (50 mM, pH 8.0) and Triton X-100 (60 μL, 10%), gently mixed, and incubated for 1 h at 4 °C. The cell mixtures were sonicated, and the cell lysates were centrifuged at 4 °C, 13 000 g, for 10 min. The supernatant was transferred to a 15 mL conical tube, and 100 μL of Ni-NTA resin (Qiagen) was added. The mixture was incubated at 4 °C for 2 h under mild shaking. The resin was then collected by centrifugation (1000g, 10 min), washed twice with 400 μL of lysis buffer, and followed by two washes with 400 μL of wash buffer containing 20 mM imidazole. The protein was eluted with 400 μL of elution buffer containing 250 mM imidazole. The purified proteins were analyzed by 10% SDS-PAGE and stained with Coomassie Blue.

Protein Analysis by ESI-MS

Two different instruments were used: (A) Protein samples were analyzed using capillary LC ESI-TOF MS. The protein samples were loaded onto a PRLP-S column (Thermo Fisher 5 μm, 1000 A, 300 μm i.d. × 100 mm) on an LC system (Ultimate 3000, Dionex, Sunnyvale, CA). The LC system was directly coupled to an electrospray ionization time-of-flight mass spectrometer (microTOF, BrukerDaltonics, Billerica, MA). Chromatographic separation was performed at a constant flow rate of 3.5 μL/min using a binary solvent system (solvent A: 2.5% acetonitrile and 0.1% formic acid; solvent B: 80% acetonitrile and 0.1% formic acid) and a linear gradient program (0–5 min, 5% B; 5–10 min, 5–30% B; 10–30 min, 30–75% B; 30–35 min, 75–100% B; 35–45 min, 100–5% B; 45–60 min, 5% B). Mass spectra were acquired in positive ion mode over the mass range m/z 50 to 3000. ESI spectra were deconvoluted with the MaxEnt algorithm (Data Analysis 3.3, Bruker Daltonics, Billerica, MA), obtaining molecular ion masses with a mass accuracy of 1–2 Da. (B) High-resolution exact mass measurement were conducted on an Agilent Technologies (Santa Clara, CA) 6210 LC-TOF mass spectrometer. Samples were analyzed via a 1 μL flow injection at 300 μL/min in a water:methanol mixture (25:75 v/v) with 0.1% formic acid. The mass spectrometer was operated in positive ion mode with a capillary voltage of 4 kV, nebulizer pressure of 35 psi, and a drying gas flow rate of 12 L/min at 350 °C. The fragmentor and skimmer voltages were 200 and 60 V, respectively. Reference ions of purine at m/z 121.0509 and HP-0921 at m/z 922.0098 were simultaneously introduced via a second orthogonal sprayer and used for internal calibration.

Coumarin Lysine Incorporation in Human Cells

Human embryonic kidney (HEK) 293T cells were grown in DMEM (Dulbecco’s Modified Eagle Medium, Gibco) supplemented with 10% FBS (Gibco), 1% Pen-Strep (Corning Cellgro), and 2 mM l-glutamine (Alfa Aesar) in 96-well plates (Costar) in a humidified atmosphere with 5% CO2 at 37 °C. HEK 293T cells were transiently transfected with the pMbBhcKRS-mCherry-TAG-EGFP-HA and p4CMVE-U6-PylT[24] at ∼75% confluency in the presence or absence of 1, 2, and 3 (0.25 mM) in 96-well plates. Double transfections were performed with equal amounts of both plasmids. After an overnight incubation at 37 °C, the cells were washed by PBS and imaged with a Zeiss Axio Observer.Z1Microscope (10× objective). To confirm the expression of the fusion protein and also differentiate between expression levels, a Western blot was performed. HEK 293T cells were cotransfected with pMbBhcKRS-mCherry-TAG-EGFP-HA and p4CMVE-U6-PylT in the presence or absence of 1, 2, and 3 (0.25 mM) in six-well plates. After 24 h of incubation, the cells were washed by chilled PBS, lysed in mammalian protein extraction buffer (GE Healthcare) with complete protease inhibitor cocktail (Sigma) on ice, and the cell lysates were cleared at 13 200 rpm centrifugation (4 °C, 20 min). The protein lysate was boiled with loading buffer and then analyzed by 10% SDS-PAGE. After gel electrophoresis and transfer to a PVDF membrane (GE Healthcare), the membrane was blocked in TBS with 0.1% Tween 20 (Fisher Scientific) and 5% milk for 1 h. The blots were probed and incubated with the primary antibody, α-HA-probe (Y-11) rabbit polyclonal lgG (sc-805, Santa Cruz Biotech), overnight at 4 °C, followed by a fluorescent secondary antibody, goat-α-rabbit lgG Cy3 (GE Healthcare), for 1 h at room temperature. The binding and washing steps were performed in TBS with 0.1% Tween 20.

Protein Sequencing by LC-MS/MS

HEK 293T cells were transfected with pBhcKRS-mCherry-TAG-EGFP-HA and p4CMVE-U6-PylT in a 10 cm Petri dish and incubated with DMEM containing 1, 2, or 3 (0.25 mM) for 24 h. Cells were lysed with extraction buffer (GE Healthcare) and the mCherry-1/2/3-EGFP-HA protein was immunoprecipitated using the Pierce HA Tag IP/Co-IP kit (Pierce) according to manufacturer’s protocol. The proteins were separated on SDS-PAGE gels and stained with silver stain. Regions corresponding to the expected molecular weight of mCherry-EGFP-HA were excised, washed with HPLC water, and destained with 50% acetonitrile/25 mM ammonium bicarbonate until no visible staining. Gel pieces were dehydrated with 100% acetonitrile and reduced with 10 mM dithiothreitol at 56 °C for 1 h, followed by alkylation with 55 mM iodoacetamide at room temperature for 45 min in the dark. Gel pieces were then again dehydrated with 100% acetonitrile to remove excess alkylating and reducing agents and rehydrated with 20 ng/μL trypsin/25 mM ammonium bicarbonate and digested overnight at 37 °C. The resultant tryptic peptides were extracted with 70% acetonitrile/5% formic acid, speed-vac dried, and reconstituted in 18 μL of 0.1% formic acid. Tryptic digests were analyzed by reverse-phased LC-MS/MS using a nanoflow LC (Waters nanoACQUITY UPLC system, Waters Corp., Milford, MA) coupled online to an LTQ/Orbitrap Velos hybrid mass spectrometer (Thermo-Fisher, San Jose, CA). Separations were performed using a C18 column (PicoChip column packed with 10.5 cm Reprosil C18 3 μm 120 Å chromatography media with a 75 μm ID column and a 15 μm tip, New Objective, Inc., Woburn, MA). Mobile phase A was 0.1% formic acid in water, and mobile phase B was 0.1% formic acid in acetonitrile. Samples were injected onto a trap column (nanoACQUITY UPLC trap column, Waters Corp., Milford, MA) and washed with 1% mobile phase B at a flow rate of 5 μL/min for 3 min. Peptides were eluted from the column using a 90 min gradient running at 300 nL/min (5% B for 3 min, 5–36% B in 62 min, 36–95% B in 2 min, 95% B for 8 min, 95%–5% B in 1 min, 5% B for 16 min). The LTQ/Orbitrap instrument was operated in a data-dependent MS/MS mode in which each high resolution broad-band full MS spectra (R = 60 000 at mass to charge (m/z) 400, precursor ion selection range of m/z 300 to 2000) was followed by 13 MS/MS scans in the linear ion trap where the 13 most abundant peptide molecular ions dynamically determined from the MS scan were selected for tandem MS using a relative collision-induced dissociation (CID) energy of 35%. Dynamic exclusion was enabled to minimize redundant selection of peptides previously selected for CID. MS/MS spectra were searched with the MASCOT search engine (version 2.4.0, Matrix Science Ltd.) against a UniProt jellyfish proteome database (June 2014 release) from the European Bioinformatics Institute (http://www.ebi.ac.uk/integr8) combined with endogenous mCherry-EGFP fasta sequences. The following modifications were used: static modification of cysteine (carboxyamidomethylation, +57.0214 Da) and variable modification of methionine (oxidation, +15.9949 Da) for all searches, variable modifications of lysine for mCherry-EGFP-HA (1, +218.17 Da; 2, +295.93 Da; 3, +231.03 Da). The mass tolerance was set at 20 ppm for the precursor ions and 0.8 Da for the fragment ions. Peptide identifications were filtered using PeptideProphet and ProteinProphet algorithms with a protein threshold cutoff of 99% and peptide threshold cutoff of 95% implemented in Scaffold (Proteome Software, Portland, OR).

Expression of Caged Firefly Luciferase and Light Activation

HEK 293T cells were cultured in DMEM (Dulbecco’s Modified Eagle Medium, Gibco) supplemented with 10% FBS (Gibco), 1% Pen-Strep (Gibco), and 2 mM l-glutamine (Alfa Aesar) in 96-well plates (BD Falcon) in a humidified atmosphere with 5% CO2 at 37 °C. At 80–90% confluency, cells seeded on plates were transfected and the medium was changed to fresh DMEM supplemented without or with 1, 2, or 3 (0.25 mM). The plasmid pMbBhcKRS-4PylT was constructed containing both CMV-MbBhcKRS and 4CMVE-U6-PylT. A TAG amber stop codon was introduced at the K206 site using primers GL1/GL2 and a QuikChange mutagenesis kit (Agilent Technologies). A pGL3-control plasmid containing the gene encoding P. pyralis firefly luciferase with the TAG amber mutation at residue K206 (pGL3-K206TAG) was cotransfected into cells with the plasmid pBhcKRS-4PylT using linear PEI according to the manufacturer’s protocol (Millipore). After double transfection and 24 h incubation, the medium was changed to DMEM without phenol red, and the cells were irradiated with UV light (365 nm) for 4 min using a 365 nm UV lamp (high performance UV transilluminator, UVP, 25 W) or kept in the dark. Cells were lysed by addition of 100 μL of substrate solution (Promega) in a 96-well plate (BD Falcon), and luminescence was measured on a Synergy 4 multimode microplate reader with an integration time of 2 s and a sensitivity of 150 or on a Tecan M1000 microplate reader with an integration time of 1 s.

Visualization of Nuclear Localization through Coumarin Lysine Incorporation

CHO K1 cells were plated into a polylysine-coated four-well chamber slide (Lab-Tek) and, after incubation to 75% confluency, were transfected with 1 μg of pNLS-KTAG-EGFP and pBhcKRS-4PylT each. After 16 h incubation at 37 °C/5% CO2 in DMEM with 10% FBS in the presence of 1 (0.25 mM), cells were washed with DMEM without phenol red and then incubated for 2 h. The cells were washed with PBS, fixed with 4% formaldehyde, and stained with rhodamine–phalloidin (Life Technologies). The chamber slide was dried in the dark overnight and cells were imaged on a Zeiss 710 confocal microscope (40× water objective).

One-Photon Light Activation of EGFP

HEK 293T cells were plated into a poly-d-lysine-coated eight-well chamber slide (Lab-Tek). After incubation to 70% confluency, cells were transfected with pEGFP-K85TAG-mCherry and pBhcKRS-4PylT (200 ng each). After a 20 h incubation at 37 °C/5% CO2 in DMEM with 10% FBS in the presence of 1 (0.25 mM), cells were washed with DMEM without phenol red and then incubated for 1 h. Before light activation, mCherry-expressing cells were identified using the TXRED channel, and imaged with a Nikon A1Rsi confocal microscope (20× objective, 2-fold zoom, EGFP (ex. 488 nm) and mCherry (ex. 560 nm) channels). Subsequently, cells were illuminated for 15 s at 365 nm light (DAPI filter, 358–365 nm), and EGFP and mCherry fluorescence was acquired by time-lapse imaging (every 1 min for the first 15 min, every 5 min for the following 150 min, scan resolution 512 × 512, scan zoom 2× , dwell time 1.9 ms). The mean EGFP fluorescence intensities were quantified using Nikon Elements software.

Two-Photon Light Activation of EGFP

HEK 293T cells were plated into a polylysine-coated μ-dish (ibidi), and after incubation to 50% confluency, the cells were transfected with 1 μg each of pEGFP-KTAG-mCherry and pBhcKRS-4PylT. After a 20 h incubation at 37 °C/5% CO2 in DMEM with 10% FBS in the presence of 1 or 2 (0.25 mM, 0.5% DMSO), cells were washed with DMEM without phenol red and then incubated for 1 h. Cells were imaged with an Olympus Fluoview confocal microscope before two-photon irradiation (40× oil objective, EGFP (ex. 488 nm) and mCherry (ex. 560 nm) channels), imaging positions for mCherry-expressing cells were recorded, and the cell μ-dish was transferred to an Olympus multiphoton microscope for irradiation (Olympus Fluoview FV1000 MPE). Cells were localized at the previously recorded positions, focused using the mCherry channel, and then irradiated using a 760 nm laser (130 mW, 5% of laser power, 30 cycles of scanning, 2 μm/s dwell time, MaiTai DSBB-OL IR pulsed laser). After irradiation, the cell μ-dish was transferred back to the original microscope for imaging.

Mutant PylRS Structure Modeling and Energy Minimization

The initial template structure of PDB 2Q7H was chosen as a starting point for all modeling. The missing loops were remodeled using MODELER and the two point mutations (Y271A and L274M) were constructed using the mutate_model.py script provided by MODELER. Superposition of PDB 2Q7G on top of 2Q7H provided the coordinates for the incorporation of ATP and magnesium ions into the newly mutated structure. The ATP and magnesium ions were parametrized in antechamber using previously developed parameters.[60,61] The mutated structure was imported into AMBER12 software using the AMBER FF99SBILDN force field.[62] The protein was placed into a cubic box with a 12.0-Å border, solvated with 17 316 water molecules, and charge neutralized with the addition of six sodium ions. This system was energy minimized first with 5000 steps steepest descent method, followed by 15 000 steps conjugate gradient method with 5 kcal/mol restraints on all atoms. This was followed by another 5000 steps steepest descent method, followed by 15 000 steps conjugate gradient method with 2 kcal/mol restraints on all atoms except Y271A and L274M. The resulting energy-minimized structure was used as the starting structure for all our docking experiments. All AMBER12 computational experiments were completed on the Center for Simulation and Modeling (SAM) Frank supercomputer at the University of Pittsburgh.

Molecular Docking Experiments

The energy-minimized mutant structure was prepared for docking with AutoDock4 by removing all sodium ions, and all water molecules except for a single water molecule which exists in the active site pocket of the protein. This structural water molecule is present in all available crystal structures and plays an important role in amino acid recognition. The receptor input file was prepared using AutoDock Tools software.[40] The side chains for residue L274M were treated as flexible, while all other side chains were kept rigid. The unnatural amino acid ligands were constructed using ChemBioDraw3D, and the molecular geometry was optimized using the MMFF94 force field.[63] The ligand input files were prepared for docking using AutoDock Tools as well. Lamarkian genetic algorithm was used for docking with the following parameters: number of runs: 75, ga_pop_size 150, ga_num_evals 250 000 000, ga_num_generations 27 000 were set, all other parameters were kept default. Docking results were clustered based on RMSD of each pose. Each coumarin lysine yielded a low energy cluster with binding scores of −9.63 kJ/mol, −6.18 kJ/mol, and −6.14 kJ/mol for 1, 2, and 3, respectively.

64 in total

Review 1. Deep tissue two-photon microscopy.

Authors: Fritjof Helmchen; Winfried Denk
Journal: Nat Methods Date: 2005-12 Impact factor: 28.547

Review 2. Light-controlled tools.

Authors: Clara Brieke; Falk Rohrbach; Alexander Gottschalk; Günter Mayer; Alexander Heckel
Journal: Angew Chem Int Ed Engl Date: 2012-07-24 Impact factor: 15.336

Review 3. Fluorescent amino acids: modular building blocks for the assembly of new tools for chemical biology.

Authors: Andrew T Krueger; Barbara Imperiali
Journal: Chembiochem Date: 2013-04-22 Impact factor: 3.164

4. Photo-mediated gene activation using caged RNA/DNA in zebrafish embryos.

Authors: H Ando; T Furuta; R Y Tsien; H Okamoto
Journal: Nat Genet Date: 2001-08 Impact factor: 38.330

Review 5. Beta-barrel scaffold of fluorescent proteins: folding, stability and role in chromophore formation.

Authors: Olesya V Stepanenko; Olga V Stepanenko; Irina M Kuznetsova; Vladislav V Verkhusha; Konstantin K Turoverov
Journal: Int Rev Cell Mol Biol Date: 2013 Impact factor: 6.813

6. Genetic encoding of caged cysteine and caged homocysteine in bacterial and mammalian cells.

Authors: Rajendra Uprety; Ji Luo; Jihe Liu; Yuta Naro; Subhas Samanta; Alexander Deiters
Journal: Chembiochem Date: 2014-06-27 Impact factor: 3.164

7. Genetic Encoding of bicyclononynes and trans-cyclooctenes for site-specific protein labeling in vitro and in live mammalian cells via rapid fluorogenic Diels-Alder reactions.

Authors: Kathrin Lang; Lloyd Davis; Stephen Wallace; Mohan Mahesh; Daniel J Cox; Melissa L Blackman; Joseph M Fox; Jason W Chin
Journal: J Am Chem Soc Date: 2012-06-13 Impact factor: 15.419

8. Improved side-chain torsion potentials for the Amber ff99SB protein force field.

Authors: Kresten Lindorff-Larsen; Stefano Piana; Kim Palmo; Paul Maragakis; John L Klepeis; Ron O Dror; David E Shaw
Journal: Proteins Date: 2010-06

9. Expanding the genetic code of an animal.

Authors: Sebastian Greiss; Jason W Chin
Journal: J Am Chem Soc Date: 2011-08-22 Impact factor: 15.419

10. Expanding the genetic code of yeast for incorporation of diverse unnatural amino acids via a pyrrolysyl-tRNA synthetase/tRNA pair.

Authors: Susan M Hancock; Rajendra Uprety; Alexander Deiters; Jason W Chin
Journal: J Am Chem Soc Date: 2010-10-27 Impact factor: 15.419

44 in total

1. Cell-Lineage Tracing in Zebrafish Embryos with an Expanded Genetic Code.

Authors: Wes Brown; Jihe Liu; Michael Tsang; Alexander Deiters
Journal: Chembiochem Date: 2018-05-18 Impact factor: 3.164

2. Systematic Evaluation of Soluble Protein Expression Using a Fluorescent Unnatural Amino Acid Reveals No Reliable Predictors of Tolerability.

Authors: Zachary M Hostetler; John J Ferrie; Marc R Bornstein; Itthipol Sungwienwong; E James Petersson; Rahul M Kohli
Journal: ACS Chem Biol Date: 2018-09-20 Impact factor: 5.100

3. Optical Control of CRISPR/Cas9 Gene Editing.

Authors: James Hemphill; Erin K Borchardt; Kalyn Brown; Aravind Asokan; Alexander Deiters
Journal: J Am Chem Soc Date: 2015-04-23 Impact factor: 15.419

Review 4. Optochemical Control of Biological Processes in Cells and Animals.

Authors: Nicholas Ankenbruck; Taylor Courtney; Yuta Naro; Alexander Deiters
Journal: Angew Chem Int Ed Engl Date: 2018-02-01 Impact factor: 15.336

5. Visible-to-NIR-Light Activated Release: From Small Molecules to Nanomaterials.

Authors: Roy Weinstain; Tomáš Slanina; Dnyaneshwar Kand; Petr Klán
Journal: Chem Rev Date: 2020-10-30 Impact factor: 60.622

6. Optimizing the Genetic Incorporation of Chemical Probes into GPCRs for Photo-crosslinking Mapping and Bioorthogonal Chemistry in Live Mammalian Cells.

Authors: Robert Serfling; Lisa Seidel; Thore Böttke; Irene Coin
Journal: J Vis Exp Date: 2018-04-09 Impact factor: 1.355

Review 7. Recent advances in the optical control of protein function through genetic code expansion.

Authors: Taylor Courtney; Alexander Deiters
Journal: Curr Opin Chem Biol Date: 2018-07-26 Impact factor: 8.822

8. Reversible and Tunable Photoswitching of Protein Function through Genetic Encoding of Azobenzene Amino Acids in Mammalian Cells.

Authors: Ji Luo; Subhas Samanta; Marino Convertino; Nikolay V Dokholyan; Alexander Deiters
Journal: Chembiochem Date: 2018-10-02 Impact factor: 3.164

9. Light-activation of Cre recombinase in zebrafish embryos through genetic code expansion.

Authors: Wes Brown; Alexander Deiters
Journal: Methods Enzymol Date: 2019-04-30 Impact factor: 1.600

10. 6-Bromo-7-hydroxy-3-methylcoumarin (mBhc) is an efficient multi-photon labile protecting group for thiol caging and three-dimensional chemical patterning.

Authors: M Mohsen Mahmoodi; Stephanie A Fisher; Roger Y Tam; Philip C Goff; Reid B Anderson; Jane E Wissinger; David A Blank; Molly S Shoichet; Mark D Distefano
Journal: Org Biomol Chem Date: 2016-08-16 Impact factor: 3.876