Natalie Elia1,2. 1. Department of Life Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel. 2. National Institute for Biotechnology in the Negev (NIBN), Ben-Gurion University of the Negev, Beer Sheva, Israel.
Abstract
Twenty-five years ago, GFP revolutionized the field of cell biology by enabling scientists to visualize, for the first time, proteins in living cells. However, when it comes to current, state-of-the-art imaging technologies, fluorescent proteins (such as GFP) have several limitations that result from their size and photophysics. Over the past decade, an elegant, alternative approach, which is based on the direct labeling of proteins with fluorescent dyes and is compatible with live-cell and super-resolution imaging applications, has been introduced. In this approach, an unnatural amino acid that can covalently bind a fluorescent dye is incorporated into the coding sequence of a protein. The protein of interest is thereby site-specifically fluorescently labeled inside the cell, eliminating the need for protein- or peptide-labeling tags. Whether this labeling approach will change cell biology research is currently unclear, but it clearly has the potential to do so. In this short review, a general overview of this approach is provided, focusing on the imaging of site-specifically labeled proteins in mammalian tissue culture cells, and highlighting its advantages and limitations for cellular imaging.
Twenty-five years ago, GFP revolutionized the field of cell biology by enabling scientists to visualize, for the first time, proteins in living cells. However, when it comes to current, state-of-the-art imaging technologies, fluorescent proteins (such as GFP) have several limitations that result from their size and photophysics. Over the past decade, an elegant, alternative approach, which is based on the direct labeling of proteins with fluorescent dyes and is compatible with live-cell and super-resolution imaging applications, has been introduced. In this approach, an unnatural amino acid that can covalently bind a fluorescent dye is incorporated into the coding sequence of a protein. The protein of interest is thereby site-specifically fluorescently labeled inside the cell, eliminating the need for protein- or peptide-labeling tags. Whether this labeling approach will change cell biology research is currently unclear, but it clearly has the potential to do so. In this short review, a general overview of this approach is provided, focusing on the imaging of site-specifically labeled proteins in mammalian tissue culture cells, and highlighting its advantages and limitations for cellular imaging.
The selective labeling of proteins via unnatural amino acids (unAAs) relies on the incorporation of an unAA into the amino acid (AA) sequence of the protein of interest (POI). This incorporation takes place in cells that were engineered to enable the addition of an unAA in response to a unique codon during ribosomal translation (Fig. 1A). Labeling then occurs by the formation of a covalent bond between the unAA and a chemically modified fluorescent dye (Fl‐dye) via highly specific chemical reactions, termed bioorthogonal or ‘click’ reactions (Fig. 1B). As a result, the POI is directly and site‐specifically labeled in live cells. The major steps in this labeling technique are briefly discussed below; for a more detailed description, please refer to Ref. [1, 2, 3].
Fig. 1
Labeling cellular proteins through genetic code expansion (GCE) and click chemistry. (A) Incorporating an unAA during ribosomal translation. A UAG codon is inserted in‐frame into the amino acid sequence of a POI. A unique tRNA, which carries the complementary codon to the UAG and was charged with the unAA by a unique tRNA synthetase, recognizes the UAG codon and incorporates the unAA into the newly formed polypeptide. Translation then continues until the full‐length polypeptide is released from the ribosome, giving rise to the synthesis of a full‐length protein that carries the unAA at a specific site. (B) Bioorthogonal labeling via a click reaction. The unAA carries a chemical modification; in this case, a BCN moiety (red) that specifically and rapidly reacts with the tetrazine moiety (black) that is attached to a fluorescent dye (green). Consequently, the protein is directly labeled with a fluorescent dye at a specific site.
Labeling cellular proteins through genetic code expansion (GCE) and click chemistry. (A) Incorporating an unAA during ribosomal translation. A UAG codon is inserted in‐frame into the amino acid sequence of a POI. A unique tRNA, which carries the complementary codon to the UAG and was charged with the unAA by a unique tRNA synthetase, recognizes the UAG codon and incorporates the unAA into the newly formed polypeptide. Translation then continues until the full‐length polypeptide is released from the ribosome, giving rise to the synthesis of a full‐length protein that carries the unAA at a specific site. (B) Bioorthogonal labeling via a click reaction. The unAA carries a chemical modification; in this case, a BCN moiety (red) that specifically and rapidly reacts with the tetrazine moiety (black) that is attached to a fluorescent dye (green). Consequently, the protein is directly labeled with a fluorescent dye at a specific site.
Encoding an unAA in cellular proteins
Incorporating an unAA into a POI in the cell requires the unAA being recognized by the cellular translation machinery. One approach to accomplish this task is to use a modified AA that resembles a natural one, thus enabling its recognition by the endogenous tRNA and tRNA synthetase. This approach has been demonstrated using methionine analogs [4, 5, 6, 7, 8] and was successfully used to visualize newly synthesized proteins in mammalian cells [5, 6, 7]. However, this approach is not protein‐specific, as all endogenous methionine residues can potentially be substituted by the analogous unAA and, therefore, its applicability for the fluorescence imaging of specific proteins is limited.Another approach, which allows the selective labeling of a POI within the cell, is to incorporate the unAA into the coding sequence of the POI by using the genetic code expansion (GCE) technique. To this end, a sequence of the POI with an in‐frame TAG codon (the amber stop codon) is added to the cell, together with the selected unAA. In addition, the cell is supplemented with a unique pair of tRNA/tRNA synthetase, which was evolved to facilitate the incorporation of the desired unAA in response to the amber stop codon (Fig. 1A). Notably, by allocating the amber stop codon to encode an additional, unnatural AA, the genetic code is expanded from 20 to 21 AAs.
Specifically attaching a fluorescent dye to the unAA in the cellular milieu
The most direct GCE approach for fluorescently labeling proteins in cells is to incorporate unAAs that bear fluorescent properties into the AA sequence of the POI [9]. This approach was used successfully to visualize intracellular proteins in bacteria and in mammalian cells, without affecting their function [10, 11, 12]. However, the currently available fluorescent unAAs are bulky, their incorporation efficiency is low, and they suffer from poor fluorescent properties, altogether hindering their utilization in cellular imaging [1, 2].A more efficient and commonly used approach is to incorporate an unAA that carries a chemical moiety capable of covalently binding a Fl‐dye inside the cell via the fast and highly specific click reactions (Fig. 1B) (reviewed in Refs [1, 2, 13]). Notably, click reactions are not restricted to protein labeling and provide an attractive, superior tool for visualizing nucleotides, lipids, and sugars in mammalian cells, though this is beyond the scope of this review [1, 14, 15, 16, 17]. Currently, the preferred click reaction for protein labeling in live mammalian cells is the inverse electron‐demand Diels–Alder reaction (IEDDA) between strained alkynes or alkenes and tetrazines, which demonstrates fast kinetics at room temperature and a low cellular toxicity [2, 13, 18]. The unAAs that are most frequently used in this approach are trans‐cyclooctene (TCO)‐lysine and bicyclo[6.1.0]nonyne (BCN)‐lysine, and designated pairs of tRNA/tRNA synthetase have been evolved for their incorporation into proteins in mammalian cells [19, 20, 21]. Each of these two unAAs has its own advantages and limitations upon binding the Fl‐dye; BCN‐lysine exhibits a relatively high labeling efficiency (especially under low expression levels), while TCO‐lysine reacts faster and is more robust, but tends to produce higher background levels due to its hydrophobicity [22, 23].Once the cells express a POI that carries the chosen unAA, a tetrazine‐conjugated Fl‐dye (Tet‐Fl‐dye) is added to the cells and specifically binds the unAA via the click reaction, resulting in a direct, highly specific labeling of the POI at a selected site (Fig. 1B). The click reaction is fluorogenic; the fluorescence of the Fl‐dye is quenched by the tetrazine moiety in the unbound form, and is increased, by several folds, upon binding to the unAA [13, 20, 24]. Essentially any Fl‐dye that is conjugated to tetrazine can be used to label the POI, and numerous Tet‐Fl‐dyes are available commercially. Of special interest are the silicon rhodamine (SiR)‐based dyes (SiR, HMSiR, and JF646), which are cell‐permeable and exhibit low background labeling in mammalian cells [15, 25, 26, 27, 28]. The fluorogenicity of the click reaction reduces background fluorescence that stems from the unspecific binding of the Fl‐dye—a known limitation of other labeling approaches based onFl‐dyes, such as SNAP‐tag [29]. In general, when bound to methyl‐substituted tetrazines (Me‐Tet), Fl‐dyes are substantially more quenched than Fl‐dyes bound to H‐Tet [22, 24]. However, the fluorogenicity of the reaction also depends on the properties of the Fl‐dye itself and therefore should be examined on a case to case basis. The detailed analysis of the fluorescent properties of different Tet‐Fl‐dyes, recently completed by Beliu et al. can be used as a guideline [24].
What is it good for?
Genetic code expansion‐based labeling, namely, the labeling of cellular proteins through GCE and click chemistry, has several key advantages over other cellular imaging techniques. First, the Fl‐dye is attached directly to a specific residue, such that only the chemical handle and the Fl‐dye are added to the POI. This minimal compound is an order of magnitude smaller than a fluorescent protein (~ 0.5 nm, as compared with ~ 5 nm)—an advantage that is especially critical for preserving the functions of small POIs [22, 30]. Second, Fl‐dyes are generally brighter and more photostable than fluorescent proteins, enabling live‐cell imaging at lower laser powers and at higher temporal resolutions. Third, Fl‐dyes are available for the entire light spectrum, including the near‐infrared region (e.g., SiR and JF646), allowing imaging at longer wavelengths that are less phototoxic to the cells [26, 28, 31]. This is in contrast to fluorescent proteins, which exhibit poor photophysics at longer wavelengths. Fourth, the click reaction is stoichiometric (the Tet‐Fl‐dye binds the unAA at a 1 : 1 ratio), which enables protein quantification based on fluorescent intensity in heterologous expression systems [32]. These advantages are even more pronounced in single‐molecule applications and super‐resolution (SR) microscopy techniques, where cells are exposed to high laser powers for relatively long periods of time. The small size of the label and its 1 : 1 binding stoichiometry are especially attractive for single‐molecule SR techniques, as they set the ground for increasing localization accuracy and for molecule counting.Taken together, GCE‐based labeling is advantageous for all fluorescence imaging applications and, in particular, for high‐spatiotemporal‐resolution quantitative analyses of proteins in live cells: The minimal label size helps preserve the nature of the POI; the 1 : 1 stoichiometry ratio allows quantification; and the ability to image at low laser powers and long wavelengths increases cell viability.
What has been done with it?
In 2012, the Chin laboratory demonstrated the efficient, site‐specific labeling of proteins in live mammalian cells by using the genetic encoding of BCN‐Lys or TCO‐Lys and click labeling with Tet‐Fl‐dyes [20]. As explained above, this GCE‐based labeling approach is fundamentally different from conventional protein‐based labeling approaches, such as fluorescent protein tags and protein self‐labeling tags (e.g., SNAP‐tags). Therefore, efforts have been made over the past years to validate this approach and evaluate its performance in different imaging modalities. In tissue culture cells, GCE‐based labeling was found to be compatible with live‐cell imaging and SR microscopy of both intra‐ and extracellular proteins. Moreover, whenever tested, the localization and function of the POIs were not affected by the presence of the unAA, further validating this approach [22, 30, 33]. An overview of the imaging applications tested so far with GCE‐based labeling is provided below.
Live‐cell imaging
Genetic code expansion‐based labeling has proven successful for the imaging of several receptors and channels in live cells [4, 20, 24, 32, 34] (Fig. 2). In addition, it has been used to label the HIV‐1 envelop glycoprotein Env, enabling the first‐ever recording of its dynamics on the plasma membrane (PM) [30]. This work is of special interest because prior attempts to visualize Env in live cells failed due to the inability to generate a functionally active labeled Env by using conventional protein tags. In all the above‐mentioned examples, the unAA was incorporated at the extracellular regions of the protein and labeling was performed using cell‐impermeable Tet‐Fl‐dyes, thus avoiding the relatively high background levels associated with the technique (see subsection The click reaction, below). After further optimization, the GCE‐based labeling technique was successfully applied for intracellular labeling, and various intracellular components—including intermediate filaments, microtubules, intracellular vesicles, the PM, lysosomes, and the endoplasmic reticulum—were labeled and imaged in live mammalian cells [22, 24, 35, 36, 37, 38]. Here, too, small proteins that could not be labeled using protein tags were successfully labeled and imaged. Peng and Hang, for example, recorded, for the first time, the dynamics of the IFITM3 protein, which comprises only 137 AAs [22].
Fig. 2
Live‐cell imaging of proteins labeled via GCE and click chemistry. (A) Photobleaching experiments employing cellular proteins labeled with unAAs. Top panel: HEK293 cells expressing the HIV protein Env, which carries a BCN‐lysine at position 407. The cells were labeled with the cell‐impermeable dye Tet‐Cy5 and imaged using a spinning disk confocal microscope. Reproduced with permission from Ref. [30]. Bottom panel: COS7 cells expressing the endoplasmic reticulum marker ERcb5TM, which was conjugated to a 14‐AA tag that carries BCN‐lysine. The cells were labeled with the cell‐permeable dye Tet‐TAMRA and imaged using a spinning disk confocal microscope. Reproduced from Ref. [36]. Scale bars, 10 μm. (B) SPT of EGFR in COS7 cells, imaged in TIRF mode over time. Top panel: The cells express EGFR, which carries BCN‐lysine at position 128, and are labeled with the cell‐impermeable dye Tet‐Cy3. Bottom panel: The cells express an EGFR‐GFP. In each case, individual particles obtained from the videos were segmented and tracked through time (right panels). Note that particles obtained for EGFR‐Cy3 were brighter than those obtained for EGFR‐GFP and that significantly more tracks were generated using this labeling approach. Reproduced from Ref. [34]. Scale bars, 10 μm.
Live‐cell imaging of proteins labeled via GCE and click chemistry. (A) Photobleaching experiments employing cellular proteins labeled with unAAs. Top panel: HEK293 cells expressing the HIV protein Env, which carries a BCN‐lysine at position 407. The cells were labeled with the cell‐impermeable dye Tet‐Cy5 and imaged using a spinning disk confocal microscope. Reproduced with permission from Ref. [30]. Bottom panel: COS7 cells expressing the endoplasmic reticulum marker ERcb5TM, which was conjugated to a 14‐AA tag that carries BCN‐lysine. The cells were labeled with the cell‐permeable dye Tet‐TAMRA and imaged using a spinning disk confocal microscope. Reproduced from Ref. [36]. Scale bars, 10 μm. (B) SPT of EGFR in COS7 cells, imaged in TIRF mode over time. Top panel: The cells express EGFR, which carries BCN‐lysine at position 128, and are labeled with the cell‐impermeable dye Tet‐Cy3. Bottom panel: The cells express an EGFR‐GFP. In each case, individual particles obtained from the videos were segmented and tracked through time (right panels). Note that particles obtained for EGFR‐Cy3 were brighter than those obtained for EGFR‐GFP and that significantly more tracks were generated using this labeling approach. Reproduced from Ref. [34]. Scale bars, 10 μm.Genetic code expansion‐based intra‐ and extracellular labeling was also used in photomanipulation techniques, such as FRET and FRAP [4, 30, 33, 34, 36, 38] (Fig. 2A). Single‐molecule FRET approaches were recently developed and applied for resolving the conformational changes of hemagglutinins on virus surfaces and of shaker Kv channels upon activation in Xenopus oocytes [4, 39]. Such high‐resolution experiments have only been made possible due to the site‐selective GCE‐based labeling.
Super‐resolution microscopy
Several SR techniques, including structured illumination microscopy (SIM), single‐molecule localization microscopy (SMLM; STORM and GSDIM), and stimulated emission depletion (STED), were performed in fixed cells on proteins labeled via GCE and click chemistry [3, 23, 30, 33, 34, 35, 37, 40, 41, 42] (Fig. 3). Rizzoli and colleagues systematically labeled 26 different proteins by using unAAs and visualized their multimolecular arrangements with both SMLM and STED [40]. A dSTORM imaging of click‐labeled NMDA receptors produced significantly higher labeling densities than those produced by antibody‐based NMDA labeling, suggesting that GCE‐based labeling is advantageous for SMLM [33] (Fig. 3A,B). In SIM images of GCE‐based labeled microtubules, adjacent fibers < 100 nm apart could be resolved, indicating that resolution in SIM may be improved by implementing GCE‐based labeling, potentially due to the small size of the label [35] (Fig. 3D). The technique was also successfully combined with DNA‐PAINT technology for the SMLM of the intermediate filament protein, vimentin, and the nuclear pore complex protein, Nup153 [37].
Fig. 3
Super‐resolution microscopy imaging of proteins labeled via GCE and click chemistry. (A, B) SMLM performed on cellular proteins labeled with GCE and click chemistry. (A) Top panel: dSTORM imaging of NMDA receptors carrying a TCO‐lysine in position 392 and click‐labeled with Tet‐Cy5. Bottom panel: dSTORM imaging of NMDA receptors labeled with specific primary antibodies and Alexa 647 secondary antibodies. The right panels are zoomed‐in images of the areas marked with squares on the left panels. A wide‐field image is shown, for comparison, on the bottom right corner of the top left panel. Note that considerably more localizations were obtained using the click‐labeled NMDA receptor. Reproduced with permission from Ref. [33]. Scale bar, 2.5 μm. (B) dSTORM imaging of microtubules labeled with the microtubule‐binding protein EMTB carrying a TCO‐lysine at position 87 and click‐labeled with the cell‐permeable dye Tet‐HM‐SiR in COS7 cells. A wide‐field image is shown on the top left corner, for comparison. Reproduced from Ref. [24]. Scale bar, 1 μm. (C) Confocal (left) and STED (middle) images of the HIV protein Env, which carries a BCN‐lysine at position 407 and was labeled with Tet‐KK114 in HEK293 cells. Right panel: line intensity profiles obtained for an individual Env cluster (arrowheads in the left and middle panels), imaged via confocal microscopy or STED. The higher resolution obtained with STED can be clearly seen. Reproduced with permission from Ref. [30]. Scale bar, 1 μm. (D) SIM imaging of GCE‐labeled microtubules in COS7 cells. Microtubules, labeled in COS7 cells with tubulin that carry a BCN‐lysine at position 45 and Tet‐SiR, were imaged using wide‐field microscopy or SIM (left and middle panels, respectively). A zoomed‐in image of the red rectangle in the middle panel is shown in the top right panel. An intensity line profile across the region depicted by arrows in the top right panel is shown in the bottom right panel, demonstrating the ability to resolve fibers that are only 80 nm apart. Reproduced from Ref. [35]. Scale bars, 10 μm, zoomed‐in, 1 μm.
Super‐resolution microscopy imaging of proteins labeled via GCE and click chemistry. (A, B) SMLM performed on cellular proteins labeled with GCE and click chemistry. (A) Top panel: dSTORM imaging of NMDA receptors carrying a TCO‐lysine in position 392 and click‐labeled with Tet‐Cy5. Bottom panel: dSTORM imaging of NMDA receptors labeled with specific primary antibodies and Alexa 647 secondary antibodies. The right panels are zoomed‐in images of the areas marked with squares on the left panels. A wide‐field image is shown, for comparison, on the bottom right corner of the top left panel. Note that considerably more localizations were obtained using the click‐labeled NMDA receptor. Reproduced with permission from Ref. [33]. Scale bar, 2.5 μm. (B) dSTORM imaging of microtubules labeled with the microtubule‐binding protein EMTB carrying a TCO‐lysine at position 87 and click‐labeled with the cell‐permeable dye Tet‐HM‐SiR in COS7 cells. A wide‐field image is shown on the top left corner, for comparison. Reproduced from Ref. [24]. Scale bar, 1 μm. (C) Confocal (left) and STED (middle) images of the HIV protein Env, which carries a BCN‐lysine at position 407 and was labeled with Tet‐KK114 in HEK293 cells. Right panel: line intensity profiles obtained for an individual Env cluster (arrowheads in the left and middle panels), imaged via confocal microscopy or STED. The higher resolution obtained with STED can be clearly seen. Reproduced with permission from Ref. [30]. Scale bar, 1 μm. (D) SIM imaging of GCE‐labeled microtubules in COS7 cells. Microtubules, labeled in COS7 cells with tubulin that carry a BCN‐lysine at position 45 and Tet‐SiR, were imaged using wide‐field microscopy or SIM (left and middle panels, respectively). A zoomed‐in image of the red rectangle in the middle panel is shown in the top right panel. An intensity line profile across the region depicted by arrows in the top right panel is shown in the bottom right panel, demonstrating the ability to resolve fibers that are only 80 nm apart. Reproduced from Ref. [35]. Scale bars, 10 μm, zoomed‐in, 1 μm.
Single‐molecule techniques
More recently, GCE‐based labeling was applied to live‐cell single‐molecule applications, such as single‐particle tracking (SPT) and live STORM [32, 34] (Fig. 2B). By using image correlation spectroscopy, the fluorescence intensities obtained for GPCRs via click labeling were found to be linearly correlated with those obtained using GFP, indicating that the approach is quantitative. Measurements of the fluorescent intensity in peroxisomes colabeled with GCE‐based Tet‐SiR and GFP confirmed that the approach is quantitative also in an intracellular context [36]. SPT was used to measure the diffusion rates of click‐labeled GPCRs, EGFR, and the Shaker Kv channel [32, 34] (Fig. 2B). Notably, while the physiological response of the receptors appeared to be similar in click‐labeled and GFP‐labeled receptors, mild differences were measured in the confinement and rate of diffusion [32, 34]. On average, more tracks and longer trajectories were obtained for proteins labeled via unAAs, as compared with those tagged with GFP [34]. Therefore, GCE‐based labeling appears to be more reliable for tracking and, accordingly, superior over GFP labeling for measuring the diffusion of proteins in the cell. However, before switching from one labeling approach to the other, the cause for the differences in the diffusion of GFP‐ versus GCE‐based labeled proteins may need to be elucidated.Substituting fluorescent proteins for Fl‐dyes using self‐labeling tags was shown to improve the temporal resolution of SMLM, enabling live‐SMLM imaging [43, 44, 45]. Similarly, live SMLM of GCE‐based labeled PM proteins was recently performed at a 30‐nm spatial resolution and a 2.5‐s temporal resolution [34]. Although the temporal resolution should be further improved, combining site‐specific labeling with STORM in live‐cell applications will surely advance our understanding of the dynamics and function of proteins in cells.
Multicolor labeling
Dual‐color labeling of different proteins in the same cell using GCE is currently a challenging task, as it requires exploiting two codons from the codon table and introducing two mutually orthogonal tRNA/tRNA synthetase pairs to the cells, which is extremely difficult in mammalian cells [46]. Although unAA‐based labeling can be readily combined with conventional protein tags for a dual‐color labeling [35, 47], several laboratories proposed strategies to overcome the challenge of GCE‐based dual‐color labeling [3, 46].Rizzoli and colleagues incorporated different unAAs into different POIs in two separate cells and then fused the cells to one another by using inactivated virus particles [42]. This approach enabled dual‐color STED imaging, but its applicability to cell biology is limited because imaging is restricted to fixed cells and to protein pairs that can be studied in fused cells. By sequentially introducing two different unAAs and fine‐tuning the specificity of the click reactions, Lemke and colleagues were able to use two different unAAs to label the same POI at the same site with two different Fl‐dyes in live cells [41]. While this dual‐labeling approach is beneficial for some imaging applications (such as pulse‐chase analyses), it is limited to the labeling of a single POI. Another approach for labeling the same POI with dyes of two colors is to employ competitive labeling of the same unAA with two Tet‐conjugated Fl‐dyes bearing different fluorescent properties. This labeling strategy has been employed to study protein oligomerization, perform single‐molecule FRET, and simultaneously acquire data from a single cell by using different imaging modalities [32, 34, 39].
What should I consider before starting an experiment?
The factors that should be considered regarding GCE‐based labeling for cellular imaging can be generally divided according to the two main steps in the process: the incorporation of the unAA into the AA sequence of the POI, and the click reaction between the unAA and the Tet‐conjugated Fl‐dye. Critical considerations for each of these steps are discussed below.
Incorporation of the unAA
Conventionally, the incorporation of the unAA into the AA sequence of the POI is enabled by mutating one of the codons to TAG, such that one of the AAs in the POI is replaced by a noncanonical AA (in most cases, BCN‐Lys or TCO‐Lys). Therefore, when choosing the incorporation site, care should be taken to avoid replacing an AA that is crucial for either the structure or the function of the POI. Importantly, once a suitable location is found, the cellular localization and function of the protein do not seem to change by the presence of the unAA, suggesting that, if carefully considered, this is not a limitation of the approach [30, 33].Another factor that should be considered is the incorporation of the unAA into endogenous TAG termination sites. Such cases may result in the unspecific labeling of endogenous proteins, which will increase background levels. The ratio between the specific labeling of the POI and the unspecific labeling of other proteins in the cell can be quantified by using in‐gel fluorescence [22, 23, 30, 36, 47]. In addition, the overall background level in cells can be evaluated by measuring the signal intensity after adding all the GCE and click reaction components to the cells—except the TAG‐modified POI [22, 23, 30, 35, 36, 38, 47]. Despite these potential background‐related limitations, GCE‐based labeling demonstrates relatively high signal‐to‐noise ratios (SNRs), which enable quantitative imaging in either fixed or live cells [24, 35, 36].Adding an in‐frame TAG stop codon can also lead to a premature termination of translation. In fact, the newly introduced tRNA and the cellular translation termination factors (e.g., eRF1) constantly compete with each other over binding to the UAG stop codon in the mRNA. An interaction between eRF1 and the newly introduced stop codon will terminate translation and prevent the incorporation of the unAA. As a result, truncated versions of the POI will be expressed and the efficiency of unAA incorporation will be reduced. Expressing a dominant‐negative version of eRF1 can minimize this effect [48], but an eRF1 mutant is prone to perturb the termination of endogenous protein translation and, therefore, its expression is often avoided. The balance between the incorporation of the unAA and a premature termination appears to depend on the incorporation site, albeit the factors that govern incorporation efficiency remain elusive. Therefore, a traditional GCE‐based labeling experiment should begin by screening for efficient incorporation sites in which the full‐length POI is translated at relatively high levels and truncations are kept to a minimum. Choosing positions near the N terminus of the protein can potentially help to minimize the accumulation of protein truncations, due to the short length of the resulting polypeptide. However, in some cases, introducing a premature stop codon near the 5′ end of the mRNA promotes the re‐initiation of translation from a downstream AUG start codon [49]. To avoid such scenarios, the size of the POI should be verified by western blot analysis during the screening process.
The click reaction
An efficient click reaction requires that the unAA will be incorporated into the POI such that it is accessible to the freely diffusing Tet‐Fl‐dye [19, 35]. This requirement should be thoroughly considered, especially in the case of multicomplex proteins, and care should be taken to ensure that the unAA is accessible in both the monomeric and the multicomplex form, so as to avoid a biased visualization of only a subset of the protein population. Moreover, the addition of a Tet‐Fl‐dye can potentially affect the function of the protein and, therefore, functionality should also be verified in the labeled form.Once an efficient, suitable labeling site is selected, background fluorescence should be minimized. In that respect, it should be recognized that any unAA present in the cell (not only the one that is incorporated into to the POI) will undergo the click reaction and contribute to background fluorescence. Sources for unAAs that are not bound to the POI include excess of soluble unAAs; unAAs that were incorporated into cellular proteins in response to endogenous TAG (as discussed above); and unAAs that are bound to tRNAs. Indeed, reducing the copy number of tRNA from four to one was shown to improve the SNR in mammalian cells without affecting the efficiency of unAA incorporation, and this strategy is, therefore, recommended for labeling purposes [38]. The unspecific labeling of charged tRNAs also appears to be the source for unspecific nuclear labeling observed in cells labeled via GCE [37, 38]. When labeling proteins outside the nucleus, sequestering unspecific labeling inside the nucleus can actually be an advantage, as it may reduce cytosolic background levels. When nuclear proteins need to be specifically labeled, the nuclear accumulation of tRNA can be avoided by introducing a nuclear export signal to the coding sequence of the orthogonal tRNA synthetase [37]. Extensive washes will minimize background labeling resulting from an excess of unAA [19, 35].Labeling proteins through click reactions provides the flexibility to use a variety of Fl‐dyes that exhibit different chemical and fluorescence properties. Tailoring the characteristics of the Fl‐dye to the desired imaging application is a major advantage of GCE‐based labeling and can greatly improve the performance of advanced imaging techniques. While some dyes are cell‐permeable, others are not; some dyes may be more suited for labeling certain cellular components; and while some dyes are bright and photostable, others exhibit blinking properties, which are required in SMLM experiments [19, 36]. The cellular permeability and fluorogenicity of various Tet‐Fl‐dyes were recently reported and can be used as a guideline for a rational selection of the Fl‐dye [13, 24].Notably, the various considerations regarding the incorporation of the unAA and the click reaction make this labeling approach somewhat more challenging than traditional labeling approaches. To overcome these challenges, we recently designed a short (14 AAs long) N‐terminal tag that encodes for the incorporation of an unAA, and demonstrated its applicability for GCE‐based labeling of proteins and organelles [36]. Although using a tag is not as elegant as incorporating the label directly into the POI, this approach provides a calibration‐free, easy‐to‐implement tool that retains most of the advantages associated with GCE‐based labeling.
What are the limitations?
Genetic code expansion‐based labeling relies on the manipulation of the cellular translation machinery, and the physiological consequences of such a manipulation have yet to be carefully considered. First, the cellular response to an exogenous tRNA/tRNA synthetase pair, and the effects on ribosomal activity, has not been characterized. Second, the cellular effects of protein read‐through resulting from the incorporation of an unAA at endogenous TAG codons have not been comprehensively analyzed. Although TAG is the least abundant stop codon in mammalian cells, there are over 25 000 open reading frames in the human genome with an annotated TAG stop codon. Among these are genes encoding for cytoskeleton, cell cycle, and cell adhesion proteins, and it remains largely unknown whether and how the read‐through of these proteins affects their cellular function. Moreover, Arbely and colleagues recently reported that the translation of sequences past the annotated stop codon promotes the aggregation and lysosomal targeting of the C‐terminally extended proteins [50]. It is still to be determined whether this is the fate of cellular proteins that carry a TAG codon in GCE‐modified cells, but this study stresses the need to begin investigating these processes in GCE‐modified cells. Potential cellular phenotypes associated with GCE can also explain why the efficiency of unAA incorporation varies across cell types. This observed variability may arise from different regulatory pathways and checkpoints associated with the origin of the tissue culture cells.In addition to these global effects, introducing a premature stop codon into the natural AA sequence of the POI may affect cell physiology. Protein truncations—resulting either from an inefficient incorporation of the unAA into the genetic code of the POI or from a re‐initiation of translation at a downstream ATG codon—can affect the function and regulation of the POI. Such protein truncations should be avoided (or at least minimized), and care should be taken to verify that they do not affect the function of the POI or the overall cell physiology.Despite these potential effects on cell physiology, GCE‐based labeling through click reactions has been successfully applied in several model organisms, including fish, flies, and mice, suggesting that the technique does not severely influence the overall viability of cells (reviewed in Ref. [51]). However, until these potential effects are carefully and systematically characterized, caution should be used when employing GCE‐based labeling to answer questions pertaining to cellular behavior and regulation.
Conclusions and perspectives
Genetic code expansion, together with click chemistry, offers an elegant tool for the labeling of proteins in both fixed‐ and live‐cell imaging applications. Previous studies indicate that this approach is suitable for the live‐cell imaging of both intra‐ and extracellular proteins and that it can be used with—and considerably contribute to—various advanced quantitative techniques, including photomanipulation, single‐molecule imaging, and super‐resolution techniques. However, care should be taken to follow optimized protocols, to validate the function of the POI, and to evaluate labeling specificity and efficiency by using relevant controls. Improving the performance and robustness of directly incorporated fluorescent unAAs for the visualization of proteins in live cells can overcome some of the background fluorescence issues currently associated with the approach. In the meantime, considerable time and effort can be saved by using previously published unAA incorporation positions in specific POIs, by adding the calibrated short peptide tag, and by adopting established protocols. Multicolor GCE‐based labeling is currently limited due to the challenge of exploiting two different codons for the incorporation of the unAAs. New concepts for genetic code engineering using four‐nucleotide codons instead of the traditional three‐nucleotide codons have been recently demonstrated in bacteria [52]. Although applying this approach to mammalian cell labeling is not trivial, such strategies can pave the way toward a multicolor, site‐specific labeling of cellular mammalian proteins. Lastly, the GCE‐based labeling technique requires the introduction of exogenous factors to the cellular translation machinery, and the effect that these factors may have on cellular physiology needs to be evaluated. Meanwhile, biophysical studies can be conducted to investigate the spatiotemporal organization of proteins in cells using an essentially ‘tag‐free’ labeling approach.
Authors: Richard Wombacher; Meike Heidbreder; Sebastian van de Linde; Michael P Sheetz; Mike Heilemann; Virginia W Cornish; Markus Sauer Journal: Nat Methods Date: 2010-08-08 Impact factor: 28.547
Authors: Jonathan B Grimm; Brian P English; Heejun Choi; Anand K Muthusamy; Brian P Mehl; Peng Dong; Timothy A Brown; Jennifer Lippincott-Schwartz; Zhe Liu; Timothée Lionnet; Luke D Lavis Journal: Nat Methods Date: 2016-10-24 Impact factor: 28.547
Authors: Ivana Nikić; Tilman Plass; Oliver Schraidt; Jędrzej Szymański; John A G Briggs; Carsten Schultz; Edward A Lemke Journal: Angew Chem Int Ed Engl Date: 2014-01-28 Impact factor: 15.336