Yuting Chen1, Linjie Lu2, Shixin Ye2,3,4. 1. Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University (ECNU), Shanghai, China. 2. Université Pierre-et-Marie-Curie, Laboratory of Computational and Quantitative Biology (LCQB), Institute of Biology Paris-Seine, Paris, France. 3. Institut National de la Santé et de la Recherche Médicale, Paris, France. 4. Centre National de la Recherche Scientifique, Paris, France.
Abstract
Nature has invented photoreceptor proteins that are involved in sensing and response to light in living organisms. Genetic code expansion (GCE) technology has provided new tools to transform light insensitive proteins into novel photoreceptor proteins. It is achieved by the site-specific incorporation of unnatural amino acids (Uaas) that carry light sensitive moieties serving as "pigments" that react to light via photo-decaging, cross-linking, or isomerization. Over the last two decades, various proteins including ion channels, GPCRs, transporters, and kinases have been successfully rendered light responsive owing to the functionalities of Uaas. Very recently, Cas9 protein has been engineered to enable light activation of genomic editing by CRISPR. Those novel proteins have not only led to discoveries of dynamic protein conformational changes with implications in diseases, but also facilitated the screening of ligand-protein and protein-protein interactions of pharmacological significance. This review covers the genetic editing principles for genetic code expansion and design concepts that guide the engineering of light-sensitive proteins. The applications have brought up a new concept of "optoproteomics" that, in contrast to "optogenetics," aims to combine optical methods and site-specific proteomics for investigating and intervening in biological functions.
Nature has invented photoreceptor proteins that are involved in sensing and response to light in living organisms. Genetic code expansion (GCE) technology has provided new tools to transform light insensitive proteins into novel photoreceptor proteins. It is achieved by the site-specific incorporation of unnatural amino acids (Uaas) that carry light sensitive moieties serving as "pigments" that react to light via photo-decaging, cross-linking, or isomerization. Over the last two decades, various proteins including ion channels, GPCRs, transporters, and kinases have been successfully rendered light responsive owing to the functionalities of Uaas. Very recently, Cas9 protein has been engineered to enable light activation of genomic editing by CRISPR. Those novel proteins have not only led to discoveries of dynamic protein conformational changes with implications in diseases, but also facilitated the screening of ligand-protein and protein-protein interactions of pharmacological significance. This review covers the genetic editing principles for genetic code expansion and design concepts that guide the engineering of light-sensitive proteins. The applications have brought up a new concept of "optoproteomics" that, in contrast to "optogenetics," aims to combine optical methods and site-specific proteomics for investigating and intervening in biological functions.
An Innovative Molecular Biology Tool Based on Site-directed Mutagenesis
Proteins are essential biomolecules for cellular and physiological functions. To understand how a protein catalyzes a biological reaction requires techniques to specifically manipulate the protein and detect associated functional signals. Since the 1980s, the development of site-directed unnatural amino acid mutagenesis [1,2] has evolved as a powerful approach to correlate structure-activity relationship by introducing novel functionality into proteins. Similar to conventional site-directed mutagenesis, which allows one to change a specific amino acid in a protein to any of the other 20 naturally occurring amino acids, Uaa mutagenesis unleashes the limit of amino acid repertoire to enable introduction of unique chemical groups carried by the Uaa. Varieties of Uaas containing side-chains that serve as photoactive, spectroscopic, and redox probes have been successfully introduced into proteins, which facilitate the development of novel detection methods [3-9].The common strategy of Uaa mutagenesis uses the amber stop codon suppression methodology, which involves the translation of an amber stop codon inserted into the gene of the targeted protein by a suppressor tRNA aminoacylated with a desired Uaa (Figure 1a). This strategy is later referred to as genetic code expansion, GCE [10], a concept based on assigning unnatural amino acids to the genetic code by using artificial coding rules. The main challenge is the generation of a Uaa aminoacylated suppressor tRNA. Three approaches have been developed (Figure 1b): 1) semi-chemical synthesis [1,2], 2) ribozyme [3,11], and 3) engineered aminoacyl-tRNA synthetase (aaRS) [4,10,12]. The first approach requires multiple steps of synthesis whereas the other two approaches are achieved through enzymatic reactions catalyzed by either ribozyme or aaRS. Although the main advantage of enzymatic reactions is that the Uaa aminoacylated tRNA can be generated in situ, the first approach offers an unmatched ability to aminoacylate any type of Uaa that would otherwise require specific design and screening of the ribozymes and aaRS.
Figure 1
Schematic illustration of Uaa incorporation through the amber stop codon suppression. a. Engineered suppressor tRNA carrying a desired Uaa (red dot) recognizes the UAG stop codon in the mRNA, which allows read-through of protein translation. The protein product has the Uaa incorporated at the specific site. b. Three routes to generate Uaa aminoacylated suppressor tRNA: 1) semi-chemical synthesis, 2) ribozyme, 3) engineered aminoacyl tRNA synthetase.
Despite the fact that Uaa mutagenesis has evolved gradually into discovery tools, compared to many well-established molecular biology tools such as restriction enzymes [13], molecular cloning [14,15], site-directed mutagenesis [16-18], PCR [19,20], and genetically encoding GFP [21,22], most efforts are currently focused on technical developments for the reprogramming of the genetic code (Figure 2). Since the successful demonstration of Uaa mutagenesis via semi-chemical synthesis of suppressor tRNA [1,2], many concepts have been later shown to be feasible. When injecting the suppressor tRNA into Xenopus oocytes [23], it was possible to generate proteins site-specifically incorporating the Uaa, making it the first demonstration of in vivo Uaa mutagenesis. Later on, reassignment of quadruplet, synonymous codons, and codons containing artificial bases were proven to be feasible [24,25]. The development of engineered aaRS/tRNA pairs to recognize varieties of Uaas further advanced the in vivo Uaa mutagenesis [10,12]. Engineering more complex translational apparatus such as the ribosome has also been achieved [26], making it an attractive solution to reprogram the Escherichia coli genome to tolerate incorporation of multiple Uaas, although the applications of these orthogonal ribosomes in solving biological questions are still not fully demonstrated. The recent methodology development in creating transgenic animals including Caenorhabditis
elegans [27,28], Drosophila [29] and mice [30,31] harboring aaRS/tRNA pairs for Uaa metabolism will further provide new biological insights into protein conformation changes, protein interactions, elementary processes in signal transduction, and the role of post-translational modifications in vivo.
Figure 2
Timeline of the technical developments of genetic code expansion (black font) and other molecular biology tools (grey font).
Genetic Code Expansion
Orthogonal tRNA/RS Pairs
The most widely used approach utilizes the orthogonal aaRS/tRNA pairs delivered or incorporated into the host organisms for decoding the stop codon (Figure 3). The binding pocket of aaRS can be altered in order to recognize a Uaa with higher specificity than the natural substrate. The suppressor tRNA can recognize the blank codon, commonly the amber stop codon. Table 1 summarizes the available orthogonal tRNA/RS pairs screened by rational mutagenesis in E. coli to encode structurally distinct amino acids. These pairs were derived from natural aaRS/tRNAs in archaea or bacteria which are orthogonal to those in the host organisms. For example, The Methanococcus jannaschii tyrosyl-tRNA synthetase (MjTyrRS) pair is orthogonal in E. coli but not eukaryotic cells. EcTyrRS/BsttRNA pair, EcTyrRS/EctRNA and EcLeuRS/tRNA pairs are orthogonal in eukaryotic cells but cannot be reused in E. coli. Comparing the two TyrRS/tRNA pairs used in eukaryotic cells, BsttRNA contains intact internal A-and B-box promoters for expression in mammalian cells, whereas the EctRNA requires external H1 and U6 promoters. Key residues in the substrate amino acid binding pocket are mutated in order to recognize varieties of Uaas [4,10,12]. Overall the PylRS/tRNA pair is arguably the most useful pair for GCE, because PylRS is orthogonal in all kinds of cellular systems to encode pyrrolysine in response to the amber stop codon [32,33]. In addition, the plasticity of the PylRS makes it ideal to engineer variants to recognize varieties of Uaas.
Figure 3
Schematic illustration of genetic code expansion utilizing the engineered aminoacyl tRNA synthetase/tRNA pairs (colored red). The orthogonal aaRS recognizes the Uaa and aminoacylates the suppressor tRNA in situ, which participates in the protein translations to incorporate Uaa at a site where amber stop codon is inserted into the mRNA. Endogenous aaRS/tRNAs and amino acids are colored blue.
Table 1
Five commonly used orthogonal aaRS/tRNA pairs.
aaRS/tRNA
Origin
Orthogonal organisms
MjTyrRS/tRNA
Methanococcus jannaschii
E. coli
EcTyrRS/BsttRNA
E. coli / Bacillus stearothermophilus
Eukaryotic cells
EcTyrRS/EctRNA
E. coli
Eukaryotic cells
EcLeuRS/tRNA
E. coli
Eukaryotic cells
MmPylRS/tRNA
Methanosarcina mazei
E. coli, Eukaryotic cells
Unnatural Amino Acids
More than 100 different Uaas have been designed and incorporated into proteins [5-9,34]. A large fraction of them have been further exploited by screening aaRS/tRNA pairs, which could eventually lead to applications in vivo. Uaas containing spectroscopic probes, post-translational modifications, metal chelators, photo affinity labels, and other chemical moieties have been selectively incorporated into proteins with good fidelity and efficiency. They have been used to study biological processes and address the emerging challenges for structure-activity relationships of proteins that are inaccessible to other methods.
Genome Editing Tools to Introduce New Coding Rules
The simplest way to provide orthogonal tRNA/aaRS pairs for a cellular system is by the delivery of the genes encoded in plasmid vectors, which are under the control of appropriate promoters for transient expression into the host cells, such as E. coli, yeast, oocytes, mammalian cells (HEK293, Hela, CHO cells), or primary cultured cells.Implementing GCE to study proteins in whole animals is challenging, and several model systems have been established. These demonstrations have tackled two major challenges: (i) inefficient gene delivery of suppressor tRNA, orthogonal synthetase, and protein of interest to the animals; and (ii) insufficient bioavailability of the Uaa at the desired tissue or cell type to enable acceptable level of mutant expression. It was first established in simple animals C. elegans [27,28], D. melanogaster [29], and recently in vertebrate animals including zebrafish [30,35] and mice [30,31]. In utero electroporation of plasmid DNAs coupled to direct injection of the Uaa into the embryonic brain provided a first possible method [36]. Gene transfer using lentiviral [30,31], adeno-associated viral (AAV) [37] and baculoviral vectors [38] offer an alternative way for efficient delivery of the Uaa genetic machinery in mammalian cells, tissues, and brains of living mice. Recently, GCE has been successfully developed in mice with a heritable expanded genetic code [30,31] – i.e. animals with tRNA/aaRS genes integrated into their genome. They provide opportunities to study human physiology and disease in vivo, because the mouse genome is more than 99 percent similar to that of humans and it contains most human gene counterparts or functionally related genes [39]. Meanwhile, the mouse has a short life span and is easy to breed and handle in the laboratory. More importantly, the mouse genome is well-known, which makes it possible to generate custom-made mutant mouse strains for in vivo studies of specific genes and providing excellent models for various human diseases [40]. Now mouse strains allowing incorporation of the photocrosslinker AzF [30] and the post-translationally modified Nε-acetyl-lysineAcK [31] have become available. We expect this strategy to be extended to other tRNA/aaRS pairs in a custom-made manner, including proteins of interest containing amber stop mutation, which could be stably integrated into the genome for physiological studies.Uaa delivery in the mice can be achieved by intraperitoneal injection, or direct injection into target tissues for site-specific induction of Uaa-engineered protein expression [31]. Uaa supplementation in the mouse drinking water provides a convenient option, even when targeting brain proteins as recently shown with an AcK [37]. In zebrafish, incubation of live animals in water containing AzF was shown to be sufficient for Uaa delivery [30,35].
Applications in Optogenetics and Optoproteomics
A toolbox of chemical biology approaches centered on GCE technology has enabled the incorporation of Uaas serving as unique physical and chemical probes into proteins to correlate protein structure activity relationship. There are many excellent reviews on a variety of applications of Uaas for studies of ligand-gated ion channels [5,41,42] and GPCRs are excellently reviewed [41,43,44-46]. Our understanding of functional properties of those important receptors is increasing rapidly due to the combination of X-ray crystal structure characterization and GCE applications in recent years.In the past decade, optogenetics based on natural light-sensitive proteins has evolved as a powerful strategy to study cellular functions. In that technology, channelrhodopsin (ChR, a light-driven ion channel found in archaebacteria) is genetically encoded into cellular systems such as neurons to enable the regulation of their function by light induced activation of ChR [47]. Despite the power of manipulation of biological functions with cell specificity, the regulation of a specific protein function by light remains a major challenge. The approach we call “optoproteomics,” combining optical methods and protein-specificity for investigating and intervening cellular functions, provides a general solution to regulating physiological function by light.Here we will focus on the applications of GCE technology to reprogram proteins in response to light. Light is a unique signal input with three major advantages for studying biological functions: 1) it has high temporal and spatial resolution to control a biological reaction; 2) reaction kinetics are significantly faster and more specific than small molecule based controls that rely on molecule diffusion; 3) cooperative mechanisms can be elucidated by combining controls by light and small molecules. The design concept is to introduce a light-sensitive molecule into proteins serving as a switch to regulate protein function. Light-sensitive Uaas can bestow proteins with such photoswitches, inducing a specific functional change in the protein upon illumination (Figure 4a). To date, three types of light-sensitive Uaas have been exploited to transform light-insensitive proteins to respond to light: 1) caged-Uaas which release a photolabile moiety from the side chain upon light activation; 2) photo-cross-linking Uaas, which cross-link to a nearby molecule upon light activation; 3) photo-switchable Uaas, which toggle in between two conformations reversibly upon illumination with different wavelengths (Figure 4b).
Figure 4
Schematic illustration of engineering photo-responsive proteins with light-sensitive Uaas. a. Inserting a light-sensitive Uaa into a protein can transform a protein to respond to light. Upon light stimulation, protein changes from the ground state to the activated state due to the photo-responses of the Uaa, which leads to a functional read-out. b. Three main categories of light responsive Uaas: caged amino acids, photo-cross-linking amino acids, and photo-switchable amino acids. When these Uaas are introduced into ion channels (black channels) or other proteins (grey circles), various biological responses have been observed.
Caged Amino Acids
This category of Uaas has side chains protected by a photochemically removable group (Figure 4b). Upon light activation, the protection group – or the cage – is released to generate the uncaged side chain. With a caged-Uaa introduced at a key functional site in the protein, protein functions can be activated due to the light stimulations.
Tyr(ONB)
Early experiments incorporating caged-TyrTyr(ONB) into proteins demonstrated the feasibility of time-resolved studies on protein function by decaging via photolysis [48]. Tyrosine, serine, threonine are substrates of a diversity of kinases, and phosphorylation of these residues plays a key role in the regulation of a wide range of pathways. For example, kinase-dependent modulation of ion channels and neuronal receptors have been demonstrated to play key roles in synaptic plasticity, the changes in the structure and composition of a synapse that are associated with learning and memory. Tyr(ONB) along with caged serine and threonine, hold special promise as general tools to regulate a specific tyrosine phosphorylation target in signaling pathways. Tyr(ONB) was first incorporated into the alpha subunit of mouse muscle nicotinic acetylcholine receptor (nAChR), which belongs to the pentameric ligand-gated ion channel family [48]. When introduced at different tyrosine positions (93Y, 127Y, 198Y), three types of light-induced kinetics were observed, providing the first demonstration of the light-control of protein function through caged Uaas. In particular, the slower kinetics observed for 93Tyr(ONB) and 198Tyr(ONB) could correspond to conformational changes of the receptor upon decaging, although no further information was available about how the state of the phosphorylation affected the receptor function. In a subsequent study with Tyr(ONB) introduced at the conserved tyrosine site in the K+-channel Kir 2.1, photo-decaging led to a ~30 percent decrease in K+-channel activity when the kinase v-src was also expressed in the oocyte [49]. Detailed electrophysiology studies revealed that decaging initiated two kinase-dependent pathways. One involved direct modulation of the channel presumably due to the conformational changes associated with phosphorylation. The other involved kinase-initiated endocytosis of receptors, which diminished the observed K+ current. Dissecting these two competing pathways would have been very difficult using conventional tools. When coupled with caged serine, threonine, and phospho-amino acids, these strategies have the potential for providing new insights into phosphorylation pathways of biological signaling.
Photocaged Lysine
Although powerful, the cases of incorporating Tyr(ONB) relied on semi-chemical synthesis of tRNA, which limits applications in vivo. Engineering an aaRS/tRNA pair that accepts light-activatable amino acids as substrates provides an attractive solution to achieve in vivo applications. The first such application established was incorporation of photocaged lysine for creating kinases that can be activated in living mammalian cells by light [50,51]. These studies made replaced the conserved lysine in the ATP binding pocket of the catalytic domain with caged lysine. The crystal structure of the MEPK/ATP complex reveals that the lysine anchors and orients ATP for catalytic functions. By replacing the lysine with caged lysine, the kinase activity is almost completely blocked due to the inability to bind ATP for kinase function. Upon light activation, the cage is released and lysine is restored. This approach allows the design of subnetworks controlled solely by light to study the kinetics of individual steps in signaling cascades and the dissection of temporal regulation. This methodology provides high spatial and temporal resolution that is inaccessible to other methodologies. Due to the substantial conservation of the targeted lysine residue (present in 95 percent of human protein kinases), the light-activation method reported here could be generally and readily applicable to creating photo-activated versions of other protein kinases. Moreover, by applying the lysine photocaging to each kinase in a pathway, precise quantitative insights into the kinetics of kinase networks, and the identification of substrates of individual kinases, become possible. Such developments are attractive because signal transduction pathways use cascades of protein and lipid kinase-mediated phosphorylation to elicit specific responses to distinct extracellular stimuli. Kinase pathways are extremely complex, dynamic, multistep processes with cross-talk, feed-back and feed-forward steps that are hard to differentiate. Light activatable kinases provide unique tools to uncover the architecture of pharmacologically important, time-dependent, adaptive control processes, such as feedback inhibition. Recently, photo-activatable kinases have been applied in living zebrafish embryos [35].Besides kinases, the caged lysine has been introduced into Cas9 protein making a light-activated Cas9 in mammalian cells, an attractive CRISPR/Cas9 system to control exogenous and endogenous gene functions by light [52]. These discoveries provide new tools for the optochemical control of protein function and gene editing in mammalian cells and tissues.
Caged Cysteine and Caged Serine (Cmn)
Many proteins use cysteine for a broad range of functional roles through the intrinsic properties of the thiol group. When a caged cysteine was introduced into the substrate-binding pocket of Renilla luciferase, the bulky caging group caused a complete loss of enzyme activity [53]. Upon brief exposure to UV light which removes the cage and restores the coelenterazine binding the activity was rescued (though not reaching the wild-type level), thus showcasing the applicability of the caged cysteine controlling protein activity in live human cells. The incomplete activity restoration may be due to the decaging not reaching 100 percent.Cmn has also been used to generate a photoactivatable Kir2.1 [36], by introducing Cmn into a specific site in the transmembrane region. The caged-serine introduced at C149 site had the most pronounced light sensitivity. Before light treatment, the Kir2.1 channel showed minimal activation due to Cmn occluding the pore and blocking the channel. After UV illumination, the cage is released to generate a serine side chain which is similar to the native cysteine, restoring the outward K+ activity. This light activatable Kir2.1 was screened in mammalianHEK293 cells, and further implemented in cultured rat hippocampal neurons as well as embryonic mouse neocortex to achieve light-activated suppression of neuronal firing specific to Kir2.1, demonstrating the proof-of-concept of optical control of protein function using genetically encoded Uaas.
Photo-cross-linking Amino Acids
This category of Uaas has side chains that become reactive when exposed to light (Figure 4b). They will conjugate to any one of several common functional groups in proteins upon activation. This feature makes them particularly useful in capturing protein-protein or protein-ligand interactions, and has been widely exploited in the studies of signaling pathways involving G-protein coupled receptors (GPCRs) [43,44], the most popular membrane protein family for drug development. Compared to the long history of engineering light-activatable proteins using caged amino acids, the development of light-sensitive proteins with photo-cross-linking amino acids is a more recent phenomenon.A series of pioneering studies focused on encoding photo-cross-linking UaasAzF and Bpa into GPCRs [43-47]. The encoding of AzF and Bpa takes advantage of an E. coliTyrRS and Bacillus stearothermophilusTyr-tRNACUA pair for incorporating tyrosine analogs into proteins in mammalian cells at high efficiency [54]. High efficiency was achieved by expression of the suppressor Tyr-tRNACUA of bacterial origin using conserved promoter sequences of the mammalianTyr-tRNAs [55,56]. The challenge of directly observing photo-cross-linking was first overcome to observe a photo-induced GPCR-ligand complex. GPCRs such as chemokine receptors containing AzF or Bpa introduced at specific sites of the ligand-binding pocket can covalently conjugate their cognate ligands such as T140 (an HIV-1 coreceptor blocker) [57]. Many other GPCRs and their interacting partners other than ligands (such as antibody epitopes) [58-61] have been investigated and characterized using this approach, providing novel biological insights. In addition to GPCRs, complexes between serotonin transporter and two antidepressant drugs have been systematically studied, precisely defining the high-affinity drug-binding site in this important drug target for the treatment of depression [62].Inspired by the works on GPCRs, AzF and Bpa were also incorporated to create light-sensitive ionotropic glutamate receptor (iGluR) family proteins, including AMPA receptors [63] and NMDA receptors [64-66]. These receptors mediate excitatory synaptic transmissions and are made of a tetrameric assembly of subunits. Sites were chosen at subunit interface revealed by the X-ray crystal structures. The incorporation of Bpa into the AMPA receptor GluA2 at two sites (S729 or G725) located at subunit interfaces between ligand-binding domains (LBD), allowed UV-induced state-dependent photo-inactivation. When pairing GluA2-AzF with GluA1, there was photo-inactivation although it was less than that of homomeric GluA2 receptors [63]. Overall, these AMPA mutants undergo the same desensitization mechanism for light inactivation by Bpa or AzF.In contrast to AMPA receptors, surprisingly, light-sensitive NMDA receptors with different mechanisms have been identified by the incorporation of AzF at the subunit interface. This interface is at N-terminal domains (NTD), which are extracellular domains adjacent to the LBD where multiple allosteric modulator binding sites have been identified. Two Glu2B AzF mutants in residues at the NTD interface, where the synthetic compound ifenprodil binds (revealed by X-ray crystal structures), showed opposite light-dependent effects [65,66]. GluN1-Y109AzF/GluN2B demonstrated UV-induced inactivation [65], whereas GluN1/GluN2B-F114AzF demonstrated UV-induced potentiation [66]. Why would introducing AzF at the same interface, one in the GluN1 subunit, and the other one in the GluN2B subunit elicit completely different light-induced modulations? It turns out that in the case of GluN1-Y109AzF/GluN2B, UV induced intra-subunit crosslinking within GluN1 changed the receptors from the wild-type conformation to a low channel-open-probability (Po) state. Whereas for the GluN1/GluN2B-F114AzF, UV induced inter-subunit between GluN1/GluN2B restored the receptors from a low Po to the wild-type state. Effects on ion conductivity and gating by the natural ligands or allosteric modulators (either inhibitors such as Zn2+ and ifenprodil, or potentiators such as spermine) can then be compared to the wild type receptors or the uncrosslinked mutants. Therefore, those light-sensitive NMDARs serve as novel “photoreceptors” to probe receptor functions, providing new platforms to uncover novel mechanisms of allosteric regulation. Such design concept through interfacial crosslinking has recently been extended to potassium channels [67,68]. These studies highlight the pharmacological mechanisms of receptors that make them promising drug targets for the treatment of diseases.
Photoswitchable Amino Acids (PSAAs)
PSAAs containing azobenzene moiety can reversibly change conformation under two wavelengths of light (Figure 4b). This reversibility is unique among other optoproteomic approaches discussed above. The concept was initially demonstrated in horseradish peroxidase [69], catabolite activator protein [70], and calmodulins [71] by incorporating various PSAAs. Recently photoswitchable NMDARs were generated by incorporating PSAAs [72]. This study proved for the first time the feasibility to convert neuronal receptors to respond to light with reversible control by two different wavelengths of light. PSAAs incorporated at several positions within the multi-domain receptors endowed robust light-induced modulations. They demonstrated different function switching mechanisms. When introduced at the GluN1 clamshell hinge region, PSAA photoisomerization enabled the control of receptor’s glycine sensitivity and activation efficacy. When inserted into the site in the pore domain which contributes to a conserved transmembrane cavity, PSAA photoisomerization impacted both gating and permeation properties due to the conformational changes of the membrane helix M3. This work demonstrates not only a unique approach to understand biophysical properties of transmembrane pore sites whose conformational changes during receptor activity remain poorly understood, but also unveiled the contribution of specific side chains to distinct receptor properties such as agonist sensitivity, channel open probability, and permeation.Optical control of protein function provides excellent spatial-temporal resolution for studying proteins functions in situ. The general advantage of the GCE approach is that it provides molecular selectivity and an acute modulation of protein functions inaccessible to other methodologies. Combined with rapid developments in structural biology, the goal is to gain functional information in a protein’s native environment and to develop new therapeutic strategies [73-76]. The development of light-controllable ion channels with genetic encodability, as reviewed above, should be valuable to manipulate receptor signaling and unmask specific sidechain roles in physiological function. We expect these principles to be generally applied to other proteins, enabling optical investigation of a range of receptors, ion channels, and kinases both in recombinant and native systems.
Pros and Cons
Complementing other approaches which rely on introducing photo-sensitive ligands to engineer light-sensitive receptors [74-76], the GCE directly inserts the light-sensitive moiety into the protein at any site using a stop codon suppression system. Using Uaa-based optoproteomics to control protein function offers several key advantages over existing methodologies: 1) The genetic encodability affords high molecular (i.e. subunit) specificity and cell-type targeting. 2) Varieties of photo-switching mechanisms can be identified. By targeting allosteric mechanisms using photo-cross-linkers and PSAA, rather than direct orthosteric interactions as for photoswitchable tethered ligands, it provides the ability to fine-tune receptor activity without interfering with its natural activation. 3) Different types of light functionalities can be installed into the same protein using a single site.However, there are many challenges to tackle for the in vivo applications. The low efficiency of mutant expression due to the inefficient incorporation of bulky Uaas such as caged Uaas and PSAA, and low efficiency of amber suppression and in vivo have to be kept in mind. Several strategies have been implemented to increase the efficiencies, such as optimization of tRNA gene design [35,55,77], development of controlled delivery of orthogonal pairs via viral vectors [37,38], and application of siRNA to down-regulate mammalianUpf2, which is the key regulator of nonsense mediated mRNA decay pathway [31].Another major challenge is to identify the proper site to introduce the light functionality for a given protein. Proteins differ from one another primarily in their sequence of amino acids, which is dictated by the nucleotide sequence of their genes, and results in protein folding into a specific three-dimensional structure that determines its activity. We and others have shown that light-sensitive Uaas introduced at conformation sensitive sites can generate light-responsiveness. However, previously sites were often chosen based on previous functional studies using conventional site-directed mutagenesis in combination with empirical analyses of available X-ray crystal structures that are static. Functional mutants can be validated by conducting cell-based screening assays. In practice there is often a low success rate, with more than half of the designed mutants being light insensitive, likely due to the sheer complexity of the protein structure-function relationship. To increase the success rate, prediction methodologies could be improved using molecular modeling [78]. Recent progress in system-wide protein computational analyses has made a profound impact in elucidating sequence-structure relationship with significantly improved accuracy. For example, structural based protein design algorithms can predict conformational changes with high accuracy. For proteins without high-resolution structures, ancestral sequence reconstruction analysis can provide insights into the specific amino acids in a given protein sequence that contribute to the desirable properties, such as increased flexibility and conservation. Sequence coevolution analysis enables the prediction of protein–protein interactions at the precision of amino acids. It is therefore conceivable to apply these analyses to design light-responsive proteins on a large-scale with increased prediction precision. When implementing a designed protein in vivo to control functionality with light through optoproteomics, as illustrated in Figure 5, we can discover the physiological effect of a targeted protein.
Figure 5
Concept of optoproteomics. a. Introducing various Uaas to render a protein of interest to be light sensitive by site-specific incorporation of Uaas. b. Engineering a light-sensitive protein with in silico screening based on the atomic structures, in vitro functional screening, and system-wide investigation of protein specific response in tissues and in vivo.
Conclusions and Outlook
Advances in GCE have opened up new landscapes in optoproteomics. Designed light sensitive proteins provide an expanding palette (Figure 4) for specific electrical or biochemical effector function control through well-defined protein functional changes. In some sense, optogenetics could be considered a branch of optoproteomics due to the utilization of naturally occurring microbial channelrhodopsins to mediate neuronal and physiological activities in organisms of higher complexity. The temporal precision of the use of light along with the specific protein functional change enabled by GCE creates a unique link between a molecular event defined precisely through a change of a single amino acid side-chain in the protein and its physiological consequences. To realize the goal of in vivo light control, advances in light targeting/readout technologies offered by optogenetics will provide solid foundation. In the future, models of genetic diseases caused to specific proteins carrying amber stop codon mutations [79] may be established and explored by optoproteomics. Firstly, the read-through of the stop codon could lead to a phenotypic disease rescue; and secondly, the light could introduce a proteomic change to enhance the therapeutics. Such efforts will undoubtedly reveal more unexpected and exciting roles for nonsense mutations in human diseases.
Authors: Irene Coin; Vsevolod Katritch; Tingting Sun; Zheng Xiang; Fai Yiu Siu; Michael Beyermann; Raymond C Stevens; Lei Wang Journal: Cell Date: 2013-11-27 Impact factor: 41.582