Artur Gora1, Jan Brezovsky, Jiri Damborsky. 1. Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment, Faculty of Science, Masaryk University , Kamenice 5/A13, 625 00 Brno, Czech Republic.
Enzymes are very efficient
catalysts that are essential for the
functioning of living organisms. The low efficiency of biocatalysts
produced de novo relative to those that have evolved naturally demonstrates
that our understanding of enzymatic catalysis is still incomplete.[1−4] The dynamic motion of enzymes during catalytic events is one of
the many aspects of protein chemistry that are currently insufficiently
well understood.[5−9] On one hand, proteins need to have well-defined and organized structures
in order to maintain stable functionality in the intracellular environment.
On the other hand, some degree of flexibility is often required for
catalytic activity. Molecular dynamics simulations have provided key
insights into the importance of protein dynamics in catalysis, such
as the observation of substrate access and product exit pathways that
cannot be identified by inspecting crystal structures.[10] Csermely et al. recently reported that mutations
in regions that affect protein dynamics, such as hinge regions that
are important in substrate binding, can have dramatic effects on catalytic
activity.[11] In this review, we highlight
the role of protein gates as another class of highly dynamic structures
that play key roles in protein function.Given the importance
of gates for enzymatic catalysis, the number
of studies that have examined them systematically is surprisingly
small. Conformational gating in proteins was first described by McCammon
and co-workers in 1981, but there have been relatively few systematic
studies in this area since then.[12−14] Moreover, much of the
available data on gates in macromolecular systems is hidden or otherwise
dispersed within the scientific literature, partly because there is
currently no consensus regarding what defines a gate. Some authors
describe all residues that affect the ligand’s access to a
target area as gating residues, whereas others apply the term exclusively
to structural features that undergo large movements during the gating
event. In this review, we define a gate as a dynamic system consisting
of individual residues, loops, secondary structure elements, or domains
that can reversibly switch between open and closed conformations and
thereby control the passage of small molecules—substrates,
products, ions, and solvent molecules—into and out of the protein
structure. Under this definition, the anchoring residues that stabilize
the open or closed conformations of a gate are not themselves gating
residues. However, because of their various interactions with the
gating residues, they can control the size and properties of the ligands
that pass through the gate as well as the frequency of the exchange
events.Gates can be found in various systems, including enzymes,
ion channels,
protein–protein complexes, and protein–nucleic acid
complexes.[14,15] In this work, we focus specifically
on gates in enzymes. We attempt to answer three basic questions—why,
how, where—by describing the molecular function, structural
basis, and location of gates within protein structures. We discuss
71 illustrative examples of enzymes that together contain 129 different
molecular gates and propose a system for their classification. Reviewed
enzymes were chosen based on a literature search with a set of keywords
corresponding to gates and conformational changes in enzymes. A preliminary
set of protein structures was filtered out, leaving only those entries
for which both open and closed conformations were described. The final
set represents different classes of enzymes (Figure 1A) and different protein folds (Figure 1C) and spans structurally and functionally the entire enzyme world.
Among 71 chosen enzymes, only 17 have higher than 20% sequence identity
with other set members (Figure 1E). The proposed
classification system provides a useful framework for comparing gates
of different enzymes and drawing general conclusions about gate function,
structure, and position. Moreover, the classification scheme is easily
extendable to describe the new gate types that will almost certainly
be revealed by structural and functional analyses of newly isolated
enzymes in the future.
Figure 1
Distribution of (A) reviewed proteins and (B) proteins
from the
PDB database according to EC classes, (C) reviewed proteins and (D)
proteins from the PDB database according to SCOP classification, and
(E) identity matrix of reviewed enzymes (only the enzymes with sequence
identities above 20% are shown for clarity).
Distribution of (A) reviewed proteins and (B) proteins
from the
PDB database according to EC classes, (C) reviewed proteins and (D)
proteins from the PDB database according to SCOP classification, and
(E) identity matrix of reviewed enzymes (only the enzymes with sequence
identities above 20% are shown for clarity).
Molecular Function of Gates
Analyses of protein
dynamics have identified a number of enzymes
with gates, suggesting that these structures are rather common. What
is the molecular function of the gates? It seems that in enzymes they
facilitate precise control over processes that are directly linked
to catalysis. Enzyme gates can (i) contribute to enzyme selectivity
by controlling substrate access to the active site, (ii) prevent solvent
access to specific regions of the protein, and (iii) synchronize processes
occurring in distant parts of the protein (Figure 2). The proper function of even the simplest gates can potentially
be essential for catalysis, and the gating event may even represent
the rate-limiting step of the catalytic cycle. Interestingly, different
gating residues within a single protein molecule may be responsible
for restricting the access of specific substrates. High variability
of the gating residues within an enzyme scaffold can lead to the evolution
of enzyme families whose members are selective for specific substrate
types. The best known example of such specialization within a single
enzyme family is provided by the cytochromes P450.[16]
Figure 2
Schematic illustration of the molecular functions of protein gates:
(A) control of substrate access, (B) control of solvent access, (C)
control and synchronization of reactions. Protein is represented by
the area colored in gray, active site cavity by the area in white,
gating residues by red lines, substrate molecules by green or violet
lines, and water molecules by blue lines.
Schematic illustration of the molecular functions of protein gates:
(A) control of substrate access, (B) control of solvent access, (C)
control and synchronization of reactions. Protein is represented by
the area colored in gray, active site cavity by the area in white,
gating residues by red lines, substrate molecules by green or violet
lines, and water molecules by blue lines.
Control of Substrate Access
Enzyme
selectivity has been traditionally explained by the “lock and
key” model,[17] which was subsequently
complemented by the “induced fit” or “hand in
glove”,[18] “selected fit”,[19] and “keyhole, lock, and key”[20] models. In many cases, these models provide
an adequate description of enzyme selectivity based on adjustable
complementarity between the active site and the cognate substrates.
However, research conducted over the past decade has shown that regions
located further from the active site can also affect enzyme selectivity.
Substrate access pathways, which often incorporate molecular gates,
impose additional constraints on ligand binding to the active site.[20] The ability of ligands to traverse these access
pathways can be controlled by (i) size discrimination at the narrowest
point along the pathway forming a bottleneck, (ii) geometrical constraints,
e.g., the curvature of the pathway, and (iii) specific molecular interactions
such as hydrogen bonds, electrostatic interactions, and hydrophobic
interactions with the residues comprising the access pathway. Protein
gates can be regarded as molecular filters that discriminate between
molecules as similar as molecular oxygen and carbon monoxide in NiFe
hydrogenases[21,22] or water and hydroxyperoxide
in catalases.[23,24] Gates act as filters in a wide
range of enzymes, controlling the range of substrates that can be
accepted by broad-specificity cytochromes P450,[25] the stereospecificity of epoxide hydrolases,[26] and product length in undecaprenyl-pyrophosphate
synthases.[27]One of the first systematic
descriptions of the influence of the gating process on substrate binding
was reported by Szabo et al., who assumed that the switching between
the open and the closed conformations of the gate was a stochastic
process.[28−30] This model was successfully used to demonstrate that
despite conformation gating[13] acetylcholinesterase
can bind acetylcholine with a rate constant of 109 M–1 s–1 and predict the rate of formation
of the enzyme–substrate complex in choline oxidase.[31] Since gates create a barrier on the substrate
access pathway, the kinetic rate constant for passage over the barrier
can be obtained using Kramers’ reaction rate theory or its
later modifications.[32−34] This methodology was used to compare the results
of computational and experimental studies on the passage of the tetramethylammonium
cation through acetylcholinesterase[35] and
migration of ammonia through carbamoyl phosphate synthetase.[36]
Control of Solvent Access
Spatial
localization of the hydrophobic and hydrophilic regions within the
structure of a protein is important in maintaining its proper fold
and can also be crucial for catalytic function. The various steps
of an enzymatic reaction may require different environments. These
distinct environments can be generated by having the individual steps
occur in spatially separate regions of the protein, but this does
not eliminate the problem of transporting the substrate between these
sites. There are important problems to be addressed, including transporting
polar molecules from a polar environment to a nonpolar one and separating
hydrophilic compartments from hydrophobic ones within the structure
of a single protein.In some proteins, these problems are addressed
by the presence of selective barriers that permit passage of solutes
but not water molecules. Crystallographic and NMR data can be used
to identify cavities within a protein structure accessible to water
molecules. Exclusion of water from some parts of the cavity, such
as the active site or a specific tunnel, is essential for functioning
of numerous enzymes. In simple cases, the gates may prevent the entrance
of water molecules into the cavity when a substrate or a cofactor
is not present, as occurs in rabbit 20a-hydroxysteroid dehydrogenase.[37] In more complex cases, the gates may permit
access only to a specific part of the cavity, as occurs in carbamoyl
phosphate synthetase[36] and imidazole glycerol
phosphate synthase.[38] In the cytochromes
P450, a “water channel” controls hydration of the substrate
in the active site, which is extremely important for cytochrome activity.[16] The potential importance of gates that act as
solvent barriers is further illustrated by the example of enzymes
with ammonia tunnels. In these proteins, gates prevent water from
entering the channel and protonating the ammonia, which is essential
for maintaining its nucleophilic character.[39]Control of water access can be seen as a special case of the
function
described in section 2.1. However, when
discussing water exclusion, the main emphasis is on the water permeability
of the gates and their ability to distinguish water molecules from
other ligands. The gate can simultaneously act as a barrier to passage
of water molecules while acting as a selective filter for other molecules,
allowing them to access the active site. Gates of this type resemble
semiselective membranes that can distinguish between species such
as water and ammonia, allowing only the second to pass. It is worth
noting that passage of ‘permitted’ species can be facilitated
by rearrangement of an individual gating residue, such as K99 in
imidazole glycerol phosphate synthase. Conversely, passage of water
molecules through gates of this type would often require significant
conformational changes in all of the residues that comprise the gate.[38] This makes gates in enzymes far more sophisticated
than semipermeable membranes.
Control
and Synchronization of Reactions
Another function of gates
becomes apparent when considering enzymes
with two or more active sites. Many protein structures contain tunnels
to facilitate efficient migration of intermediates and gates to synchronize
chemical reactions. Such arrangements can be compared to a pair of
workers on an assembly line. The second worker has to be ready before
he can receive a product from the first one. Moreover, the products
generated by the first worker must satisfy certain standards. The
control gates located between the workers regulate the exchange of
products over a well-defined period of time. Gates of this kind are
common in ammonia-transferring enzymes, suggesting that they are old
in evolutionary terms and functionally important.[40−43] The need to efficiently transport
ammonia within the interior of the protein may be related to its high
cellular toxicity. We speculate that gates of this kind may be present
in many enzymes that have multiple active sites connected by internal
tunnels for the transport of intermediates. Many such enzymes have
been studied in some detail, including carbamoyl phosphate synthetase,
which has tunnels for ammonia and carbamate transportation;[36] asparagine synthetase,[44] glucosamine 6-phosphate synthase,[45] and
glutamate synthase,[46] all of which have
tunnels for ammonia transportation; tryptophan synthase for indole[47] and carbon monoxide dehydrogenase/acetyl coenzyme
A synthase for carbon monoxide transportation.[48]Systematic analysis of the functions of the known
gates in the 71 proteins discussed in this article revealed the following
distribution of gate types: 40% of the studied gates control substrate
access, 19% control solvent access, 15% control and synchronize catalytic
events, and 26% have other function (Figure 3).
Figure 3
Distribution of gate functions; 71 proteins with 129 different
gates were analyzed. Percentages shown in the figure are based on
all of the identified functions of each gate; individual gate may
perform multiple functions. Detailed description of the analyzed proteins
is provided in Table 3.
Distribution of gate functions; 71 proteins with 129 different
gates were analyzed. Percentages shown in the figure are based on
all of the identified functions of each gate; individual gate may
perform multiple functions. Detailed description of the analyzed proteins
is provided in Table 3.
Table 3
Detailed Description
of Enzymes Possessing
Gates Presented in the Scientific Literature
Structural Basis of Gates
Gates are
dynamic systems that can make reversible transitions
between open and closed states. They vary in size and complexity,
from individual amino acid residues to loops, secondary structure
elements, and even domains. The simplest gates consist of only one
amino acid side chain that can close or open an access pathway by
rotating. Opening and closing of more complicated systems can involve
the synchronized movement of two or more residues, and the largest
systems involve rearrangements of secondary elements or even entire
domains (Table 1). For larger systems, movement
of the gate may cause formation of a tunnel or enclosed cavity in
addition to permitting or denying access to selected species.[49,50] The following parameters can be useful for describing and discriminating
between gates: (i) their constituent residues, (ii) their anchoring
residues, (iii) the hinge region, i.e., the amino acids that make
the structure flexible and allow it to move, (iv) the gate’s
position, (v) the gate’s bottleneck diameter in the open and
closed states, (vi) changes in the bottleneck’s size over time,
(vii) the energy required to switch the gate from one state to the
other, and (viii) the energy required for passage of specific molecules
through the gate.
Table 1
Classifying Enzyme Gates According
to Their Structural Basis
Residue Motion: Wings
The energetic
barriers for residue rotation are quite small, 1–16 kcal/mol.[51] While generally low, such barriers can nevertheless
be large enough to significantly affect the probability that a given
species will be able to pass through the gate or the rate at which
they do so. Depending on the particular amino acid and its surroundings,
one or both states of the gate may be stabilized by interactions with
anchoring residues, e.g., hydrophobic interactions, H bonds, ionic
interactions, salt bridges, and π–π interactions.
The strongest effect on the control of the passage is achieved when
a large gating residue is located in the bottleneck of the pathway.
The most common residues in this role are those whose side chains
contain aromatic rings, i.e., W, F, and Y (Figure 4). Wing-type gates are common and can be found in enzymes
such as imidazole glycerol phosphate synthase,[38] cytidine triphosphate synthetase,[52] methane monooxygenase hydroxylase,[53] FabZ
β-hydroxyacyl-acyl carrier protein dehydratase,[54] and cytochrome P450.[25,55] Even small gates of
this type may require an activating agent to open. For example, the
gate in the water channel of human monooxygenase CYP3A4 is created
by the interaction of the conserved residue R375 with the heme, which
opens upon cytochrome P450 reductase binding to the enzyme.[56]
Figure 4
Relative occurrences of specific amino acid residues in
wing and
swinging door gates; 71 proteins with 129 gates were analyzed, and
154 residues that form wings or swinging doors were identified. Detailed
description of the analyzed proteins is provided in Table 3. Values were normalized against the frequency with
which each amino acid appears in all of the protein structures of
the UniProtKB/Swiss-Prot database (2012_07).
Relative occurrences of specific amino acid residues in
wing and
swinging door gates; 71 proteins with 129 gates were analyzed, and
154 residues that form wings or swinging doors were identified. Detailed
description of the analyzed proteins is provided in Table 3. Values were normalized against the frequency with
which each amino acid appears in all of the protein structures of
the UniProtKB/Swiss-Prot database (2012_07).
Residue Motion: Swinging Doors
A
more complex type of gate consists of two residues that can rotate
but are stabilized in the closed conformation by a mutual interaction.
Lario et al. introduced the phrase “swinging door” to
describe gates of this type that were identified in cholesterol oxidase
type I.[57] Some swinging door gates open
by having both residues rotate in the same direction, while in others
the two residues rotate in opposite directions. Common stabilizing
interactions in swinging door gates include π stacking as occurs
in the F–F pair of cytochrome P4503A4[58,59] and acetylcholinesterase,[60] ionic interactions
as in toluene-4-monooxygenase[61] and cytochrome
P450cam, P450BM3, and P450eryF,[25,55] aliphatic hydrophobic interactions such as those between the F–I,
the F–V, and the F–L pairs of cytochrome P4503A4,[58,59] aliphatic interactions such as those between
the R–L and the L–I pairs of cytochrome P4503A4,[58,59] and hydrogen bonds such as that between
the R–S pair in cytochrome P4503A4.[58] The open conformations of one or both of the gate residues
may also be anchored, depending on the amino acids surrounding the
gate. In comparison to wing gates, gates consisting of two residues
can control wider tunnels and channels. It is worth mentioning that
the individual residues that comprise a swinging door gate may simultaneously
be components of another gate, as occurs in cytochrome P4503A4.[58] Literature data indicate that most
gates of this type consist of F–F pairs, and one way to screen
for potential gates is to search for phenylalanine sandwiches.
Residue Motion: Apertures
Proteins
undergo low-frequency breathing motions that may involve synchronized
movements of bottleneck residues. In contrast to the previously described
gates, the residues that form aperture type gates do not need to rotate
and can maintain a rigid conformation. Their movements occur as a
result of the synchronized relocalization of the enzyme backbone during
its breathing motions. The ability of a given species to pass through
gates of this type depends on the length of time the gate remains
in the open state, which is determined by the enzyme’s rigidity
(especially in terms of the compartments housing the gating amino
acids) and the strength of the interactions between the gating residues.
Gates of this kind can therefore switch between states at different
frequencies, which can be adjusted by mutating the gating residues.
Aperture-type gates have been identified in several enzymes including
carbamoyl phosphate synthetase,[36] choline
oxidase,[31] glutamate synthases,[62] extradiol dioxygenases-homoprotocatechuate 2,3-dioxygenase,[63] cytochrome P450eryF,[25] and acetylcholinesterase.[64]
Motions of Loops and Secondary Structure Elements:
Drawbridges and Double Drawbridges
The movements of loops
and secondary structure elements can provide an energetically favorable
method of controlling access for larger ligands. The gates described
above consist of individual residues and would not provide sufficient
control for enzymes that have large substrates and correspondingly
large active site cavities. In many cases, the loops involved in access
control also contribute to formation of substrate/cofactor binding
cavities. Alternatively, in enzymes with complex systems of internal
tunnels such as the members of the cytochrome P450 family, the dynamic
motion of the protein structure, especially the flexible B–C
and F–G loops in the cytochromes P450, plays a vital role in
the opening and closing of the tunnels.[16] Protein motions of this type can also merge different tunnels, creating
a wider opening. Here, gating elements control the access of large
substrates by merging and dividing the space shared by the tunnels.[16] However, in such cases the movements of the
loops can cause formation of smaller and more selective gates such
as the swinging doors described in the preceding sections.[56,58]Movements of loops and secondary structure elements can change
the solvation of a cavity or the gate itself. The equilibrium between
the open and the closed conformations depends on the anchoring residues
and the flexibility of the hinge region. All of these elements play
important roles in the movements of large gates. The conserved GxG
motif found in most cytochrome P450 family members provides a good
example.[65] Depending on cytochrome isoform,
the motif flanks either one or both ends of the B–C loop. It
increases the flexibility of the loops, lowers the energy required
for their motion, and facilitates tunnel opening and closing. The
loops’ variable lengths and levels of flexibility mean that
each member of the P450 family has a gate with unique properties.[16]
Domain Motion: Shell
Large domain motions in enzymes are not generally
regarded as gating
systems. “Gates” of this scale are common in ion channels,
which are beyond the scope of this review.[14] However, one might expect that such large gates could be present
in enzymes that catalyze reactions of very large substrates. Indeed,
a gate of this kind has been observed in RNA polymerase, whose clamp
domain opens to permit entry of promoter DNA during initiation, closes
to establish a tight grip on the DNA during elongation, and then opens
again to release the DNA during termination.[66] Interestingly, movement of large domains may protect enzymes from
small molecule leakage and control their transport through long tunnel
networks. This has been observed in carbon monoxide dehydrogenase/acetyl
coenzyme A synthase, which operates as a tetrameric complex of distinct
subunits.[67] The cap domain movement of
epoxide hydrolase from Mycobacterium tuberculosis is another enzyme with a domain-scale gate that controls substrate
access to the active site cavity.[68] Monomers
of phospholipase A2 control access to their interface and the active
site by adopting a different conformation during dimer aggregation.[69] Large domain movements often require an additional
source of energy. For example, in the ATP-dependent protease HsIVU,[70] ATP hydrolysis is required to initiate conformational
changes and propagate them to the residues that form the gate.Structures of the gates found in 71 different proteins were analyzed
systematically, yielding the results presented in Tables 2 and 3. The most common gate types are wings, swinging doors, and
drawbridges, while apertures, double drawbridges, and shells are less
common (Figure 5). However, these numbers may
be distorted by the difficulty of identifying different gate classes
by experimental methods or molecular dynamics simulations. Movements
of only a few residues are more easily captured than those of secondary
structure elements or domains due to the different time scales involved
and the sizes of the moving structures. Moreover, gates may be controlled
by overlapping processes that occur on different time scales and affect
different structural regions.[50]
Table 2
List of Enzymes Possessing Gates Described
in the Scientific Literature with Indication of Their Function, Structural
Basis, and Location
Figure 5
Frequencies
of different gate types based on analysis of 71 proteins
with 129 gates. Detailed description of the analyzed proteins is provided
in Table 3.
Frequencies
of different gate types based on analysis of 71 proteins
with 129 gates. Detailed description of the analyzed proteins is provided
in Table 3.
Locations of Gates
The roles of gates in the enzymatic catalysis
discussed above suggest
that these structures are natural hot spots for modifying enzyme properties.
Identification of structural components of natural gates would therefore
be very useful to protein designers. This raises a question: how and
where should one look for the gates? Gates in proteins can be identified
experimentally by protein crystallography and NMR spectroscopy and
computationally by molecular dynamics simulations and normal-mode
analysis.Crystallographic analyses can provide information
on alternative
configurations of particular amino acids in a protein structure. The
presence of residues that can adopt both open- and closed-type configurations
along the access or release pathways of ligands, ions, and solvents
may suggest the presence of a gate. However, in order for a gate to
be detected by crystallography, it is necessary for both the open
and the closed conformations to be sufficiently represented. Despite
this restriction, crystallographic analyses have identified gates
in tryptophan synthase,[71] haloalkane dehalogenase
LinB,[72]l-amino acid oxidase,[73] and toluene-o-xylene monooxygenase.[74] In some cases, only one conformation will be
present in the solved crystal structure, which may create a somewhat
distorted picture, suggesting the absence of a gate when the open
conformation is stabilized or the absence of a pathway when the closed
conformation is stabilized.[31,61,75]The limitations of crystallographic analysis can be overcome
by
advanced NMR spectroscopy, which makes it possible to study multiple
protein conformations simultaneously, over time scales ranging from
picoseconds to milliseconds.[7,76] Such analyses provide
information on both the open and the closed states as well as the
population of each state and rate of their interconversion. NMR techniques
have been used to measure the rate of exchange between the open and
the closed conformations of triosephophate isomerase,[77,78] HIV-1 protease,[79] and dihydrofolatereductase.[80,81] Overall, the utility of NMR for studying distant effects of mutations
on protein dynamics suggests that it has great potential for investigating
gating mechanisms.[9]Some of the most
useful tools for identifying gates are the computer
programs developed for detecting tunnels, channels, and cavities in
protein structures.[82] The outputs of CAVER,[83] MOLE,[84] and MOLAXIS[85] can be analyzed to detect bottleneck residues
that form a potential gate or identify the best position for introduction
of a new gate. Mutations at these “hot spots” can provide
enzymes with new selectivities or activities.[86] Zawaira et al.[59] used the CAVER software
together with the Protein Interaction Calculator[87] for identifying gating residues within the cytochrome P450
family.MD simulations are well suited for identification and
analysis
of gates and their behavior over time. Detailed descriptions of MD
methods and their applications in simulating ligand migration can
be found in recent reviews.[9,88] Movements of large
protein fragments on microsecond time scales can be investigated
using Brownian dynamics,[31,89] while Random Expulsion
Molecular Dynamics and Steered Molecular Dynamics can be used to study
pathways dedicated to transport of specific ligands.[58,90] Some proteins have multiple pathways, each of which accommodates
a different ligand or ligand class. This may in fact be a lot more
common than is currently realized and can dramatically increase the
complexity of gating systems arising from protein movement and the
difficulty of identifying the true gating residues. For example, different
residues control the ability of inhibitors E2020 and Huperzine A
to access the active site of Torpedo californicaacetylcholinesterase.[60] Similarly, in cytochrome P4503A4, different residues in the same tunnel control access of temazepan
and testosterone-6OH.[58] The importance
of a gating residue identified by computational methods can be confirmed
experimentally by site-directed mutagenesis and kinetic experiments.Studies using the experimental and theoretical approaches for gate
identification discussed above have demonstrated that their locations
within the protein can vary widely. Gates have been observed (i) at
the entry to the active site or even directly inside the active site,
(ii) at the entry or in the bottleneck of the protein tunnel connecting
the buried active site to the protein surface or connecting two active
site cavities, and (iii) at the interface of the cofactor and active
site cavities (Figure 6).
Figure 6
Locations of gates within
a protein structure. Schematic representation
of an enzyme with two active sites connected by a tunnel (I), a cofactor
cavity (II), and multiple access tunnels. Gating residues in red may
be located at the entrance to the active site (1), at the entrance
or the bottleneck of the tunnel (2), and between the active site cavity
and the cofactor cavity (3).
Locations of gates within
a protein structure. Schematic representation
of an enzyme with two active sites connected by a tunnel (I), a cofactor
cavity (II), and multiple access tunnels. Gating residues in red may
be located at the entrance to the active site (1), at the entrance
or the bottleneck of the tunnel (2), and between the active site cavity
and the cofactor cavity (3).
Active Site Entrance and Active Site
The
entrance to the active site cavity is a suitable location for
a gate, and gates situated here can have strong effects on enzyme
activity. In some cases, the gating residues may even be a part of
the active site.[91] The simplest gates serve
as filters that discriminate between potential substrates and thus
play an important role in controlling enzyme selectivity. More advanced
systems can prevent substrate entry when the active site residues
are not properly oriented, e.g., in enzymes that require conformational
changes before substrate binding. Many enzymes have gates at the entrance
to their active sites, including acetylcholinesterase,[60] imidazole glycerol phosphate synthase,[38] glutamate synthase,[46] toluene-o-xylene monooxygenase,[91] monooxygenase,[92] choline oxidase,[31] NiFe hydrogenases,[21] carbonic anhydrases,[93] formiminotransferase-cyclodeaminase,[94] type III polyketide synthases,[95] and FabZ β-hydroxyacyl–acyl carrier protein
dehydratase.[54]
Tunnel
Entrance and Tunnel Bottleneck
The ability of ligands and
solvent molecules to move from the media
surrounding the protein to the active site can be controlled by gates
located at any point along the tunnel. Gating residues may be situated
at the tunnel entrance. However, it is more common to find them at
the tunnel bottleneck. The tunnel entrance refers to the first shell
of residues that define the tunnel and have contact with the bulk
solvent. The tunnel bottleneck refers to the narrowest part that can
be positioned anywhere along the tunnel (Figure 6). Even a single large residue whose side chain can project into
the interior of the tunnel can exert efficient control over the access
pathway. One might speculate that it might be favorable to have gates
located inside tunnels because this allows their position to be more
tightly controlled; their movements are restricted by the surrounding
residues, and both the open and the closed conformations can be stabilized
via interactions with neighboring amino acids. In contrast, residues
located on the surface of the protein possess more degrees of freedom,
and it is rare for both the open and the closed conformations to be
stabilized. Examples of such gates inside the tunnels can be found
in cholesterol oxidase type I,[57] toluene-4-monooxygenase,[61] undecaprenyl-pyrophosphate synthase,[27] homoprotocatechuate 2,3-dioxygenase,[63] 4-hydroxy-2-ketovalerate aldolase/acylating
acetaldehyde dehydrogenase,[96] epoxide hydrolase
from Aspergillus niger M200,[26] and FabZ β-hydroxyacyl-acyl carrier protein dehydratase.[54] Similarly, gates can be situated in the bottlenecks
of tunnels connecting two active sites. Gates in such positions are
essential for enzymes that catalyze two reactions requiring different
environments, such as glucosamine 6 phosphate synthase,[45] imidazole glycerol phosphate synthase,[38] cytidine triphosphate synthetase,[52] carbamoyl phosphate synthetase,[36] and glutamate synthases.[46]
Cofactor Cavity
Gates can be positioned
at the interface of the active site and the cofactor cavity, allowing
for more fine-grained control during the reaction. In NADH oxidase,
the W47 residue acts as a gate that controls the accessibility of
the FAD flavin ring and thus plays a crucial role during the catalytic
cycle. The closed conformation is stabilized by hydrogen bonds between
the cofactor and the peptide backbone, whereas stabilization of the
open form may be advantageous during the initial steps of substrate
binding since it is believed to slow down product dissociation.[22,97] Other gates of this type have been reported in 3-hydroxybenzoate
hydroxylase,[98] 4-hydroxy-2-ketovalerate
aldolase/acylating acetaldehyde dehydrogenase,[96] and cholesterol oxidase type I[57] and type II.[99] Moreover, cofactors themselves
can also function as gates. The FAD cofactor of digeranylgeranylglycerophospholipid
reductase has two different conformations, referred to as the “in”
and “out” conformations. In the “in” conformation,
the tunnel is blocked by FAD. To open the tunnel, FAD has to adopt
the “out” conformation.[100]Gates are most commonly located at the tunnel entry and the
tunnel bottleneck (51%). This is to be expected because the bottleneck
represents the narrowest point of the tunnel, and its diameter often
dictates the tunnel’s permeability. Another common location
is the entrance to the active site cavity (28%). Gates at the entrance
of the cofactor cavity are less common (5%), which is not surprising
since not all enzymes have a cofactor cavity. In 16% of the cases
studied, the gate was not located within any of these functional regions
(Figure 7).
Figure 7
Distribution of gate locations within
protein structures based
on analysis of 71 proteins with 129 gates. Detailed description of
the analyzed proteins is provided in Table 3.
Distribution of gate locations within
protein structures based
on analysis of 71 proteins with 129 gates. Detailed description of
the analyzed proteins is provided in Table 3.
Engineering
of Gates
The average rate of evolution of the gating residues
in the cytochrome
P450 family is significantly greater than that for the protein sequence
as a whole.[59] The high rate of evolution
at the gating residues suggests that gate engineering may be an attractive
alternative to other rational enzyme design strategies. This idea
is supported by a few observations: (i) the gates are often spatially
separated from the active site, and so mutations at the gating residues
should not be deleterious to protein function, (ii) the opening and
closing of the access pathways can affect ligand exchange and thus
enzyme activity and selectivity, and (iii) gate modification can modulate
the solvent’s ability to access the active site, which in turn
affects solvation and stabilization of the transition state and also
product release. In the following section, we describe selected cases
in which an enzyme’s catalytic properties have been successfully
altered by modifying its gates.
Gate Modification
Gates can be modified
by substitution of the gating residues, hinge residues, or anchoring
residues depending on the nature of the gate in question. Modification
of the gating residues has been shown to change the selectivity and
activity of toluene-o-xylene monooxygenase, with
the E214G mutation improving oxidation of p-nitrophenol
by a factor of 15.[101] A similar improvement
in overall activity was achieved in a lipase from Burkholderia
cepacia by the mutations L17S + L287I. This double mutant
also exhibited a 10-fold increase in enantioselectivity compared to
the wild-type enzyme.[86] The T78F or P76F
mutations in imidazole glycerol phosphate synthase override some of
the control exerted by the wild-type gate and block the passage of
ammonia through the tunnel.[38] In NiFe hydrogenases,
mutations of the V74 and V74 + L122 residues changes the rates of
transport for H2, CO, and O2, thereby modulating
the overall rate of reaction.[21]Gates
that incorporate secondary structure elements are dependent on hinge
and anchoring regions. It has been demonstrated that modification
of the hinge region can change an enzyme’s activity and selectivity.
Notably, the Q230P mutation in rabbit 20A-hydroxysteroid dehydrogenase
decreases the flexibility of a key loop and thereby changes its selectivity.[37] Similarly, access to the active site in the
HIV-1 protease is controlled by two β-turn flaps.[89] Results from Brownian dynamics simulations suggest
that the G48V + V82A or I84V + L90M mutations in this enzyme reduce
the likelihood that the active site will be exposed at any given point
in time from 14% in the wild type to 2% in the mutants.[89] The importance of the anchoring residues in
the HIV-1 protease was demonstrated by the F53L mutation, which generates
a semiopen conformation due to removal of the stabilizing F53–I50
interaction.[89]
Gate
Removal
Removing gates typically
increases substrate and product exchange rates but also allows more
extensive access of water molecules to protein tunnels and cavities.
The overall effect of gate removal is therefore equal to the combined
effects of these two processes. Gate deletion has been shown to increase
the rate of substrate binding to tryptophan synthase.[102] The F280C and F280S mutations both increased
the rate of indole binding by a factor of 2. Similarly, the T78A mutation
allowed ammonia to rapidly pass through the tunnel in imidazole glycerol
phosphate synthase.[38] The R239A mutation
in the cyclase caused a 1000-fold decrease in the enzyme’s kcat/Km value and
decoupling of the reaction.[38] This dramatic
change in enzyme catalytic efficiency was attributed to creation
of a new route for ammonia release. A similar leakage of ammonia was
caused by the G359F and G359Y mutations in carbamoyl phosphate synthetase.[103,104] Negative consequences of gate removal were also observed for the
FabZ-β-hydroxyacyl-acyl carrier protein dehydratase (HpFabZ),[54] in which the Y100A mutation leaves the active
site completely exposed to the bulk solvent. As a result, the acyl
carrier protein binds to the HpFabZ Y100A mutant much more strongly
than to the wild-type HpFabZ, decreasing the mutant enzyme’s
activity by more than 50% due to the very slow dissociation of the
acyl carrier protein.In some cases, gate removal enables bulky
substrates to access the active site cavity. Mutations D285I and D285Q
in toluene-4-monooxygenase improved its ability to oxidize the large
and bulky substrates 2-phenylethanol and methyl p-tolyl sulfide by factors of 8 and 11, respectively, while the D285S
mutation improved the rate of styrene oxidation 1.7-fold.[105] The L137A mutation in undecaprenyl-pyrophosphate
synthase removed the bottom of the tunnel in this enzyme, allowing
formation of products with longer chain lengths.[27] A similar situation was encountered in type III polyketide
synthases from Aloe arborescens PCS, in which the
M207 residue controls the number of condensations of malonyl-CoA.[106] The M207G mutation opened a connection between
the tunnel and two hidden pockets located behind the active site,
resulting in formation of extended products. Further product elongation
was achieved with the triple mutant F80A + Y82A + M207G.
Gate Insertion
To best of our knowledge,
there have been no reports of an intentional introduction of a new
gate into an enzyme structure. However, there have been studies in
which an access tunnel was systematically modified with multiple substitutions,
and it is reasonable to expect that some of these mutations might
have created new gates. More research will clearly be needed to confirm
this expectation. To verify successful intentional insertion of a
new gate into a protein structure, it would be necessary to confirm
the existence of both the open and the closed conformations at a position
where previously only a single conformation could be adopted.Site-directed mutagenesis targeting specific residues at various
positions along the access tunnel of Candida rugosa lipase has been used to alter the acceptable substrate chain length
for this enzyme. The mutants for which this was observed were P246F,
L413F, L410W, L410F + S300E, and L410F + S365L.[107] We note that the aromatic residues F and W, which are common
in wing and swinging door gates, were introduced in each of these
variants.In another study, the residue L177 that is located
near the entrance
to the access tunnel of the haloalkane dehalogenase LinB from Sphingobium japonicum UT26 was substituted with all of the
natural amino acids, yielding 19 mutants with significantly altered
substrate specificity and activity.[108] Preliminary
computational analyses of these variants using molecular dynamics
revealed that the two residues possessing a single aromatic ring (F
and Y) exhibited large fluctuations, as might be expected for gating.Residue A217 is located at the entrance to the tunnel in the epoxide
hydrolase EH from Aspergillus niger M200. This residue
was substituted with C, E, G, L, P, Q, R, T, and V, and the effect
of each mutation on the enzyme’s activity and enantioselectivity
was studied.[26] The mutants exhibited different
enantioselectivity and activity relative to the wild type. For instance,
the activity of the A217G mutation toward allyl glycidyl ether was
lower than that of the wild type by a factor of 33, whereas the A217V
mutation increased activity toward this substrate 6.6-fold.Residues I135, W141, C176, V245, L246, and Y273 are positioned
close to the entrance to the main and side tunnels of the haloalkane
dehalogenase DhaA from Rhodococcus rhodochrous NCIMB
13064. These residues were simultaneously permuted in an attempt to
improve this enzyme’s
activity against 1,2,3-trichloropropane. The most successful mutant,
which featured the I135F, C176Y, V245F, L246I, and Y273F substitutions,
showed 26-fold greater activity toward the target substrate than did
the wild type.[109] In this mutant, three
aromatic residues were introduced in place of aliphatic ones in the
vicinity of the tunnels. Computational analysis of product release
from the mutant suggests that substitutions introduced a transient
rather than permanent structural feature and gating residues prevented
access of water to the active site.[110] Crystallographic
analysis of the mutant revealed two distinct conformations for the
Y176 side chain.[110]
Conclusions
This review highlights the importance of gates
in enzymes. Gates
play vital roles in controlling the catalytic activity and selectivity
of enzymes and are more common in protein structures than is generally
thought. In particular, gates control substrate access to the active
site and product release, prevent or restrict solvent access to specific
regions of the protein, and can synchronize processes occurring in
distinct parts of the enzyme. Our literature survey of 129 gates in
71 enzymes revealed a large variety of systems with sophisticated
structures. We presented a rigorous definition of gates and established
a new scheme for their classification. The large number of inspected
cases allowed us to build a catalogue of gates assigned to six distinct
classes—wings, swinging doors, apertures, drawbridges, double
drawbridges, and shells—with three different functions and
three distinguishable locations. We also presented summary statistics
that give a preliminary overview of the propensity of specific amino
acid residues to occur in particular gate classes. The proposed classification
scheme can be easily extended and updated but even in its present
form can provide guidance for analysis and engineering of gates in
biomolecular systems.The biochemical relevance and specific
location of gates within
protein structures make them attractive targets for protein engineering.
Attempts to rationally redesign gates typically involve computer-assisted
gate identification followed by modification using focused directed
evolution. This approach is compatible with a recent trend in protein
engineering that stresses construction of small and smart libraries.
Gate modification and deletion have been demonstrated in numerous
cases, but the intentional insertion of new gates remains a challenge.
Convenient methods for identifying gates in protein structures are
essential prerequisites for their engineering. In silico, this can
be achieved by coupling the software tools developed for describing
pathways to tools developed for study of protein dynamics. Of the
available experimental techniques, NMR spectroscopy is particularly
suitable for analysis of highly dynamic protein structures and can
be expected to play an indispensable role in the study of gate dynamics
at the atomic level. The field would also benefit from development
of new experimental techniques for monitoring the passage of ligands
through the protein pathways. One day it will be possible to control
the catalytic properties of enzymes by rational engineering of their
gates. To achieve this goal, we have to learn how gates evolved, how
they interact with the other parts of the protein structure as well
as with the ligand and solvent molecules, and how they fulfill their
biological functions.
Authors: Vladimir Yu Torbeev; H Raghuraman; Kalyaneswar Mandal; Sanjib Senapati; Eduardo Perozo; Stephen B H Kent Journal: J Am Chem Soc Date: 2009-01-28 Impact factor: 15.419
Authors: Gira Bhabha; Jeeyeon Lee; Damian C Ekiert; Jongsik Gam; Ian A Wilson; H Jane Dyson; Stephen J Benkovic; Peter E Wright Journal: Science Date: 2011-04-08 Impact factor: 47.728
Authors: Pierre-Pol Liebgott; Fanny Leroux; Bénédicte Burlat; Sébastien Dementin; Carole Baffert; Thomas Lautier; Vincent Fourmond; Pierre Ceccaldi; Christine Cavazza; Isabelle Meynial-Salles; Philippe Soucaille; Juan Carlos Fontecilla-Camps; Bruno Guigliarelli; Patrick Bertrand; Marc Rousset; Christophe Léger Journal: Nat Chem Biol Date: 2009-12-06 Impact factor: 15.040
Authors: Christopher G Mayne; Mark J Arcario; Paween Mahinthichaichan; Javier L Baylon; Josh V Vermaas; Latifeh Navidpour; Po-Chao Wen; Sundarapandian Thangapandian; Emad Tajkhorshid Journal: Biochim Biophys Acta Date: 2016-05-06
Authors: Diego Javier Zea; Alexander Miguel Monzon; Claudia Gonzalez; María Silvina Fornasari; Silvio C E Tosatto; Gustavo Parisi Journal: Protein Sci Date: 2016-04-18 Impact factor: 6.725