Enzyme function requires that enzyme structures be dynamic. Substrate binding, product release, and transition state stabilization typically involve different enzyme conformers. Furthermore, in multistep enzyme-catalyzed reactions, more than one enzyme conformation may be important for stabilizing different transition states. While X-ray crystallography provides the most detailed structural information of any current methodology, X-ray crystal structures of enzymes capture only those conformations that fit into the crystal lattice, which may or may not be relevant to function. Solution nuclear magnetic resonance (NMR) methods can provide an alternative approach to characterizing enzymes under nonperturbing and controllable conditions, allowing one to identify and localize dynamic processes that are important to function. However, many enzymes are too large for standard approaches to making sequential resonance assignments, a critical first step in analyzing and interpreting the wealth of information inherent in NMR spectra. This Account describes our long-standing NMR-based research into structural and dynamic aspects of function in the cytochrome P450 monooxygenase superfamily. These heme-containing enzymes typically catalyze the oxidation of unactivated C-H and C═C bonds in a multitude of substrates, often with complete regio- and stereospecificity. Over 600 000 genes in GenBank have been assigned to P450s, yet all known P450 structures exhibit a highly conserved and unique fold. This combination of functional and structural conservation with a vast substrate clientele, each substrate having multiple possible sites for oxidation, makes the P450s a unique target for understanding the role of enzyme structure and dynamics in determining a particular substrate-product combination. P450s are large by solution NMR standards, requiring us to develop specialized approaches for making sequential resonance assignments and interpreting the spectral changes that occur as a function of changing conditions (e.g., oxidation and spin state changes, ligand, substrate or effector binding). Solution conformations are characterized by the fitting of residual dipolar couplings (RDCs) measured for sequence-specifically assigned amide N-H correlations to alignment tensors optimized in the course of restrained molecular dynamics (MD) simulations. The conformational ensembles obtained by such RDC-restrained simulations, which we call "soft annealing", are then tested by site-directed mutation and spectroscopic and activity assays for relevance. These efforts have gained us insights into cryptic conformational changes associated with substrate and redox partner binding that were not suspected from crystal structures, but were shown by subsequent work to be relevant to function. Furthermore, it appears that many of these changes can be generalized to P450s besides those that we have characterized, providing guidance for enzyme engineering efforts. While past research was primarily directed at the more tractable prokaryotic P450s, our current efforts are aimed at medically relevant human enzymes, including CYP17A1, CYP2D6, and CYP3A4.
Enzyme function requires that enzyme structures be dynamic. Substrate binding, product release, and transition state stabilization typically involve different enzyme conformers. Furthermore, in multistep enzyme-catalyzed reactions, more than one enzyme conformation may be important for stabilizing different transition states. While X-ray crystallography provides the most detailed structural information of any current methodology, X-ray crystal structures of enzymes capture only those conformations that fit into the crystal lattice, which may or may not be relevant to function. Solution nuclear magnetic resonance (NMR) methods can provide an alternative approach to characterizing enzymes under nonperturbing and controllable conditions, allowing one to identify and localize dynamic processes that are important to function. However, many enzymes are too large for standard approaches to making sequential resonance assignments, a critical first step in analyzing and interpreting the wealth of information inherent in NMR spectra. This Account describes our long-standing NMR-based research into structural and dynamic aspects of function in the cytochrome P450 monooxygenase superfamily. These heme-containing enzymes typically catalyze the oxidation of unactivated C-H and C═C bonds in a multitude of substrates, often with complete regio- and stereospecificity. Over 600 000 genes in GenBank have been assigned to P450s, yet all known P450 structures exhibit a highly conserved and unique fold. This combination of functional and structural conservation with a vast substrate clientele, each substrate having multiple possible sites for oxidation, makes the P450s a unique target for understanding the role of enzyme structure and dynamics in determining a particular substrate-product combination. P450s are large by solution NMR standards, requiring us to develop specialized approaches for making sequential resonance assignments and interpreting the spectral changes that occur as a function of changing conditions (e.g., oxidation and spin state changes, ligand, substrate or effector binding). Solution conformations are characterized by the fitting of residual dipolar couplings (RDCs) measured for sequence-specifically assigned amide N-H correlations to alignment tensors optimized in the course of restrained molecular dynamics (MD) simulations. The conformational ensembles obtained by such RDC-restrained simulations, which we call "soft annealing", are then tested by site-directed mutation and spectroscopic and activity assays for relevance. These efforts have gained us insights into cryptic conformational changes associated with substrate and redox partner binding that were not suspected from crystal structures, but were shown by subsequent work to be relevant to function. Furthermore, it appears that many of these changes can be generalized to P450s besides those that we have characterized, providing guidance for enzyme engineering efforts. While past research was primarily directed at the more tractable prokaryotic P450s, our current efforts are aimed at medically relevant human enzymes, including CYP17A1, CYP2D6, and CYP3A4.
Dear Enzymologist,It is time we had a talk. Yes, “a
picture is worth a thousand
words”, and “seeing is believing”. The crystal
structure of your enzyme exposed the secrets of the active site, identified
critical residues, and let you dream of inhibitor design and enzyme
engineering. You love your structure, but you are troubled. Why does
substrate bind in the wrong orientation or not at all? What about
those allosteric, synergistic, and antagonistic effects that you see
in your assays about which the structure is mum? You screened potential
inhibitors by the thousands against the structure, but either hits
led to nothing, or when you crystallized the resulting complex, it
did not look at all like what was predicted.Here is the problem:
Enzymes are dynamic and occupy multiple conformations
at their working temperatures. But the crystallization process is
also a purification: Only those conformers that fit into the growing
lattice will be accepted. Unfortunately, there is no way of knowing a priori whether the crystallographic conformation is relevant
to catalysis or, if it is, which step in a reaction pathway it represents.Nuclear magnetic resonance (NMR) can provide insights into solution
structure and dynamics when static methods are insufficient. NMR allows
one to control variables such as composition, temperature, solvent,
pH, and other factors affecting dynamic processes and conformations
in a (relatively) nonperturbing environment. However, while NMR analysis
of proteins 10–30 kDa in size is straightforward, enzymes are
often larger than this, and much hard work is needed in order to reap
the benefits that NMR promises.Our group’s research
for the last 30 years has been aimed
at using NMR to improve our understanding of enzyme structure and
dynamics and to refine the techniques we use in order to accomplish
that goal. We have focused on cytochrome P450 monooxygenases, heme-containing
enzymes that typically catalyze the hydroxylation of unactivated C–H
bonds (Scheme ), although
they can also function as epoxidases and demethylases and in the catalysis
of oxidative couplings.
Scheme 1
The remarkable structural conservation
combined with the vast array
of substrate/product combinations exhibited by the P450 superfamily
makes it an ideal target for understanding molecular recognition in
biological systems.[1] In this report, we
will attempt a comprehensive description of the insights we have gained
into P450 structure and dynamics, as well as the methods we have used
or developed along the way.
Effector Function in Monooxygenases: The
Putidaredoxin-Cytochrome
P450cam Couple
We began our efforts by focusing on the cytochrome P450cam (CYP101A1) reaction cycle. CYP101A1 catalyzes the 5-exo hydroxylation of d-camphor, the first step in camphor
catabolism by the soil bacterium Pseudomonas putida.[2] CYP101A1 was the first P450 for which
a crystal structure was determined,[3] and
most of what is known about the mechanism of O2 activation
by P450s has been learned with this enzyme. The reaction catalyzed
by the 46.7 kDa CYP101A1 (Figure ) is remarkable for its efficiency, specificity, and
regulation. The enzyme is better than 99% efficient in its use of
reducing equivalents, and camphor hydroxylation is completely regio-
and stereospecific. The reaction requires two additional proteins,
putidaredoxin reductase (PdR), that oxidizes NADH to provide the needed
electrons, and the 11.6 kDa Fe2S2-containing
putidaredoxin (Pdx), which acts as an electron shuttle between PdR
and CYP101A1.
Figure 1
Catalytic cycle of CYP101A1. Binding of d-camphor
(upper left) induces a spin state shift from low spin (ls) to high-spin
(hs) permitting the first electron transfer. O2 binds to
the reduced enzyme, and binding of Pdx generates the catalytically
competent complex to produce 5-exo-hydroxycamphor
(lower left). In the absence of Pdx, the O2 complex decomposes
to superoxide and the enzyme–substrate complex (dotted line,
right). See text for details.
Catalytic cycle of CYP101A1. Binding of d-camphor
(upper left) induces a spin state shift from low spin (ls) to high-spin
(hs) permitting the first electron transfer. O2 binds to
the reduced enzyme, and binding of Pdx generates the catalytically
competent complex to produce 5-exo-hydroxycamphor
(lower left). In the absence of Pdx, the O2 complex decomposes
to superoxide and the enzyme–substrate complex (dotted line,
right). See text for details.Upon d-camphor binding, the hemeiron in
CYP101A1
shifts from ferric low spin (S = 1/2) to the high
spin (S = 5/2) form,[4] shifting
the reduction potential from −303 mV to −173 mV. As
the potential of reduced Pdx (Pdxr) is −240 mV,
electron transfer from Pdxr to high spin CYP101A1 is nearly
iso-energetic (Keq = ∼2.3), maximizing
the rate of electron transfer. Substrate binding is thus an on–off
switch; Pdxr does not reduce CYP101A1 unless substrate
is bound. Upon reduction, the hemeiron can ligate O2 required
for turnover. However, Pdx is not merely a reductant but also acts
as an effector for turnover. In fact, the first reduction of CYP101A1
can be done by any reductant with a sufficiently negative potential.
In the absence of Pdx, however, the O2–CYP101A1–substrate
complex (CYPr–S–O2) slowly decomposes
to superoxide and CYPo–S.[5] Binding of Pdxr drives a rate determining conformational
change in CYPr–S–O2, so the Pdxr–CYPr–S–O2 complex
is the enzymatically competent species for turnover. But what is the
nature of that complex, and what is the conformational change that
Pdx binding drives?
Pdx Structure and Dynamics
A necessary
first step was
to determine the structure of Pdx. The CYP101A1 structure had been
obtained in a number of forms by the late 1980s, but Pdx had defied
crystallographic analysis, nor were there any related ferredoxin structures
available. Could a Pdx structure be obtained by NMR? By 1989, homonuclear
(i.e., 1H, 1H) two-dimensional (2D) NMR methods
had advanced sufficiently that the answer was a qualified yes. Amino
acid 1Hspin systems could be established using through-bond
experiments such as TOCSY and COSY, and the spin systems could be
sequentially and structurally connected using through-space (nuclear
Overhauser effect, NOE) correlations. We published preliminary 1H sequential resonance assignments for oxidized Pdx (Pdxo), along with evidence for an extensive β-sheet structure
in 1991.[6] By 1994, graduate researcher
Xiaomei Ye had accumulated sufficient NOEs to publish a structure
of Pdxo,[7] the first de novo metalloprotein structure to be solved by NMR. Paramagnetic
broadening of 1H resonances within a ∼8 Å radius
of the Fe2S2 cluster required us to model the
cluster into the structure, and while Sophia Kazanis was able to prepare
a diamagnetic form of Pdx by reconstitution with gallium salts,[8,9] we were nonetheless relieved when a crystal structure for the related
adrenodoxin was published,[10] showing that
our modeling of the metal center was accurate.Having in hand
our NMR-derived structure for Pdxo, it was an irresistible
temptation to propose a model for the Pdx–CYP complex. Residues
on both proteins had been identified as important for complexation,
Arg109 and Arg112, both on the C-helix of CYP101A1, and the Pdx C-terminal
Trp106.[11,12] We assumed that positively charged Arg residues
would interact with carboxylates on Pdx, that the indole of Trp106
would interact with hydrophobic residues on the surface of CYP101A1,
and that the redox centers of both proteins would be close to each
other, leading us to the model shown in close-up in Figure .[13] The most predictive feature of this model was the appearance of
a salt bridge between Arg112 on CYP101A1 and Asp38 on Pdx. Subsequent
mutagenesis showed that Asp38 is important in mediating complexation,[14] and recently, crystallographic structures of
the Pdx–CYP101A1 complex confirmed the interaction.[15,16]
Figure 2
Close-up
of the interface between complexed Pdx and CYP101A1 proposed
in ref (13). The Pdx
polypeptide is shown in cyan, CYP101A1 in green. The 12 Å vector
between the heme iron and Fe2S2 iron adjacent
to the surface is shown as a dotted line, as are salt bridge interactions
between Asp38 and Arg112. (PDB-format file available from the authors
upon request.)
Close-up
of the interface between complexed Pdx and CYP101A1 proposed
in ref (13). The Pdx
polypeptide is shown in cyan, CYP101A1 in green. The 12 Å vector
between the hemeiron and Fe2S2iron adjacent
to the surface is shown as a dotted line, as are salt bridge interactions
between Asp38 and Arg112. (PDB-format file available from the authors
upon request.)This picture was far
from complete, however. The relevant complex
is between reduced Pdx (Pdxr) and CYPr–S–O2. We found that reduction of Pdx involved significant changes
in local dynamics, particularly near the C-terminal Trp106 and around
the metal center.[17] Using 1H,15N HSQC experiments, we showed that amide proton exchange
in multiple regions of Pdx is slowed upon reduction.[18] A painstaking selective 15N, 13C
double labeling effort by Nitin Jain allowed us to assign backbone
resonances in the vicinity of the Fe2S2 cluster
of Pdx and directly link a contraction of the structure in the metal
cluster binding loop upon reduction to the overall redox-dependent
changes in dynamics.[19,20] We later showed that redox-dependent
dynamics are not peculiar to Pdx but a general feature of related
ferredoxins.[21]The close coupling
between oxidation state and dynamics in Pdx
provided other insights. The formation of a Pdx–CYP101A1 complex
stable enough to drive a conformational change would require “freezing”
some degrees of freedom in Pdx, which comes at an entropy price. If
some of that price was prepaid upon reduction, the enthalpy change
upon complex formation might be sufficient to drive whatever conformational
changes occur in CYP101A1. Furthermore, upon reoxidization, dissociation
would be encouraged by the increased dynamics of Pdxo,
minimizing back electron transfer. It also rationalized our difficulties
with Pdx crystallography. Pdxo crystallizes readily, forming
large black crystals we referred to as “the Borg” (Star Trek fans will understand the reference). “The
Borg” diffracted immediately after insertion into the X-ray
beam, but diffraction quality decreased rapidly upon continued X-ray
exposure. As reduction causes contraction of the Pdx structure,[20] it was likely that photoelectron reduction increased
the mobility of individual Pdx molecules, decreasing order in the
crystal. More recently, crystallographic structures have been solved
for Pdx that show redox-dependent conformational changes in the vicinity
of the metal cluster.[22,23]
Pdx-Driven Changes in CYP101A1
Of course, Pdx structure
and dynamics tells less than half of the story. For a complete picture,
we needed to look at CYP101A1 directly. Fortunately, solution NMR
methods improved rapidly during the 1990s, with the introduction of
three-dimensional (3D) NMR experiments that allowed sequential 1H, 15N, and 13C resonance assignments
of the polypeptide without requiring NOE connectivity.[24] This gave us hope that the 46.7 kDa CYP101A1
might be tackled.However, there were still challenges. The
experiments we use for sequential assignments (HNCA, HN(CO)CA, HNCACB)
are “there and back again” sequences: That is, coherence
is first generated on the amide 1H and passed via the 15N to either the adjacent 13Cα carbon (HNCA)
or through the amide carbonyl to the 13Cα of the
preceding residue (HN(CO)CA). In HNCACB, the coherence is transferred
a step further, to the 13Cβ. After frequency labeling
of the connected nuclei, coherence is returned to the amide 1H for detection. The success of the experiment depends upon coherences
being sufficiently long-lived that they can still be detected at the
end of the sequence. 13C spins with attached protons are
particularly susceptible to rapid coherence loss, and coherence lifetimes
get shorter with increasing molecular weight. This, combined with
sheer spectral complexity, is the primary reason that NMR becomes
more difficult as proteins get bigger.In order for these experiments
to work with CYP101A1, the density
of 1H spins present in the sample must be minimized. Besides
uniform 15N and 13C labeling, then, the enzyme
requires perdeuteration (u-2H), replacing all (>99%)
of
the protons in the sample with deuterons. 2H is much less
efficient than 1H in stimulating 13C relaxation,
and coherence lifetimes are increased sufficiently to allow “there
and back again” experiments. Perdeuteration requires that growth
media be prepared with 2H2O, uniformly 13C, 2H labeled carbon sources, and salts lyophilized
to remove 1H2O. Needless to say, our standard
expression strains are less than thrilled with the menu we provide
and require “training” growths with increasing concentrations
of 2H2O before we grow in the fully labeled
(and expensive) media.Protein produced in this fashion is fully perdeuterated,
with 2H present even at the amide positions, where 1H is required for NMR. For proteins that can be unfolded/refolded
in good yield, this problem is easily overcome. Sadly, P450s are notoriously
resistant to this treatment, so we must rely on whatever 1H/2H exchange occurs during purification. Slow amide exchange,
along with the presence of 35 prolines (which lack an amide NH) in
the 414-residue CYP101A1, introduces multiple interruptions in the 1H–15N-dependent assignment process. The
paramagnetism of the heme further complicates the assignment process
due to efficient relaxation of nearby protons.To address these
problems, we prepared a series of CYP101A1 samples
containing type-selective 15N labels for most amino acids.
These samples were used to classify 1H–15N correlations by residue type, which were used as starting points
for sequential assignments and to confirm tentative assignments. Another
CYP101A1 sample was prepared with 13C-Pro and (u-15N) labeling. An HNCO experiment, requiring a 1H–15N correlation to be preceded by a 13C carbonyl
for detection, identified Pro–Xaa correlations. To avoid paramagnetic
relaxation, most of our assignments were made for the diamagnetic
reduced CO-bound CYP101A1 (CYPr–S–CO).By 2003, we had made sufficient progress to begin investigating
Pdx–CYP interactions. Titrating Pdxr into CYPr–S–CO and monitoring by 1H,15N TROSY-HSQC,[25] we identified many CYP101A1
residues perturbed upon Pdxr binding. While some perturbations
were unsurprising, particularly in helix C of the P450 (Figure ), we also saw changes in and
near the active site, opposite from the proposed Pdx binding site
on the proximal face (Figure ). These included I helix residues adjacent to the heme, as
well as residues in the B–B′ loop and B′ helices
bordering what was assumed to be the entrance to the active site.[26] Based on these observations, we proposed a “doorstop”
model for the effector activity of Pdx, in which the crystallographic
conformation of CYP101A1 represented the “closed door”
(and catalytically competent) form of the enzyme.[27] In solution, more open CYP conformers predominate, with
less impeded active site access, but are not catalytically competent.
The binding of Pdx would drive the formation of the closed form and
permit turnover after the second electron transfer.[25]
Figure 3
NMR-derived solution structure of camphor-bound CYP101A1 (PDB entry 2L8M). Secondary structures
are labeled as described by Poulos (ref (3)). Heme is shown as sticks (background) and substrate d-camphor in the foreground as translucent space-filling
spheres. Thus, the foreground is the distal (active site) side of
the heme.
NMR-derived solution structure of camphor-bound CYP101A1 (PDB entry 2L8M). Secondary structures
are labeled as described by Poulos (ref (3)). Heme is shown as sticks (background) and substrate d-camphor in the foreground as translucent space-filling
spheres. Thus, the foreground is the distal (active site) side of
the heme.A second titration series used
(u-2H)-Pdxr and (u-2H)-CYPr–S–CO, so the
most prominent 1H NMR signals were those of d-camphor.[28] Ring current shifts were used
to characterize the orientation of camphor in the active site with
and without Pdxr bound. Notably, the 1H shifts
of the three camphor methyl groups best fit the crystallographic orientation
when Pdx is bound. In the absence of Pdx, camphor reorients so that
the 5-exo C–H bond is too far from the hemeiron for the observed regiochemistry.[28] We observed that the Pdx-driven conformational change in CYP101A1
is poised between slow and fast exchange on the 1H shift
time scale: Resonances moving more than ∼180 Hz during the
titration show slow exchange behavior (two discrete signals for Pdx-free
and -bound forms), while those moving less than ∼120 Hz are
in fast exchange (a single signal moving through the titration course).
We interpreted this as meaning that all of the Pdx-dependent perturbations
were driven by the same event, taking place at a rate of ∼700
s–1 at half-saturation (∼2:1 Pdx/CYP). On
the time scales of most protein motions, this rate is positively glacial.
However, the cis–trans isomerization of an
Xaa–Propeptide bond can be this slow, so we looked for prolines
in CYP101A1 that might trigger the conformational change. An obvious
candidate was Pro89, at the beginning of the B′ helix, adjacent
to the proposed active site access. In most crystallographic structures
of CYP101A1, the Ile88–Pro89 bond is cis,
resulting in the “closed door” active site, although
a recent structure of the tethered Pdx–CYP101A1 complex shows
the Ile88–Pro89 bond as distorted trans.[29] Mutation of Pro89 to Ile (P89I) resulted in d-camphor bound in both orientations, the “closed-door”
orientation driven by Pdx binding as well as the open Pdx-free form.[30,31] We proposed that the trans–cis isomerization of the Ile88–Pro89 bond is driven by Pdx binding,
and the isomerization is the origin of the perturbations observed
upon formation of the Pdxr–CYP complex. Many P450s
have an Ile (or Val)–Pro combination at the N-terminus of the
B′ helix, suggesting that this isomerization is a common means
of controlling active site access.[1] Others
have a similar arrangement in the F–G loop, which could serve
the same function.[1]The nature of
the complex formed by Pdx and CYP101A1 has continued
to be a focus of interest in recent years. Spectroscopic studies have
on one hand confirmed that the interaction between the two proteins
is redox-dependent,[32] while crystallographic
studies differ on whether the open or closed states of CYP101A1 predominate
in the complex.[15,16,29] Although CYP–S–CO and CYP–S–O2 are both in the same oxidation state, they differ in electronic
configuration, and the instability of the oxygenated complex precludes
direct NMR observation of the complexation of that species with Pdxr. As such, some open questions remain concerning the precise
nature of the complex.
Substrate-Dependent Conformational Changes
in CYP101A1
We next turned our attention to the effects of
substrate on CYP101A1
conformational selection. We found that removal of substrate or replacement
by substrate analogs affected multiple regions of the enzyme, including
the I helix adjacent to the active site and the C–D loop on
the proximal side of the enzyme. One surprising set of changes was
observed in the K′ helix (Figure ).[33] The K′
helix is a short helix that connects the β-meander (which includes
the axial heme thiolate ligand) to the β3 sheet bordering the
active site. While residues in the β3 sheet show little change
upon substrate binding, amides in the K′ helix show relatively
large perturbations, even though the K′ residue nearest to
the active site, Ser325, is ∼14 Å from bound substrate.
We proposed that the K′ helix (which is highly conserved across
the P450 superfamily) adjusts the position of the β3 sheet in
response to substrate binding by changing hydrogen bonding patterns
from that of an α-helix to that of a 3-10 helix (Figure ). This changes the length
of the K′ helix, moving the β3 sheet as a unit in response
to the presence of substrate.
Figure 4
Changing hydrogen bond patterns in the K′
helix of CYP101A1
(PDB entry 2L8M) adjusts the position of substrate contact Val295 (β3 sheet)
relative to substrate camphor (in purple spheres) by changing the
length of the K′ helix. Camphor is in the foreground of the
figure, with the heme behind. Direction of helix length adjustment
is shown as gray scale arrow parallel to the K′ helix. See
text for details.
Changing hydrogen bond patterns in the K′
helix of CYP101A1
(PDB entry 2L8M) adjusts the position of substrate contact Val295 (β3 sheet)
relative to substrate camphor (in purple spheres) by changing the
length of the K′ helix. Camphor is in the foreground of the
figure, with the heme behind. Direction of helix length adjustment
is shown as gray scale arrow parallel to the K′ helix. See
text for details.
Solution Structural Ensembles
of Substrate-Free and -Bound CYP101A1
While 1H,15N HSQC spectra provide a rapid
residue-specific means of localizing responses to events such as substrate
or effector binding, the observed spectral changes are responses to
changes in hydrogen bond lengths or angles or other local electronic
factors rather than conformational changes per se. In order to visualize the structural changes causing the perturbations,
we required direct structural information. This was obtained in the
form of residual dipolar couplings (RDCs) measured for backbone amide 1H–15N correlations.In solution NMR,
coherences are transferred via through-bond J-couplings
that are independent of magnetic field strength. But nuclear spins
also interact via through-space (dipolar) coupling. As the interaction
between two bar magnets depends upon their relative orientations,
so does the sign and magnitude of the dipolar coupling between an
amide 1H and 15N depend upon the angle that
the internuclear vector makes with respect to the applied field. In
isotropic solution, the dipolar coupling averages to zero. However,
if a material that aligns in the magnetic field (e.g., phage particles
or liquid crystal) is added to the sample, collisions between the
aligned additive and protein molecules introduce a slight orientational
preference in the molecular tumbling. This preference reintroduces
the 1H–15N dipolar coupling in a attenuated
form known as a residual dipolar coupling (RDC).[34] RDCs are detected as a modulation of the 1H–15N J-coupling, with the sign and magnitude
of the RDC dependent upon the average orientation of the N–H
bond vector with respect to the applied magnetic field. By fitting
measured RDCs to an alignment tensor, possible orientations of the
N–H bond in the molecular frame of reference are obtained.
RDCs can be used as restraints in molecular dynamics (MD) simulations,
a process we call “soft annealing”, to provide an ensemble
average of the orientations of secondary structures in the molecular
frame.RDCs for CYP101A1 amide N–H pairs measured with
and without
bound substrate were used as restraints in MD simulations,[35,36] resulting in the first noncrystallographic P450 structures in the
PDB (entries 2L8M and 2LQD).
Comparison of substrate-free and -bound structures showed some striking
conformational shifts. As predicted, hydrogen bonding patterns in
the K′ helix change, and this change is linked to an essential
collapse of the active site in the absence of substrate, due to inward
movements of the β3 sheet and B–B′ loop. Large
movements in the C, D loop were seen, as well as a rotation around
the long axis of the N-terminal half of the I helix (see Figure ). This rotation
results in changes in the hydrogen bonding pattern in the I helix
“kink”, a region of irregular hydrogen bonding near
the heme that forms the O2 binding site.[37] We used mutagenesis to confirm that the conformational
changes we observed were not artifacts of the methodology but realistic
representations of substrate-dependent conformational selection.[38]
Enzyme Control of Regio- and Stereochemistry
An important
insight from our NMR-derived structures was that CYP101A1 secondary
structures do not fray in the absence of substrate but move as units:
Structural reorganization is the result of changes in relative orientations
or displacements of secondary structural features when substrate is
bound, not local unfolding. This fits with our observation that the
biggest substrate-dependent spectral changes are found at junctions
between secondary structures (turns and loops) rather than within
the structures themselves. To probe the role that motions of secondary
structures play in determining the regio- and stereochemistry of the
CYP101A1-catalyzed oxidation, Eliana Asciutto at UNSAM/CONICET (Buenos
Aires) performed a series of MD simulations starting from 2L8M and 2LQD and analyzed the
resulting dynamics tracks using perturbation response scanning (PRS),
a means of determining the sensitivity of one region of the protein
to structural fluctuations elsewhere.[39] She detected several long-range normal mode displacements that are
active in substrate-bound CYP101A1 but inactive in the substrate-free
enzyme. These modes lie essentially along the longest diagonals of
the P450 structure (Figure ), implying that the entire enzyme structure is involved in
maintaining substrate position, not just those residues contacting
substrate in the active site. Conversely, modes active in the absence
of substrate are weaker and more localized.
Figure 5
Long-range coupled motions
(normal modes) associated with substrate
binding and orientation in cytochrome P450cam identified
by NMR-directed molecular dynamics simulations, as described in ref (39). Residues highlighted
in orange represent the end points of normal modes active when substrate
(d-camphor, shown in red) is bound, while those in
yellow correspond to shorter range motions active in the absence of
substrate. Parenthetic labels refer to nearby secondary structures
(see Figure ).
Long-range coupled motions
(normal modes) associated with substrate
binding and orientation in cytochrome P450cam identified
by NMR-directed molecular dynamics simulations, as described in ref (39). Residues highlighted
in orange represent the end points of normal modes active when substrate
(d-camphor, shown in red) is bound, while those in
yellow correspond to shorter range motions active in the absence of
substrate. Parenthetic labels refer to nearby secondary structures
(see Figure ).Second, she found that by including
a dihedral restraint that rotates
substrate away from its preferred orientation relative to the heme
plane, dramatically different PRS patterns were obtained. Even changes
in the dihedral restraint as small as 10° results in different
PRS response patterns (Figure ). Based on these observations, we conclude that the long-range
normal modes activated by substrate binding are critical for determining
substrate orientation relative to the active Fe=O complex and
enforce the regio- and stereochemistry of the ultimate hydroxylation.
These conclusions have obvious implications for rational enzyme engineering,
a current area of interest for our group.
Figure 6
Heat maps showing the
sensitivity of perturbation response patterns
(normal modes) as a function of amino acid sequence to orientation
of substrate d-camphor in the active site of CYP101A1.
Most sensitive regions are the brightest. Top and bottom maps differ
in substrate orientation by 10°. See text and ref (39) for details.
Heat maps showing the
sensitivity of perturbation response patterns
(normal modes) as a function of amino acid sequence to orientation
of substrate d-camphor in the active site of CYP101A1.
Most sensitive regions are the brightest. Top and bottom maps differ
in substrate orientation by 10°. See text and ref (39) for details.
Can What We Learn from CYP101A1 Be Applied
to Other P450 Enzymes?
In many respects, CYP101A1 is an outlier
in the P450 superfamily.
The requirement for Pdx as an effector is unusually specific: Many
P450s exhibit turnover with a noncognate redox partner. The almost
complete shift to high spin upon substrate binding observed for CYP101A1
is also atypical, although evidence suggests that all P450s require
some degree of high spin formation upon substrate binding,[40] and the effector requirement is, if not universal,
quite general. Other observations that we have made along the way
(the conserved K′ helix, for example, or the common occurrence
of an Xaa–Pro at the N-terminal end of the B′ helix)
suggest that there are other commonalities shared among members of
the P450 superfamily.To test whether our conclusions are generalizable
to other P450s,
we characterized MycG, a P450 from the biosynthetic pathway of mycinamicin
II (M-II), a macrolactone antibiotic.[41] MycG is slightly smaller than CYP101A1, (398 vs 414 residues) and
sequence identity between CYP101A1 and MycG is low (∼23%).
Their substrates differ dramatically in structure, size, and hydrophobicity.
Furthermore, MycG is bifunctional, catalyzing the last two steps of
M-II biosynthesis, a hydroxylation followed by an epoxidation (Figure ), both of which
are regio- and stereoselective.[42] Crystallographic
structures of MycG with M-IV bound showed an open active site, with
M-IV not suitably oriented for the observed chemistry, so further
rearrangements must take place prior to hydroxylation.
Figure 7
Sequential oxidations
catalyzed by MycG in the biosynthesis of
mycinamicin II (M-II).
Sequential oxidations
catalyzed by MycG in the biosynthesis of
mycinamicin II (M-II).After sequential assignment of MycG, RDCs were used to calculate
solution structural ensembles for the M-IV-bound enzyme. Paramagnetic 1H relaxation rates were used as distance restraints for M-IV
orientation in the active site.[43] Comparison
of the MycG crystallographic (PDB entry 2Y98) and RDC-based structures (PDB entry 5UHU) showed that the
orientation and placement of substrate M-IV differs drastically between
the two. In the crystal, much of the M-IV molecule lies between the
F and G helices that form the “cap” of the P450 active
site, in an orientation perpendicular to the heme plane, while in
solution, M-IV lies roughly parallel to the heme (Figure a). The orientation and placement
of M-IV between the F and G helices in the crystal is reminiscent
of that of an inhibitor, quinidine, in the crystal structure of CYP2D6,
a humanP450.[44] This leads us to speculate
that the crystallographic position of M-IV is an inhibitory binding
mode (Figure b). High
substrate concentrations are inhibitory for many P450 enzymes, including
MycG.[43] Binding of M-IV in this mode might
inhibit movements of the F–G helix structures necessary for
appropriate substrate binding or catalysis. Furthermore, this “locked”
conformer may crystallize readily due to its compactness.
Figure 8
(A) Mycinamycin
IV (M-IV) binding modes in the crystallographic
(2Y98, purple)
and NMR-based structures (5UHU, green). (B) Inhibitor quinidine binding mode in CYP2D6
structure (4WNU).
(A) Mycinamycin
IV (M-IV) binding modes in the crystallographic
(2Y98, purple)
and NMR-based structures (5UHU, green). (B) Inhibitor quinidine binding mode in CYP2D6
structure (4WNU).
Conformational Changes upon Substrate Binding
in Two Different
P450 Enzymes
We then went on to characterize solution conformations
of substrate-free
MycG. Again using RDCs, we compared the conformational changes upon
M-IV binding to MycG with those observed for camphor binding to CYP101A1.[45] While some specific differences were noted,
those differences can be rationalized in terms of substrate size and
coupled motions needed to accommodate substrate. More striking were
the similarities: In both cases, changing hydrogen bond patterns in
the K′ helix is critical for modulating the position of the
β3 sheet appropriately for substrate contacts in the active
site, and overall, the same sense of coordinated displacements of
the A and B helices and β-rich regions are seen.A complementary
observation was that the regions least perturbed
upon substrate binding in both enzymes are the same: these
include the J, K, and L helices and the irregular secondary structure
(the “β-meander”) that contains the axial heme
thiolate ligand cysteine residue. Furthermore, while overall sequence
homology is low between the two enzymes, homology is highest in the
unperturbed regions (63% identity, 78% sequence similarity, and no
gaps). The conservation suggests that these regions represent the
“core” structural features that must be conserved in
order to safely activate molecular oxygen, regardless of substrate.[45]
What’s Next? The Membrane Barrier
and Eukaryotic P450
Enzymes
The similarities we see between substrate binding
in MycG and that
in CYP101A1 are suggestive of commonalities in structure/function
relationships in the P450 superfamily, but hardly proof. There are
over 600 000 genes in GenBank assigned to P450s, and their
sheer number is indicative of the multitude of substrate/product combinations
that P450 catalysis encompasses. Characterization of even a small
fraction of these enzymes is inconceivable. More efficient is to characterize
a representative of each class, as determined by substrate–product
types. CYP101A1 provides an example of a catabolic enzyme with a small
hydrophobic substrate, while MycG binds a large mesophilic substrate
as part of a biosynthetic pathway. We are currently investigating
another enzyme, P450meg (CYP106A2), which hydroxylates
steroidal substrates, making this enzyme of interest for drug manufacturing.[46] Furthermore, steroid oxidations are among the
roles that P450s play in higher organisms. The elaboration and modification
of cholesterol to biosynthesize most steroid hormones involve P450s
at some stage. These P450s are targets for chemotherapy in steroid-responsive
cancers, and an understanding of how steroidal substrates are bound
and oriented in the active sites of the relevant P450s has implications
for drug design. We are currently working on several human P450s,
including CYP17A1, CYP2D6, and CYP3A4, in collaboration with Emily
Scott’s group at the University of Michigan. Her group has
shown that the human steroidogenic CYP17A1 is amenable to solution
NMR,[47,48] and together we have now made ∼50%
of the amide backbone N–H assignments for that enzyme.Eukaryotic P450s present greater challenges than their bacterial
counterparts. Heterologous expression of eukaryotic P450s can be difficult,
often requiring chaperone coexpression for proper folding. They are
usually larger than bacterial P450s, resulting in increased spectral
complexity. But, more importantly, eukaryotic P450s contain an N-terminal
helical membrane binding domain, and often display hydrophobic residues
on their surfaces to facilitate interaction with membranes. This can
lead to oligomerization, with concomitant increased molecular weight
and shorter coherence lifetimes. Furthermore, it seems likely that
active site/membrane interactions are critical for determining active
site geometry and substrate orientation.[49] However, with advances such as the use of nanodisc membrane models
to facilitate solution NMR examinations of membrane proteins,[50,51] we foresee that these problems can be solved, and NMR will continue
to offer further important insights into P450 enzyme structure and
dynamics.It is important to consider the strengths and limitations
of any
methodology when drawing functional conclusions from a structure.
While crystallography offers atomic-level details that no other structural
method can match, it should be remembered that even the most detailed
still-life portrait can only suggest motion. NMR can fill in some
of the gaps and provide a fourth dimension to your enzyme’s
structure.
Authors: Milka Kostic; Susan Sondej Pochapsky; John Obenauer; Huaping Mo; Gina M Pagani; Robert Pejchal; Thomas C Pochapsky Journal: Biochemistry Date: 2002-05-14 Impact factor: 3.162
Authors: I Schlichting; J Berendzen; K Chu; A M Stock; S A Maves; D E Benson; R M Sweet; D Ringe; G A Petsko; S G Sligar Journal: Science Date: 2000-03-03 Impact factor: 47.728
Authors: Manuel Quiroz; Molly M Lockart; Mohamed R Saber; Shaik Waseem Vali; Lindy C Elrod; Brad S Pierce; Michael B Hall; Marcetta Y Darensbourg Journal: Proc Natl Acad Sci U S A Date: 2022-06-13 Impact factor: 12.779
Authors: Jacob W McCabe; Michael J Hebert; Mehdi Shirzadeh; Christopher S Mallis; Joanna K Denton; Thomas E Walker; David H Russell Journal: Mass Spectrom Rev Date: 2020-07-01 Impact factor: 10.946