Literature DB >> 23641937

Quantum mechanics/molecular mechanics modeling of regioselectivity of drug metabolism in cytochrome P450 2C9.

Richard Lonsdale¹, Kerensa T Houghton, Jolanta Żurek, Christine M Bathelt, Nicolas Foloppe, Marcel J de Groot, Jeremy N Harvey, Adrian J Mulholland.

Abstract

Cytochrome P450 enzymes (P450s) are important in drug metabolism and have been linked to adverse drug reactions. P450s display broad substrate reactivity, and prediction of metabolites is complex. QM/MM studies of P450 reactivity have provided insight into important details of the reaction mechanisms and have the potential to make predictions of metabolite formation. Here we present a comprehensive study of the oxidation of three widely used pharmaceutical compounds (S-ibuprofen, diclofenac, and S-warfarin) by one of the major drug-metabolizing P450 isoforms, CYP2C9. The reaction barriers to substrate oxidation by the iron-oxo species (Compound I) have been calculated at the B3LYP-D/CHARMM27 level for different possible metabolism sites for each drug, on multiple pathways. In the cases of ibuprofen and warfarin, the process with the lowest activation energy is consistent with the experimentally preferred metabolite. For diclofenac, the pathway leading to the experimentally observed metabolite is not the one with the lowest activation energy. This apparent inconsistency with experiment might be explained by the two very different binding modes involved in oxidation at the two competing positions. The carboxylate of diclofenac interacts strongly with the CYP2C9 Arg108 side chain in the transition state for formation of the observed metabolite-but not in that for the competing pathway. We compare reaction barriers calculated both in the presence and in the absence of the protein and observe a marked improvement in selectivity prediction ability upon inclusion of the protein for all of the substrates studied. The barriers calculated with the protein are generally higher than those calculated in the gas phase. This suggests that active-site residues surrounding the substrate play an important role in controlling selectivity in CYP2C9. The results show that inclusion of sampling (particularly) and dispersion effects is important in making accurate predictions of drug metabolism selectivity of P450s using QM/MM methods.

Entities: Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2013 PMID： 23641937 PMCID： PMC3670427 DOI： 10.1021/ja402016p

Source DB: PubMed Journal: J Am Chem Soc ISSN： 0002-7863 Impact factor: 15.419

Introduction

The cytochrome P450 family of heme monooxygenase enzymes (P450s) plays an important role in the metabolism of drugs.[1,2] Potentially harmful complications can sometimes occur during P450-mediated metabolism, such as drug–drug interactions and formation of toxic metabolites. Most drug molecules contain several sites that are susceptible to P450-mediated oxidation, and the site of oxidation may determine whether a toxic metabolite is formed. A detailed understanding of the mechanisms that govern drug metabolism processes at the atomic level is vital to accurately predict metabolites of new pharmaceutical compounds, which will help to eliminate potentially harmful leads at an early stage. Many in silico methods have been developed for the prediction of P450 metabolite formation; these generally consist of four types: ligand-, structure-, rule-, and reactivity-based methods.[3,4] Calculations on small models with QM methods fall into the latter category and can be useful in aiding in the understanding of how P450 enzymes work at the atomic level without the need for experimental data for parametrization/fitting, and can provide information for use in, e.g., quantitative structure–activity relationships (QSARs). For example, a structure–activity relationship was determined from the reaction barriers calculated for a range of substituted benzene molecules in a QM-only model of the active oxidizing species (Compound I, Cpd I).[5] While model compounds can be useful in determining general chemical reactivity, such models often omit important details, such as steric effects due to the drug molecule interacting with the residues surrounding the active site. These effects are often important in determining at which site in a drug molecule oxidation occurs.[6] Hybrid quantum mechanics/molecular mechanics (QM/MM) calculations surmount this problem, as the steric and electrostatic environment of the enzyme can be included in the calculation, hence providing a more accurate picture of reactivity than that possible with calculations performed for systems in vacuo.[7,8] In previous work, we demonstrated that P450 chemoselectivity (e.g., hydroxylation versus epoxidation) can be predicted by comparing QM/MM reaction barriers to possible competing pathways.[6,9] The favored pathway will usually be that with the lowest activation barrier to oxidation by Cpd I. For example, the selectivity in the metabolism of dextromethorphan by CYP2D6 was studied using QM and QM/MM methods.[6] Many pharmaceutical compounds, such as dextromethorphan, contain multiple functional groups that are susceptible to P450 oxidation, in the case of dextromethorphan an aromatic ring, methoxy group, and N-CH3 group. QM calculations on a model system (comprised of anisole to represent the substrate, and a porphyrin group representation of Cpd I) predict that aromatic hydroxylation and oxidation of the methoxy group should be competitive. Indeed, aromatic oxidation is frequently observed in P450 metabolism of other substrates containing anisole fragments. In contrast, QM/MM (B3LYP/CHARMM27) calculations predict a much lower barrier to oxidation of the methoxy group, consistent with the fact that aromatic oxidation is not observed experimentally in this case. Thus far, there have been few published QM/MM studies of large-molecule oxidation in human P450 isoforms.[6,10] The formation of dopamine in CYP2D6, which occurs via aromatic hydroxylation of tyramine, has been studied with QM/MM at the B3LYP/CHARMM level.[10] The authors observed that the mechanism favored by QM/MM calculations was not favored in a QM-only model, which reflects the influence of the protein environment on the reactivity of Cpd I. The stereoselectivity of hydrogen abstraction from S-(−)-nicotine in CYP2A6 has been studied using QM/MM and free energy methods.[11] The authors observed that the barriers to abstraction of the hydrogen atoms cis and trans to the N-methyl group were similar; however, a more favorable free energy of binding (calculated using MM-PBSA) was observed for a binding position in which the trans hydrogen atom is placed close to the Cpd I oyxgen, in agreement with the experimentally observed stereoselectivity. In this study, we describe QM and QM/MM modeling of the oxidation of three widely used pharmaceutical compounds by CYP2C9, namely diclofenac (1), S-ibuprofen (2), and S-warfarin (3, see Figure 1). We reveal key mechanistic features of these reactions and show that inclusion of the enzyme surrounding the active site is important for rationalizing experimentally observed selectivity. These features are also likely to be important for making predictions of metabolism for new drugs. We have developed and demonstrate here a successful protocol for this type of modeling application, in particular showing the importance of including conformational sampling and dispersion in DFT-based QM/MM calculations.

Figure 1

Chemical structures of the drug molecules studied in this work: diclofenac (1), S-ibuprofen (2), and S-warfarin (3). Atoms numbered according to convention used in the current work. Diclofenac and ibuprofen were modeled in the QM/MM calculations here as the carboxylate (negatively charged) forms.

Ibuprofen

Ibuprofen (2) is a commonly used non-steroidal anti-inflammatory drug (NSAID) and acts as a nonselective inhibitor of cyclooxygenases 1, 2, and 3.[12,13]In vivo, 60% of R-ibuprofen undergoes conversion to S-ibuprofen.[14−16] Ibuprofen is oxidized primarily by CYP2C9, with a small amount oxidized by CYP2C8.[17,18] There are three metabolites of S-ibuprofen formed by CYP2C9: S,R- and S,S-3-hydroxyibuprofen are the major metabolites (4 and 5, Figure 2),[17] and S-2-hydroxyibuprofen (6, Figure 2) is formed as a minor metabolite.

Figure 2

Chemical structures of the metabolites formed during oxidation of S-ibuprofen by CYP2C9: major metabolites S,R-3-hydroxyibuprofen (4) and S,S-3-hydroxyibuprofen (5), and minor metabolite S-2-hydroxyibuprofen (6).

Diclofenac

Diclofenac (1) is an NSAID widely used in the treatment of rheumatoid arthritis and post-operative pain. It contains two aromatic rings bearing different substituents (Figure 1). CYP2C18 and CYP2C19 can oxidize either of the two rings, giving rise to a mixture of C4′- and C5-hydroxylated diclofenac (7 and 8 in Figure 3, respectively).[19] CYP2C9 and CYP3A4 exhibit very strict regioselectivities: CYP2C9 produces 4′-hydroxydiclofenac and 3′-hydroxydiclofenac,[20] and CYP3A4 produces 5-hydroxydiclofenac exclusively.[19] (Figure 3) These selectivity differences are observed despite a very similar electronic structure for the active oxidizing species (Cpd I) in these two enzymes.[21]

Figure 3

Chemical structures of the metabolites formed during oxidation of diclofenac. 4′-Hydroxydiclofenac (7) is the major metabolite formed during oxidation by CYP2C9. 5-Hydroxydiclofenac (8) is the major metabolite formed during oxidation by CYP3A4.

Diclofenac oxidation has been studied using H2O2 and BuOOH as oxidizing agents in the presence of an Fe-porphyrin catalyst.[19] The authors concluded that C5 is the chemically preferred site of oxidation for diclofenac. Chemical structures of the metabolites formed during oxidation of diclofenac. 4′-Hydroxydiclofenac (7) is the major metabolite formed during oxidation by CYP2C9. 5-Hydroxydiclofenac (8) is the major metabolite formed during oxidation by CYP3A4.

Warfarin

S-Warfarin (3) is a widely used anti-coagulant and undergoes hydroxylation by CYP2C9 at the 7- and 6-positions in a 3:1 ratio (Figure 4).[22,23]S-Warfarin also undergoes hydroxylation at the 4′ carbon, to a smaller extent, by CYPs 2C8, 2C18, and 2C19.[23,24] The R-isoform of warfarin is predominantly metabolized by CYP3A4.[23] Warfarin has an exceptionally low therapeutic index; i.e., the therapeutic amount required is very close to the amount that will cause internal hemorrhaging.[25] Warfarin dosage is further complicated by drug–drug interactions, where the metabolic clearance of warfarin can be slowed down by the presence of other compounds.

Figure 4

Chemical structure of S-7-hydroxywarfarin (9) and S-6-hydroxywarfarin (10), the major and minor metabolites formed during oxidation of S-warfarin by CYP2C9. These structures correspond to the open form of S-warfarin.

In solution, warfarin exists in an equilibrium between the open form (3 shown in Figure 1) and a diastereomeric pair of ring-closed hemiketals. The hemiketal form is favored in organic solvents; however, in aqueous solution and at physiological pH the open-side-chain form is preferred.[26] In the crystal structure of CYP2C9 with S-warfarin bound, warfarin is in the open-side-chain form; hence, this is the form that is modeled in the present study.[27] Chemical structure of S-7-hydroxywarfarin (9) and S-6-hydroxywarfarin (10), the major and minor metabolites formed during oxidation of S-warfarin by CYP2C9. These structures correspond to the open form of S-warfarin.

Computational Details

X-ray Crystal Structures

Three crystal structures of human CYP2C9 were available in the Protein Data Bank when this work was initiated. The 1OG2 and 1OG5 structures correspond to the apo form and a complex containing S-warfarin, respectively.[27] Warfarin is bound at a considerable distance from the heme unit, with the hydroxylation site ∼10 Å from the heme iron. The authors used a docking technique to illustrate that it should be possible to bind two warfarin molecules simultaneously, with the putative second binding site located close to the heme group. This suggests that CYP2C9 might accommodate more than one substrate simultaneously. The 1OG2 and 1OG5 crystal structures do not provide any support for the existence of a previously suggested anionic binding site that would explain the preference of CYP2C9 toward negatively charged substrates.[28−30] The side chains of Arg105 and Arg108, previously proposed to be involved in the binding of the anionic substrates, were found to be pointing away from the active site. The 1R9O structure of CYP2C9 contains the drug flurbiprofen bound directly above the heme group in a position favoring oxidation.[31] There are significant differences between the 1R9O and 1OG5 crystal structures, most notably the conformation of the Arg108 side chain, which points in toward the active site in 1R9O, and hydrogen-bonds to the carboxylic acid group of flurbiprofen.[32] Indeed, mutation of Arg108 affects the binding of flurbiprofen and leads to decreased oxidation of S-warfarin and diclofenac.[32,33] As the 1R9O crystal structure was obtained at a higher resolution, has a less altered protein sequence, and exhibits features corresponding to binding of substrates with anionic groups, such as ibuprofen and diclofenac, it was selected as the starting point for all of the MD simulations (and subsequent QM/MM calculations) presented in this study.

Structure Preparation for MD Simulations

All preparatory calculations were performed with the CHARMM program version c27b2, with the CHARMM22 forcefield.[34] The 1R9O crystal structure was used,[31] for the reasons described above. The Cpd I state of CYP2C9 was modeled, with the coordinates of the ferryl oxygen taken from a crystal structure of CYP101 (P450, PDB code 1DZ9).[35]S-ibuprofen and diclofenac were built using the molecular builder in Quanta 98. The coordinates of S-warfarin were taken from the 1OG5 crystal structure and hydrogen atoms were added. S-warfarin was docked into the active site of CYP2C9 using AUTODOCK 3.[36] Deprotonated forms of S-ibuprofen and diclofenac were docked in the active site of the enzyme by hand, by superposition with flurbiprofen, before deleting the latter from the active site. Atomic charges for the CHARMM22 forcefield were obtained for all three substrates by fitting to electrostatic potential charges calculated using the B3LYP density functional[37−40] and 6-31G(d) basis set[41] in Jaguar 5.0.[42] The raw charges resulting from the electrostatic potential fitting were edited in order to maintain a consistency with charges for related atoms in the CHARMM22 forcefield. The bonded parameters were chosen such as to be consistent with the CHARMM22 forcefield. The topology files and additional parameters for the substrates are provided in the Supporting Information. Histidine tautomers were assigned using the optimal hydrogen-bonding network analysis method available in the WHATIF web interface.[43] Hydrogen atoms were added using the HBUILD module of CHARMM, according to pKa values calculated using PROPKA.[44] The protein was truncated to within 25 Å of the heme iron. Charged residues (Asp, Lys, Glu, and Arg) located more than 20 Å from the center of the system (i.e., the heme iron) were neutralized. The positions of the hydrogen atoms were energy minimized using 1000 steps of steepest descent (SD) and 500 steps of conjugate gradient (CG) minimization. The protein was solvated in a box of pre-equilibrated TIP3P water molecules.[45] Water molecules farther than 25 Å from the heme iron were deleted, together with all water molecules whose oxygen atom was 2.6 Å or closer to any heavy atom in the system. The positions of the water molecules were energy minimized using 1000 steps of SD, followed by 500 steps of CG, while keeping all other atoms fixed. Stochastic boundary molecular dynamics (SBMD) were performed on all water molecules (with all other atoms held frozen) with a buffer zone beyond a radius of 20 Å. The system was heated from 0 to 300 K over 1 ps and then equilibrated for 25 ps (with a time step of 1 fs). A friction coefficient of 62 ps–1 was applied to all water oxygen atoms. All atoms were then optimized with 1500 steps of SD and 1500 steps of ABNR. In order to prevent distortion of Cpd I, the heme heavy atoms were fixed to their initial positions during all minimizations and MD simulations.[46] The protein backbone heavy atoms were restrained to their initial positions with a force constant of 1.7 kcal mol–1 Å–2. The atoms in the buffer region (>20 Å from the Cpd I Fe atom) were harmonically restrained with force constants gradually increasing with distance from the center of the system. The optimized structure was used as the starting point for SBMD simulations detailed below.

Molecular Dynamics Simulations

Sampling is of great importance when modeling selectivity in enzymes.[9,47,48] Methods such as QM/MM umbrella sampling simulations[48−50] require many millions of energy and gradient evaluations and are hence only routinely feasible in QM/MM calculations using relatively low levels of QM theory, such as semiempirical methods. Such methods are not currently able to reliably model P450 oxidation reactions and hence are not suitable for modeling selectivity for these enzymes. Use of empirical valence bond methods[51] would require significant parametrization of the reactant and product states for each drug that is modeled. Hence, in order to reduce the amount of parametrization, and to approximate to a sufficient amount of sampling, multiple QM/MM potential energy profiles have been calculated using different starting points from MM-based MD simulations (described below). MD simulations were performed in CHARMM with stochastic boundary conditions. The same buffer region and harmonic restraints as detailed in the previous section were used in the SBMD simulations. Each system was equilibrated for 1 ns, prior to MD production runs of a minimum of 5 ns. The root-mean-squared deviation (RMSD) of the enzyme backbone heavy atoms from their positions in the crystal structure was used to determine the point in the simulation at which the enzyme was sufficiently equilibrated for structures to be selected for QM/MM calculations. Only once the value of the RMSD had begun to fluctuate around a constant value were structures considered for QM/MM calculations. In order to obtain structures that were suitable for QM/MM reaction modeling, i.e., with the site of metabolism close enough to react with Cpd I, the addition of harmonic restraints between the substrate and Cpd I was necessary during selected MD simulations. For the generation of starting structures for QM/MM modeling of hydrogen abstraction from the C2 position of ibuprofen, a distance restraint of 2.7 Å (with a force constant of 100 kcal mol–1 Å–2) was applied between the Cpd I oxygen and the hydrogen atom attached to C2. A distance restraint was not necessary for the generation of starting structures for hydrogen abstraction from C3 of ibuprofen, nor for the hydroxylation of diclofenac at the C4′ or C5 positions. When modeling the hydroxylation of warfarin, preliminary QM/MM calculations revealed the necessity to constrain the position of warfarin (with respect to Cpd I), during the MD, to one which was similar to that observed for the transition state (TS) to C–O bond formation. In the absence of these restraints, the barriers were found to be unrealistically high, due to breakage of the Fe–S bond. In the MD simulations of warfarin in CYP2C9 an harmonic restraint with force constant of 2000 kcal mol–1 Å–2 was applied with an equilibrium distance of 2.0 Å between the C6 or C7 carbon atom of warfarin and the Cpd I oxygen, when modeling C6 or C7 hydroxylation, respectively.

Selection of Structures for QM/MM Calculations

Analysis of the MD simulations of ibuprofen and diclofenac revealed that many different orientations of the substrates were observed during the simulations. It has been found previously that calculating QM/MM profiles from MD structures in which the two reacting species are at large initial distances yields high activation energy barriers that are not representative of the true reactivity of the enzyme. Therefore, structures for QM/MM modeling were selected such that the orientation of the substrate in the active site was as close to the expected TS geometry as possible. MD structures were pre-screened on the basis of certain geometric criteria, relating to the proximity of the substrate to Cpd I, using a similar procedure to that used previously.[9] Assuming that the “reactive” conformations make up a significant proportion of the MD simulation, it is reasonable to select such “reactive” conformations as the starting point for QM/MM calculations. As is discussed below, even when such selection is performed, a wide range of barriers is observed for a given reaction. A Boltzmann-weighted averaging procedure (as shown in eq 1) has been applied to calculate the average barrier, as this will favor the lower-energy barriers that will dominate the experimental reactivity. This approach has been applied previously for modeling the oxidation of alkenes in P450.[9] The mathematical expression used for averaging does not take into account the different energies of the initial conformations for each pathway, as these are typically quite different, so that Boltzmann weighting would include a contribution only from the lowest energy initial conformation unless a very large number of pathways were studied. The fact that each conformation was visited in an MD simulation suggests they are all of reasonable free energy, and the conclusions were checked and found to be robust toward removing the lowest value of ΔE⧧ from the averaging procedure. The geometry criteria that were used to select starting structures for QM/MM modeling of ibuprofen and diclofenac (from the MD simulations described above) are summarized in Figure 5. Starting structures for modeling oxidation of ibuprofen were chosen such that the Fe–O–C(x) angle was in the range 110–130° and the Fe–H(x) distance was less than 3.5 Å, where H(x) is the hydrogen atom undergoing abstraction by Cpd I and C(x) is the carbon atom on the substrate to which this hydrogen is bonded. Starting structures for QM/MM modeling of diclofenac oxidation were selected such that the Fe–O–C(x) angle was in the range 110–130° and the Fe–C(x) distance was less than 4.0 Å, where C(x) is the substrate aromatic carbon that undergoes C–O bond formation with Cpd I (see below). No selection criteria were applied to the choice of starting structures for the modeling of warfarin oxidation, as the harmonic restraint present during the MD simulations (with the shorter O–C distance) yielded many structures close to the TS—these were selected at random for QM/MM calculations.

Figure 5

Criteria used for selection of MD structures for QM/MM modeling of (a) aliphatic hydroxylation and (b) aromatic hydroxylation by CYP2C9.

QM/MM Calculations

QM/MM calculations were performed with the QoMMMa program,[52] in which QM calculations were performed using Jaguar (version 5.5)[42] with the B3LYP density functional.[37−39] The MM part of the QM/MM calculations was performed in Tinker using the CHARMM27 all atom forcefield.[34] The MM point charges were included in the QM Hamiltonian. The valences of the QM atoms at the QM/MM boundary were satisfied using the “link atom” method (i.e., the addition of hydrogen atoms).[53] The charges of the MM atoms at the QM/MM boundary were set to zero to avoid unphysical interactions with the link atoms, and the residual charge was shifted onto the adjacent MM atoms. This QM/MM setup procedure has been used previously and has been shown to perform well for similar systems.[9,21,46,54,55] An empirical dispersion correction was applied to the QM energy and gradient in all calculations.[56] For geometry optimization the 6-31G(d) basis set[41] was used for all atoms with the exception of Fe, for which the Los Alamos effective core potential was used (LACVP) (referred to herein as BSI).[57] Single point energies were calculated using the LACV3P[57] and 6-311++G(d,p) basis set combination (BSII).[58−61] The QM region consisted of Cpd I, modeled as a truncated heme with all ring substituents replaced by hydrogen atoms, shown in Figure 6a, with the proximal cysteine represented by methyl mercaptide. For QM/MM calculations of ibuprofen and diclofenac, the entire substrate molecule was included in the QM region. The deprotonated form of diclofenac was used in QM/MM calculations, as this is the expected form at physiological pH. For QM/MM calculations of warfarin, the substrate molecule was split into QM and MM regions, as shown in Figure 6b.

Figure 6

(a) QM representation of Cpd I used in QM and QM/MM calculations. (b) Separation of S-warfarin into QM and MM regions in QM/MM calculations.

(a) QM representation of Cpd I used in QM and QM/MM calculations. (b) Separation of S-warfarin into QM and MM regions in QM/MM calculations. All residues and water molecules with at least one atom located within 5 Å of any substrate or heme atoms were included in the minimization; all other atoms were held fixed. With the exception of the initial geometry optimizations of the enzyme–substrate complexes, an harmonic restraint with a force constant of 1000 kcal mol–1 Å –2 was applied to keep the reaction coordinate fixed to the appropriate value. Reaction profiles were generated by defining the reaction coordinate as the O–X distance, where O is the Cpd I oxygen and X is the hydrogen atom undergoing abstraction for ibuprofen oxidation, C4′ or C5 for diclofenac oxidation, and C6 or C7 for warfarin oxidation. The geometry was optimized with QM/MM at various points along the reaction coordinate, leading from reactant complex to intermediate, via the TS. In the case of warfarin oxidation, where the starting structures were close to the TS, the reaction was modeled in the forward and reverse directions to intermediate and reaction complex, respectively. The reaction coordinate was varied by steps of 0.2 Å with the exception of the region close to the TS, where steps of 0.1 Å were used. The potential energy barrier, ΔE⧧, was calculated as the difference in energy between the reactant complex and the highest point on the potential energy surface (i.e., the approximate TS). The Boltzmann-weighted average barrier for a given process was calculated for all available energy barriers using the expression in eq 1.

QM Calculations

QM calculations were performed using the ORCA program (version 2.8). The LANLDZ basis set[62] was used for iron and 6-31G[63,64] for all other atoms in all geometry optimization calculations (BSIII), together with Grimme’s empirical dispersion correction.[56] Single-point energies were calculated using the SDD basis set[65] for Fe and the valence triple-ζ basis set developed by Ahlrichs et al. with added d,p polarization functions for all other atoms (BSIV).[66] The starting geometries for these gas-phase calculations were taken from the QM/MM geometries which produced the lowest potential energy barriers to hydroxylation. In the QM calculations of diclofenac, diclofenac was protonated at the carboxylate group, in order to prevent deprotonation of the N–H group by the carboxylate.

Results and Discussion

Aliphatic Hydroxylation

Aliphatic hydroxylation by P450s occurs via the hydrogen abstraction/rebound mechanism (Figure 7).[67] The hydrogen abstraction step has been found to be rate-limiting[68−73] and hence is the only step modeled here. Cpd I has two closely lying electronic states, a doublet and a quartet, both of which are believed to contribute to its reactivity, and hence both states are considered here. In model studies of hydrogen abstraction from methane, similar barriers were calculated for the doublet and quartet states.[68] A small barrier to the rebound step was found for the quartet state of Cpd I, but no barrier was observed for the same step on the doublet surface.

Figure 7

Rebound mechanism for aliphatic hydroxylation catalyzed by P450s. The first step is hydrogen abstraction, resulting in the formation of a substrate radical which then rebounds with the Fe-bound OH group to form the hydroxylated product. Six MD simulations of the ibuprofen/CYP2C9 complex were performed in order to relax the crystal structure of CYP2C9 and to generate energetically accessible starting structures for the QM/MM energy profiles. Three simulations were performed for oxidation at C2 and three simulations for oxidation at C3. Each subset of simulations was propagated from the same initial geometry, only the set of initial velocities assigned to the atoms was varied. In the latter set of simulations, no restraints were included between the substrate and Cpd I and the isopropyl group of the ibuprofen molecule sampled many different conformations within the active site. The remainder of the ibuprofen molecule sampled fewer conformations in these simulations due to the presence of a hydrogen bond between the carboxylate group of ibuprofen and the Arg108 side chain. In the simulations that were performed to generate structures for C2 hydroxylation, the addition of an harmonic restraint was necessary in order to generate a sufficient number of “reactive” starting structures for QM/MM. It has been postulated previously that the presence of a water molecule close to the Cpd I oxygen may lead to a lowering of the barrier to hydrogen abstraction in P450.[74] However, we did not observe any water molecules close to the Cpd I oxygen during any of the MD simulations, for ibuprofen or the other substrates. Given the relative size of the substrates, there is little room in the active site for entrance of solvent during the simulations. We have reported previously that the displacement of water from the active site by the substrate results in a more reactive Cpd I.[21] The isopropyl group of ibuprofen was positioned with all carbon atoms positioned at a distance of >5.0 Å from the Cpd I ferryl oxygen for the majority of one of the C3 MD simulations, and hence this simulation was not used to extract starting structures for QM/MM calculations. One of the simulations contained many more structures in which ibuprofen was in a “reactive” conformation, and hence this simulation was used to obtain QM/MM starting structures. In all three of the C3 MD simulations, the backbone RMSD continued to increase until after approximately 4 ns of simulation, hence all starting structures for QM/MM were obtained from after this point in the simulation, when we assume that the system has reached equilibrium. All three of the C2 MD simulations yielded many “reactive” conformations for hydroxylation by Cpd I at the C2 position, and hence all three of the C2 MD simulations were used for the generation of QM/MM starting structures. Example QM/MM-optimized geometries of reactant complexes for the hydrogen abstraction from C2 and C3 of ibuprofen in CYP2C9 are shown in Figure 8. The reactant complexes leading to the lowest energy barriers (see Table 1) were selected as examples. In the reactant complex for hydrogen abstraction from C3, there is a hydrogen bond between the substrate and the Arg108 side chain. This interaction is not present in the reactant complex for hydrogen abstraction at C2. The O–H distance is smaller in the reactant complex for hydrogen abstraction from C3, compared to C2 (2.253 and 3.017 Å, respectively). The Fe–O–C(3) angle in the C3 hydrogen abstraction reactant complex structure is significantly smaller than the Fe–O–C(2) angle in the C2 hydrogen abstraction reactant complex (122.8° and 138.9°, respectively).

Figure 8

Reactant complex structures for hydrogen abstraction from (a) C3 and (b) C2 of S-ibuprofen, calculated at the B3LYP(BSI)/CHARMM27 level of theory (corresponding to doublet profiles 2-3 and 3-1 in Table 1).

Table 1

Potential Energy Barriers, ΔE⧧ [in kcal/mol], for Hydrogen Abstraction from Ibuprofen at the C2- and C3-Positions, Calculated at the B3LYP-D(BSII)//B3LYP-D(BSI)/CHARMM27 Level of Theorya

	C3			C2
profile	²ΔE^⧧	⁴ΔE^⧧	profile	²ΔE^⧧	⁴ΔE^⧧
3-1	18.2	18.1	2-1	26.4	28.1
3-2	19.8	18.6	2-2	31.4	32.5
3-3	22.5	21.4	2-3	26.2	29.7
3-4	20.2	19.7
3-5	20.9	20.0
3-6	24.6	22.4
3-Ave_B	19.2	18.9	2-Ave_B	26.5	28.7

AveB corresponds to the Boltzmann-weighted average calculated over all pathways. The superscript 2 and 4 labels correspond to the doublet and quartet spin states of Cpd I.

QM and QM/MM barriers have been calculated for hydrogen abstraction from both the C2 and C3 atoms of S-ibuprofen. Six QM/MM energy profiles were calculated for abstraction from each carbon atom. The QM/MM energy barriers for hydrogen abstraction from ibuprofen are displayed in Table 1. Out of the six attempts to model abstraction from C2, only three resulted in profiles that successfully went from reactants to intermediate, via a TS. For the remaining three profiles, it was not possible to achieve a converged structure corresponding to the TS. All six of the C3 profiles resulted in converged energy profiles. Reactant complex structures for hydrogen abstraction from (a) C3 and (b) C2 of S-ibuprofen, calculated at the B3LYP(BSI)/CHARMM27 level of theory (corresponding to doublet profiles 2-3 and 3-1 in Table 1). AveB corresponds to the Boltzmann-weighted average calculated over all pathways. The superscript 2 and 4 labels correspond to the doublet and quartet spin states of Cpd I. The QM/MM barriers reported here are similar in magnitude for the doublet and quartet spin states of Cpd I; this finding is consistent with other QM/MM studies,[6] as well as small QM-only model studies of the reactivity of Cpd I (e.g., methane oxidation[68] and alkene oxidation[75]). The QM/MM barriers for C3 and C2 hydroxylation span the ranges 18.1–24.6 and 26.2–32.5 kcal/mol, respectively. It is not unusual to observe such a range in activation barriers for the same reaction calculated from different MD structures, and is due to the relative reactivities of different conformations.[9] It is expected that the pathways with lower barriers are more representative of the reaction taking place in the enzyme, if one assumes that all of the reaction complexes are thermally accessible. The lowest barriers are observed for C3 hydroxylation, and this reflects the experimental preference for the formation of 3-hydroxyibuprofen versus 2-hydroxyibuprofen. The barriers to hydrogen abstraction from C2 are higher than expected, as one would not expect formation of 2-hydroxyibuprofen on the basis of the large difference between the lowest C3 and C2 barriers (∼7 kcal/mol). As 2-hydroxyibuprofen is a minor product of metabolism in CYP2C9, the large barriers suggest that our model system somehow disfavors the minor pathway to a larger extent than is observed experimentally.[17] This may be due to incomplete sampling; it is possible that there is a conformation that favors C2 hydroxylation which was not accessed during the MD simulations described here. Preliminary QM/MM barriers calculated at the same level of theory in the absence of the dispersion correction were found to be substantially higher (∼10 kcal/mol) than their counterparts where dispersion was included. Hence it is apparent that inclusion of dispersion is important in these calculations. The QM barriers for hydrogen abstraction from C2 and C3 computed in vacuum are displayed in Table 2. The QM barriers predict the opposite selectivity to that predicted by the QM/MM calculations. The barriers to C2 hydroxylation are 16.3 and 14.8 kcal/mol for the doublet and quartet states of Cpd I, respectively. The barriers to C3 hydroxylation are 22.5 and 25.1 kcal/mol, hence C2 hydroxylation is favored by 7.7 kcal/mol in the gas phase. The barrier to C2 hydrogen abstraction is expected to be lower, reflecting that it requires less energy to abstract a hydrogen atom from a tertiary carbon atom, compared to a primary carbon. The QM barriers for C2 hydrogen abstraction are lower than those calculated in the enzyme, whereas the C3 barriers are higher than their QM/MM counterparts. As is discussed below, the C2 gas-phase TS geometry is very different to that optimized with QM/MM. Hence it is likely that the high energy barriers observed for C2 hydroxylation in CYP2C9 are due to the prevention of formation of a low-energy TS geometry by the steric effects of the enzyme active site.

Table 2

Gas-phase B3LYP-D(BSIV)//B3LYP-D(BSIII) Energies [in kcal/mol] for the Hydroxylation of S-Ibuprofen at the 2- and 3-Positionsa

site of oxidation	²E_RC	⁴E_RC	²E_TS	⁴E_TS	²ΔE^⧧	⁴ΔE^⧧
2	–14.6	–14.6	1.6	0.2	16.3	14.8
3	–13.8	–13.8	8.8	11.3	22.5	25.1

ERC and ETS correspond to the energies of the reactant complex and transition state, respectively, calculated relative to separate reactants. ΔE⧧ is the potential energy barrier, calculated as the difference between ERC and ETS. The superscript 2 and 4 labels correspond to the doublet and quartet spin states of Cpd I, respectively. The experimental rate constant for 3-hydroxylation of S-ibuprofen in CYP2C9, determined at 310 K, is 0.056 s–1.[76] According to the Eyring equation, this corresponds to a free energy barrier of 20.0 kcal/mol. The experimental reaction barrier should be treated as the upper bound, as the rate limiting step in the catalysis of CYP2C9 has not been established. The reaction barriers calculated in this work are not free energy barriers, as entropy contributions are not calculated. However, previous work suggests that entropy contributions for similar reactions are small (<2 kcal/mol),[48,77,78] and that predictions of reactivity, in good agreement with experiment, can be made on the basis of reaction enthalpies.[79,80] The lowest value calculated for hydrogen abstraction from C3 with QM/MM (18.2 kcal/mol) is below the upper limit suggested by the activation free energy derived from the experimental rate constant. A zero point vibrational energy correction of ∼4 kcal/mol has been calculated for hydrogen abstraction from other P450 substrates using similar methods.[9] This would bring the QM/MM calculated energy barrier to around 14 kcal/mol. C–H bond activation kinetics have been measured in the CYP-119 isoform with the substrate lauric acid.[81] In this study, an apparent rate constant of 1.1 × 107 s–1 was observed at 277 K. This corresponds to an energy barrier of ∼12 kcal/mol. Hence a barrier of 14 kcal/mol is likely to be representative of the true energy barrier to hydrogen abstraction from the C3 of ibuprofen. The TS structures for the hydrogen abstraction profiles 3-1 and 2-3 are shown in Figure 9. These profiles correspond to the lowest energy barriers for each reaction. There is a hydrogen-bonding interaction between one of the carboxylate oxygen atoms of ibuprofen and Arg108 in both TS structures. Arg108 has been identified as playing an important role in binding anionic substrates in CYP2C9.[32] In the TS for hydroxylation at C3, the carboxylate oxygen of ibuprofen and the side chain N–H group of Arg108 are in an orientation that would result in a stronger hydrogen-bonding interaction than that between the corresponding atoms for C2 hydroxylation. This is because the carboxylate group in the latter reaction is dragged further into the active site (i.e., closer to Cpd I) in order for the Cpd I oxygen to abstract the hydrogen from C2, as illustrated in Figure 9.

Figure 9

The TS structures for hydrogen abstraction from ibuprofen calculated in vacuo are shown in Figure 10. The orientations of ibuprofen relative to the heme in the gas-phase calculations are very different to those observed in the QM/MM structures (Figure 9). This observation is unsurprising given that in the QM/MM calculations, the orientation of ibuprofen is restricted by the residues surrounding the active site. In the gas phase, ibuprofen is less sterically hindered and can adopt an orientation in which the dispersion interaction between ibuprofen and the heme is maximized, as has been observed previously for other substrates.[75] Indeed, in previous work we observed that the effect of dispersion on optimized geometries is smaller in QM/MM than in QM calculations on P450 oxidation.[82]

Figure 10

Transition-state structures for hydrogen abstraction from C3 and C2 of S-ibuprofen, calculated in vacuo at the B3LYP-D/BSIII level of theory. The carbon atoms corresponding to C2 and C3 hydroxylation are in green and orange, respectively.

Transition-state structures for hydrogen abstraction from C3 and C2 of S-ibuprofen, calculated at the B3LYP-D(BSI)/CHARMM27 level of theory (corresponding to doublet profiles 2-3 and 3-1 in Table 1). The carbon atoms corresponding to C2 and C3 hydroxylation are in green and orange, respectively. Transition-state structures for hydrogen abstraction from C3 and C2 of S-ibuprofen, calculated in vacuo at the B3LYP-D/BSIII level of theory. The carbon atoms corresponding to C2 and C3 hydroxylation are in green and orange, respectively. There is a large difference in barrier between oxidation at C2, compared with C3 in vacuo (7 kcal/mol). This is due (in part) to the relative ease of removal of a hydrogen atom from a tertiary carbon atom, compared to that of a primary carbon atom. In addition to this, the angle formed between the two reactants at the TS results in better orbital overlap in the case of C2, compared with C3, as shown in Figure 11. The shape of the Fe–O π* is such that the most favorable angle of approach for the substrate is around 130° (in the case of ibuprofen, the Fe–O–C(x) angle). It is possible that the dispersion interactions between the heme and ibuprofen moieties do not allow for the C3 hydrogen atoms to form a TS in which optimal overlap can occur between the two reactants. In the enzyme, the amino acid residues surrounding the active site can anchor the substrate in a position such that a more favorable angle of approach is formed between the heme and ibuprofen, and hence a lower barrier to hydrogen abstraction is observed.

Figure 11

(a) Effect of angle of approach of substrate (Sub) to Cpd I on barrier to aliphatic and aromatic hydroxylation. Optimal orbital overlap leads to low barrier when angle of approach is approximately 130° (green). At larger angles of approach, non-optimal orbital overlap occurs, leading to higher barriers (red). (b) Orbital energy diagram for the first electron transfer step in the oxidation of substrate by Cpd I. The electron may transfer to either of the two singly occupied Fe–O π* orbitals. From the calculations described above, it is clear that in the case of oxidation of S-ibuprofen by CYP2C9, the effect of the enzyme environment is to determine the regioselectivity of oxidation, by restricting the orientation of the substrate relative to Cpd I, placing the C3 group at an optimal position and angle for hydroxylation to occur. The angle dependence of regioselectivity may play an important role in the automated prediction of P450 metabolites. The sensitivity of barrier to angle of attack may explain the necessity for careful structure selection in QM/MM calculations on P450s.

Aromatic Hydroxylation

P450-catalyzed aromatic hydroxylation is believed to occur via the addition/rearrangement mechanism (Figure 12), whereby hydroxylation proceeds via a tetrahedral σ-complex, which is formed upon addition of Cpd I to the aromatic substrate carbon atom. The addition/rearrangement mechanism is supported by observed isotope effects for the hydroxylation of deuterated chlorobenzenes, which are inconsistent with the previously suggested initial epoxide formation mechanism.[83] The addition of Cpd I to benzene and analogues has been studied in the gas phase,[5,84] as well as with QM/MM methods.[55,82] The tetrahedral σ-complex can have both radical- or cation-like character, depending on the extent of electron transfer from the aromatic substrate to the single-occupied orbitals of Cpd I.[5]

Figure 12

Addition/rearrangement mechanism for aromatic hydroxylation catalyzed by P450s.[5,83] The orbital energy levels are indicated schematically. The first step is C–O bond formation, resulting in the formation of a σ-adduct with either (a) cationic or (b) radical character. (c) The second step is rearrangement to form the hydroxylated product.

The effect of the orientation of the substrate with respect to Cpd I to the barrier to C–O bond formation has also been investigated.[5,55,82] The side-on (perpendicular) and face-on (parallel) orientations of benzene with respect to the Cpd I porphyrin were calculated both in the gas phase[5] and with QM/MM.[55,82] In the former case, a slight preference for side-on addition was observed,[5] whereas in the latter calculations, no distinct preference could be determined.[55,82] The tetrahedral intermediate can rearrange to form a phenol product directly, or via ketone or epoxide intermediates. The calculated barriers to these competing processes for benzene were similar, indicating that a mixture of these species is likely to be observed.[55] Two drugs have been studied here that are known to undergo aromatic hydroxylation in CYP2C9: S-warfarin and diclofenac. Both of these drugs contain several possible sites at which aromatic hydroxylation can occur; QM/MM calculations have been used here to investigate if the preferred site of metabolism can be rationalized on the basis of the relative barriers to C–O formation with Cpd I at each site. The mechanism for the hydroxylation of R- and S-warfarin has been studied experimentally by Bush et al. using deuterium-labeled substrates.[85] The relative amounts of deuterated products formed (and absence of an observed kinetic isotope effect) suggested that hydroxylation proceeds through an addition-rearrangement mechanism, prior to, or in the absence of, epoxide formation. This reactivity is believed to be dictated by the nature of Cpd I, whereas the regioselectivity of hydroxylation is believed to be dependent on the binding position of the substrate, which will be determined by the residues surrounding the active site. The majority of S-warfarin that is metabolized by CYP2C9 is converted to the 7-hydroxy product, with a small amount of 6-hydroxywarfarin formed.[23] Addition/rearrangement mechanism for aromatic hydroxylation catalyzed by P450s.[5,83] The orbital energy levels are indicated schematically. The first step is C–O bond formation, resulting in the formation of a σ-adduct with either (a) cationic or (b) radical character. (c) The second step is rearrangement to form the hydroxylated product. As mentioned above, MD simulations of S-warfarin and CYP2C9 were performed with an harmonic restraint between the substrate carbon undergoing C–O bond formation (C6 or C7) and the Cpd I oxygen. To keep the substrate close to Cpd I, in an orientation close to that expected for the TS, a large force constant value (2000 kcal mol–1 Å –2) was used. As a result, there was relatively little movement of the substrate in the active site, compared with the other substrates studied here. This is despite the fact that the protein and solvent were free to sample different conformations. QM/MM reaction profiles were calculated from selected MD structures, by increasing the C–O distance to locate the reactant complex, and decreasing the same distance to locate the TS and intermediate. There were several differences between the MD simulations that were performed for C6 and C7 hydroxylation. In the C6 MD simulation, a hydrogen bond was observed between the carboxylate group of Glu300 and the hydroxyl group located on the benzolactone ring of warfarin, for the entire simulation. In the C7 simulation this hydrogen bond was not present, and instead a bridging water molecule was located between the two groups. Additionally, in the C6 simulation, a hydrogen bond was observed between the Arg108 side chain and the carbonyl oxygen of the benzolactone ring of warfarin (O2 in Figure 1). In the C7 simulation, the average distance between the groups is larger than in the C6 simulation (average values of the Arg108 HH22–warfarin O2 distance 2.30 and 3.29 Å for the C6 and C7 simulations, respectively). Mutagenesis studies of CYP2C9 with S-warfarin revealed a loss of formation of 7-hydroxywarfarin for R108F and R108E mutations.[32,86] The third hydrogen-bonding interaction between the enzyme and substrate that differs between the two simulations is that between the aliphatic ketone oxygen of warfarin (O4 in Figure 1) and the carboxamide NH2 group of Asn204. The average distance between the closest amide proton of Asn204 and the O4 of warfarin is 3.03 and 2.40 Å in the C6 and C7 MD simulations, respectively. Asn204 formed a hydrogen bond to the acid group of flurbiprofen in the 1R9O crystal structure (the starting point for the calculations in this work).[31] Example QM/MM-optimized reactant complexes for C–O bond formation between Cpd I and C6 and C7 of S-warfarin in CYP2C9 are shown in Figure 13. In the binding poses for addition to C6 and C7 displayed in Figure 13, the same interactions were observed between warfarin and the active-site residues as described for the MD simulation described above.

Figure 13

Reactant complex structures for hydrogen abstraction from (a) C6 and (b) C7 of S-warfarin, calculated at the B3LYP-D(BSI)/CHARMM27 level of theory (corresponding to doublet profiles 6-8 and 7-9 in Table 3).

Table 3

Potential Energy Barriers, ΔE⧧ [in kcal/mol], from B3LYP-D(BSII)//B3LYP-D(BSI)/CHARMM27 Profiles (and Corresponding Boltzmann-Weighted Average, AveB) for Hydroxylation of S-Warfarin at the 6- and 7-Positionsa

	C6			C7
profile	²ΔE^⧧	⁴ΔE^⧧	profile	²ΔE^⧧	⁴ΔE^⧧
6-1	22.6	22.6	7-1	19.9	18.9
6-2	24.6	24.0	7-2	17.2	16.2
6-3	23.2	22.5	7-3	16.7	14.4
6-4	22.9	23.2	7-4	23.0	22.1
6-5	27.8	26.3	7-5	20.6	18.4
6-6	23.8	23.4	7-6	16.0	14.6
6-7	23.3	23.4	7-7	16.3	15.2
6-8	21.4	21.8	7-8	15.6	14.6
6-9	22.8	22.6	7-9	14.2	13.6
6-10	24.7	24.4	7-10	16.0	14.6
Ave_B	22.4	22.7	Ave_B	15.1	14.3

The superscript 2 and 4 labels correspond to the doublet and quartet spin states of Cpd I.

The barriers to C–O formation between S-warfarin and Cpd I, calculated at the B3LYP-D/CHARMM27 level, are provided in Table 3. The barriers to addition at C6 and C7 span the ranges 21.4–27.8 and 13.6–23.0 kcal/mol, respectively. As mentioned above, it is expected that a wide range of values will be observed for activation barriers when sampling different enzyme conformations. The barriers for a given pathway are similar, whether the doublet or quartet spin state of Cpd I is considered. In previous work, barriers to C–O bond formation with the doublet state of Cpd I were found to be slightly lower than the corresponding quartet state;[5] this is not the case for C–O bond formation between Cpd I and S-warfarin. The superscript 2 and 4 labels correspond to the doublet and quartet spin states of Cpd I. There is a clear preference for addition to the C7 carbon atom in the present QM/MM calculations, in agreement with the observed formation of 7-hydroxywarfarin as the major product in CYP2C9 assays.[23] The experimental Vmax for hydroxylation of S-warfarin in CYP2C9, determined at 310 K is 133.3 pmol min–1 nmol–1,[87] which corresponds to a free energy barrier of approximately 16 kcal/mol. As discussed for ibuprofen above, this value is an upper bound, as it represents the overall rate of turnover for the enzyme, and it is not known whether this is the overall rate-limiting step. The Boltzmann-weighted average value for the activation enthalpy of 14.3 kcal/mol, calculated at the B3LYP-D/CHARMM27 level is in good agreement with the experimental value, assuming that the contribution of entropy to the free energy barrier is small. The barriers for addition of Cpd I to the truncated model of S-warfarin in the gas phase, are shown in Table 4. The barriers to C6 and C7 hydroxylation are 12.8 and 13.3 kcal/mol, respectively, and are lower than the values calculated with QM/MM. The gas-phase barriers for the doublet state of Cpd I are lower than those calculated for the quartet and predict a slight preference for hydroxylation at C6. Assuming that the relative barriers to C–O addition at C6 and C7 will determine the ratio of S-6- and S-7-hydroxywarfarin, these barriers are not consistent with the 3:1 experimental preference for the formation of S-7- over S-6-hydroxywarfarin.[23]

Table 4

Gas-Phase B3LYP-D(BSIV)//B3LYP-D(BSIII) Energies [in kcal/mol] for Hydroxylation of S-Warfarin at the 6- and 7-Positionsa

site of oxidation	²E_RC	⁴E_RC	²E_TS	⁴E_TS	²ΔE^⧧	⁴ΔE^⧧
6	–12.9	–13.0	–0.1	1.7	12.8	14.7
7	–12.9	–13.0	0.4	3.0	13.3	16.0

The QM/MM barriers correctly display a preference for C–O bond formation at C7, in contrast to the gas-phase barriers. This suggests that the enzyme plays an important role in governing the selectivity of oxidation for S-warfarin in CYP2C9; despite C6 being slightly more reactive toward Cpd I than C7. It is C7 that undergoes C–O bond formation, due to the active-site residues positioning warfarin in a favorable geometry for reaction at C7. The relatively small experimental preference for S-7- over S-6-hydroxywarfarin (3:1)[23] is at first sight not consistent with our QM/MM calculations. The large difference in C–O addition barriers in our calculations would suggest exclusive formation of the S-7 hydroxylation product. It is possible that the QM/MM calculated barriers do not reproduce the differences in energy barrier accurately. However, the apparent discrepancy may also be due to the fact that our calculations only address the first step in the reaction mechanism, formation of a tetrahedral adduct of the Cpd I oxygen to one or the other of the two carbon atoms. In many cases, such adducts will lead to formation of the final hydroxylated product with the same regiochemistry as that of the initial addition. However, the mechanism of the subsequent steps does allow for migration of oxygen from one carbon to another. Previous work[5,55,88] shows that the tetrahedral adducts can react further in one of three ways, each of which has a low energy barrier: (i) proton shuffling, with transfer of a proton from the O-bonded carbon to the porphyrin ring; (ii) hydride migration from the O-bonded carbon to a neighboring carbon; (iii) ring closure by formation of a second C–O bond between the oxygen and a neighboring carbon to yield an epoxide (or arene oxide), that can undergo subsequent (presumably non-enzymatic) ring-opening. Pathways (i) and (ii) lead, after further steps, to formation of a product with a C–O bond in the same position as in the adduct. However, depending on the regiochemistry of epoxide ring-opening, the hydroxylated product from pathway (iii) can have a C–O bond either at the position of addition or at the neighboring position. The regiochemistry of ring-opening of arene oxides in the presence of weak acid is known to be influenced by resonance stabilization of an initially formed carbocation.[89] In the present case, the lactone oxygen substituent should favor opening of a 6,7-arene oxide to yield the S-6 hydroxylation product. Hence the observation of ca. 20% of the S-6 hydroxylation product could be due to initial exclusive addition of Cpd I to C-7—consistent with our calculations. This would need to be followed by dominant rearrangement of the adduct through mechanisms (i) or (ii), forming the major observed S-7 hydroxylation product, but partial occurrence of mechanism (iii) leading to arene oxide. This would then open regioselectively to the S-6 product. This interpretation is consistent with the amount of deuterium retention obtained during studies of the cytochrome P450-catalyzed hydroxylation of selectively deuterated forms of S-warfarin.[90] ERC and ETS correspond to the energies of the reactant complex and transition state, respectively, calculated relative to separate reactants. ΔE⧧ is the potential energy barrier, calculated as the difference between ERC and ETS. The superscript 2 and 4 labels correspond to the doublet and quartet spin states of Cpd I. The QM/MM transition states corresponding to the lowest energy pathways for C6 and C7 hydroxylation of S-warfarin are displayed in Figure 14. One notable difference between the TSs to C6 and C7 hydroxylation is between the Fe–O–C(x) angle. In the C7 TSs, this angle varies between 126 and 130°, whereas this angle is significantly larger in the C6 TSs (between 140 and 143°). This is in agreement with the relationship between angle of attack and barrier to oxidation observed for ibuprofen, summarized in Figure 11.

Figure 14

Transition-state structures for C–O bond formation between C6 and C7 of S-warfarin and the Cpd I ferryl oxygen of CYP2C9, calculated at the B3LYP-D(BSI)/CHARMM27 level of theory (corresponding to doublet profiles 6-8 and 7-9 in Table 3). The carbon atoms corresponding to C6 and C7 hydroxylation are displayed in green and orange, respectively. The gas-phase TS geometries for C–O bond formation to C6 and C7 of the truncated QM model S-warfarin are superimposed in Figure 15. The geometries are very similar with respect to angle of approach of the substrate carbon atom to Cpd I, which is expected given that the barriers are very similar (12.8 and 13.3 kcal/mol for C6 and C7 hydroxylation, respectively). Given the similarity of the C6 and C7 barriers, one might expect equal formation of 6- and 7-hydroxywarfarin (or even a slight preference for 6-hydroxywarfarin). This is not the experimental observation, however; clearly the enzyme controls the selectivity by orientating the substrate in such a position that hydroxylation at C7 is more facile.

Figure 15

Transition-state structures for hydrogen abstraction from C6 and C7 of a truncated model of S-warfarin, calculated in vacuo at the B3LYP-D/BSIII level of theory. The carbon atoms corresponding to C6 and C7 hydroxylation are in purple and yellow, respectively.

The difference in the orientation of the substrate relative to the heme, between the gas-phase and QM/MM TS geometries, is similar to that observed for S-ibuprofen. In the gas-phase calculations the ring structure of warfarin lies parallel to the heme, whereas in the QM/MM calculations the ring is closer to a perpendicular orientation. This difference is clearly a result of the amino acid residues surrounding the active site in the latter case. As mentioned in the Introduction, the metabolism of diclofenac by CYP2C9 predominantly leads to formation of 4′-hydroxydiclofenac. The formation of the 5-hydroxy product in CYP2C9 was observed when the acid group was removed from diclofenac.[19] Oxidation by CYP3A4 leads to exclusive formation of 5-hydroxydiclofenac, and the 5-position has been shown to be the chemically more reactive site using Cpd I mimetic species.[19] The MD simulation performed prior to oxidation at C4′ in diclofenac generated many structural snapshots where the distance between the C4′ and the Cpd I oxygen (C4′–O) was less than 3.0 Å, i.e., close enough for reaction to occur (the average value of this distance over the entire MD trajectory was 3.4 Å). A hydrogen bond was observed between the Arg108 side chain and the acid group of diclofenac for the majority of this simulation. During the parts of the simulation in which this interaction was missing, no noticeable effect was observed on the C4′–O distance. In the MD simulations used to generate structures for C5 oxidation, no hydrogen-bonding interaction was observed between Arg108 and the diclofenac acid group; instead the acid group forms hydrogen bonds to two water molecules for the entire simulation. Transition-state structures for hydrogen abstraction from C6 and C7 of a truncated model of S-warfarin, calculated in vacuo at the B3LYP-D/BSIII level of theory. The carbon atoms corresponding to C6 and C7 hydroxylation are in purple and yellow, respectively. Example QM/MM optimized reactant complex structures for the addition of Cpd I to C4′ and C5 of diclofenac are displayed in Figure 16. As in the MD simulations, a hydrogen bond is observed between the carboxylate of diclofenac and the Arg108 side chain in the reactant complex to addition at C4′. In the reactant complex for addition to C5, the carboxylate of diclofenac does not form any hydrogen bonds to the protein; however, it does form hydrogen bonds to three surrounding water molecules. Additionally, in the C5 addition reactant complex, an intramolecular hydrogen bond is formed between the carboxylate and N–H groups of diclofenac.

Figure 16

Reactant complex structures for hydrogen abstraction from (a) C4′ and (b) C5 of diclofenac, calculated at the B3LYP-D(BSI)/CHARMM27 level of theory (corresponding to doublet profiles 4′-2 and 5-3 in Table 5).

Table 5

Potential Energy Barriers, ΔE⧧ [in kcal/mol], from B3LYP-D(BSII)//B3LYP-D(BSI)/CHARMM27 Profiles for Hydroxylation of Diclofenac at the 4′- and 5-Positionsa

	C4′			C5
profile	²ΔE^⧧	⁴ΔE^⧧	profile	²ΔE^⧧	⁴ΔE^⧧
4′-1	18.8	19.5	5-1	18.7
4′-2	18.3	17.4	5-2	11.9
4′-3	22.3		5-3	11.8	15.7
4′-4	27.2
4′-5	20.4
Ave_B	19.0	17.8	Ave_B	12.1	15.7

The superscript 2 and 4 labels correspond to the doublet and quartet spin states of Cpd I.

The barriers to C4′- and C5-hydroxylation in diclofenac are provided in Table 5. The barriers for addition of Cpd I to C4′, calculated with B3LYP-D/CHARMM27, span the range 17.4–27.2 kcal/mol. The corresponding barriers for addition to C5 span the range 11.9–18.7 kcal/mol. Similar barriers were observed for the doublet and quartet spin states of Cpd I, and because previous studies have shown a tendency for lower barriers for the doublet state, not all of the pathways were calculated for both spin states. The superscript 2 and 4 labels correspond to the doublet and quartet spin states of Cpd I. The gas-phase QM barriers are provided in Table 6. The barriers to both 4′- and 5-hydroxylation are lower for the doublet spin state of Cpd I (12.9 and 13.4 kcal/mol, respectively). In the gas phase, C4′-hydroxylation has a slightly lower barrier than that of C5-hydroxylation. This is in stark contrast to the QM/MM barriers, where C5-hydroxylation is favored by over 6 kcal/mol.

Table 6

Gas-Phase B3LYP-D(BSIV)//B3LYP-D(BSIII) Energies [in kcal/mol] for Hydroxylation of Diclofenac at the 4′- and 5-Positionsa

site of oxidation	²E_RC	⁴E_RC	²E_TS	⁴E_TS	²ΔE^⧧	⁴ΔE^⧧
4′	–12.1	–12.1	0.8	1.3	12.9	13.4
5	–10.5	–10.5	2.9	4.9	13.4	15.5

ERC, ETS, and ΔE⧧ correspond to the reactant complex, transition state, and potential energy barrier, respectively. The superscript 2 and 4 labels correspond to the doublet and quartet spin states of Cpd I. The QM/MM barriers suggest that addition to C5 should be preferred in CYP2C9, which is inconsistent with the metabolites observed during experimental studies.[19] Given that QM/MM was able to predict the correct selectivity for addition to warfarin, this may seem surprising. However, a possible explanation for the disagreement with experiment is that the free energies of binding of diclofenac in the respective reactant complexes for 4′- and 5-hydroxylation are not factored into the above evaluations of the energy barriers. For ibuprofen and warfarin oxidation, where the major and minor sites of metabolism are relatively close together, the major and minor metabolites can be formed with the substrate in very similar binding orientations. For diclofenac, the two sites of metabolism that have been investigated are on different rings, and hence the substrate is required to be in two distinct binding poses in order for these sites to be close enough to Cpd I for reaction to occur. Our methods do not take into account the relative free energies of binding for the two orientations. The narrow active-site cavity in CYP2C9 does not allow for interchange between these two orientations during the relatively short time scale available in MD simulations. Simulations of S-warfarin in the active site of CYP2C9 performed by Seifert et al.[91] exhibited movement of the substrate around the active site; however, a larger movement than this would be required to interchange between the two binding orientations of diclofenac. In P450s with larger active sites, such as CYP3A4, the major metabolites formed with such isoforms are more likely to be those that result from oxidation of the most reactive site with respect to Cpd I, as the sites will be more likely to be equally accessible to Cpd I. Calculating accurate free energies of binding for P450 substrates is challenging, and beyond the scope of the present work. The QM/MM optimized structures of the TSs corresponding to the lowest energy C4′ and C5 profiles are superimposed in Figure 17. In the TS corresponding to hydroxylation at C4′, a hydrogen bond is observed between the acid group of diclofenac and Arg108. This interaction is also present in the reactant complex structure, and during the majority of the MD simulation. This interaction cannot occur when diclofenac is oriented for oxidation at C5, and this interaction may explain the preference for oxidation at C4′. As mentioned above, experiments have shown that removal of the acid group from diclofenac results in oxidation at C5.[19]

Figure 17

The gas-phase optimized TS geometries for oxidation at C4′ and C5 of diclofenac are shown in Figure 18. In the C5 hydroxylation TS, the ring undergoing oxidation is oriented parallel to the heme, with an Fe–O–C(5) angle of 135.9°. In the C4′ hydroxylation TS, the ring undergoing oxidation lies at an acute angle relative to the heme. The Fe–O–C(4′) angle is 125.2° for the 4′-hydroxylation TS. Applying the terminology used in our previous work on P450-mediated hydroxylation of benzene,[5,55,82] the C5 and C4′ TSs approximate to face-on and side-on addition, respectively. Despite the different geometries observed for the gas-phase TSs to C4′- and C5-hydroxylation, the energies of the two species are very similar (12.9 and 13.4 kcal/mol, respectively, in the doublet spin state of Cpd I).

Figure 18

Gas-phase transition state structures for C–O bond formation between C4′ and C5 of diclofenac and the Cpd I ferryl oxygen of CYP2C9, calculated at the B3LYP-D/6-31G(d) level of theory. The carbon atoms corresponding to C4′ and C5 hydroxylation are displayed in green and orange, respectively.

QM/MM transition-state structures for C–O bond formation between C4′ and C5 of diclofenac and the Cpd I ferryl oxygen of CYP2C9, calculated at the B3LYP-D-6-31G(d)/CHARMM27 level of theory (corresponding to doublet profiles 4′-2 and 5-3 in Table 5). The carbon atoms corresponding to C4′ and C5 hydroxylation are displayed in green and orange, respectively. Thus, the present calculations suggest that the observed preference for C4′-hydroxylation of diclofenac in CYP2C9 reflects the formation of the hydrogen-bonding interaction between the acid group of diclofenac and Arg108. The barriers to hydroxylation calculated in the gas phase are generally lower than those calculated in the enzyme, which suggests that the effect of Arg108 is not catalytic, but rather that interaction between the substrate and Arg108 leads to a preference for the 4′-hydroxylation binding site. This is consistent with a pharmacophore model for CYP2C9, which suggests that most substrates for CYP2C9 will have an anionic site at about 8 Å from the site of metabolism, which will interact with a cationic protein residue (Asp108).[29,92] The acid carbon of diclofenac is approximately 7.4 Å from C4′ in the gas-phase calculations, and 4.8 Å from C5. Hence hydroxylation at the C5 position would not be expected in CYP2C9, on the basis of the pharmacophore. Certainly, this observation is consistent with the experimental preference for oxidation at C5 in diclofenac with the acid group removed.[19] The two oxidation sites in diclofenac are sufficiently distant from each other that distinct binding modes are clearly required. On the other hand, there is unlikely to be a significant difference between the binding complexes for the two competing pathways for ibuprofen and warfarin, where a single binding mode is a sensible starting point for both pathways. Gas-phase transition state structures for C–O bond formation between C4′ and C5 of diclofenac and the Cpd I ferryl oxygen of CYP2C9, calculated at the B3LYP-D/6-31G(d) level of theory. The carbon atoms corresponding to C4′ and C5 hydroxylation are displayed in green and orange, respectively. The Curtin–Hammett free energy diagram shown in Figure 19 provides a hypothetical explanation as to why 4′-hydroxylation is favored, despite having a higher QM/MM barrier than that found for C5-hydroxylation. If the free energies of the C4′ reactant complex and TS are lower than those of C5, due to the stabilizing effect of hydrogen-bonding to Arg108, the 4′-hydroxylation pathway would be favored, even if the “local” barrier to hydroxylation at C5 (Δ⧧G5) is smaller. Certainly, this hypothesis is merely speculation at this stage. It would be interesting to investigate this effect further, but accurate calculations of binding free energies for P450s (or indeed other proteins) are not currently routine.[93] Further experimental studies of diclofenac oxidation by CYP2C9 in which Arg108 is mutated to another residue might help to support our hypothesis. The R108E mutant CYP2C9 was found to exhibit greatly reduced hydroxylation of diclofenac at the 4′-position; however, no corresponding increase in 5-hydroxylation was reported.[86]

Figure 19

Putative energy level diagram for CYP2C9 oxidation of diclofenac. 2C9+diclofenac refers to the free energy of the separate diclofenac molecule and CYP2C9 enzyme. 5-TS/5-RC and 4′-TS/4′-RC correspond to the free energies of the transition states/reactant complexes to 5- and 4′-hydroxylation, respectively. In this hypothesis, the energy barrier to 4′-hydroxylation (Δ⧧G4′) is larger than that of 5-hydroxylation (Δ⧧G5). However, the free energy of binding is greater for 4′-hydroxylation, and the relative energy of the transition state to 4′-hydroxylation is lower than that for 5-hydroxylation; hence 4′-hydroxylation would be the preferred pathway.

Concluding Remarks

Accurate predictions of metabolites formed during P450-mediated metabolism should help identify ADME/Tox properties of drug candidates, and thus lessen the risk of adverse drug reactions arising in the late stages of drug development. Here we have shown that QM/MM calculations can be used to investigate regioselectivity in drug metabolism. We have modeled the metabolism of three commonly used pharmaceutical compounds. Our calculations largely rationalize the experimentally known selectivity and provide further support to the theory that predictions of metabolites can be made on the basis of the relative energy barriers to oxidation by Cpd I at different sites on a given drug molecule. When (as is usually the case) multiple binding modes are possible, it is important also to consider which binding modes are favored; where multiple binding orientations of a drug molecule are accessible, the relative free energies of such orientations may also need to be taken into account. In the case of diclofenac, the relative numbers of hydrogen bonds in the two binding positions provide an indication of the preferred binding mode. Such interactions should be considered during the initial docking of the substrate into the active site. From a practical perspective, we present a protocol here that can be used for modeling the metabolic reactions of drug molecules in CYP2C9, and indeed in other P450 isoforms. Several factors must be taken into account to achieve reliable results. The choice of method is important; a QM method that can correctly model the electronic structure of Cpd I is imperative. In order to get barrier heights that are in reasonable agreement with experiment, dispersion should be accounted for, e.g., by use of an empirical correction[56] as shown here. When calculating reaction barriers in enzymes, conformational sampling is also extremely important, as a wide range of barriers can often be observed due to slight fluctuations in the orientation of the substrate and surrounding residues. This can be achieved by calculating several reaction profiles, using different structures generated from MD simulations. Applied in this way, QM/MM methods can contribute usefully to understanding and predicting drug metabolism by P450s.

61 in total

1. Development of a combined protein and pharmacophore model for cytochrome P450 2C9.

Authors: Marcel J de Groot; Alexander A Alex; Barry C Jones
Journal: J Med Chem Date: 2002-05-09 Impact factor: 7.446

2. Identification of residues 286 and 289 as critical for conferring substrate specificity of human CYP2C9 for diclofenac and ibuprofen.

Authors: T S Klose; G C Ibeanu; B I Ghanayem; L G Pedersen; L Li; S D Hall; J A Goldstein
Journal: Arch Biochem Biophys Date: 1998-09-15 Impact factor: 4.013

3. The role of CYP2C9 genotype in the metabolism of diclofenac in vivo and in vitro.

Authors: U Yasar; E Eliasson; C Forslund-Bergengren; G Tybring; M Gadd; F Sjöqvist; M L Dahl
Journal: Eur J Clin Pharmacol Date: 2001-12 Impact factor: 2.953

4. Understanding the determinants of selectivity in drug metabolism through modeling of dextromethorphan oxidation by cytochrome P450.

Authors: Julianna Oláh; Adrian J Mulholland; Jeremy N Harvey
Journal: Proc Natl Acad Sci U S A Date: 2011-03-28 Impact factor: 11.205

5. Diclofenac and its derivatives as tools for studying human cytochromes P450 active sites: particular efficiency and regioselectivity of P450 2Cs.

Authors: A Mancy; M Antignac; C Minoletti; S Dijols; V Mouries; N T Duong; P Battioni; P M Dansette; D Mansuy
Journal: Biochemistry Date: 1999-10-26 Impact factor: 3.162

6. Substrate probe for the mechanism of aromatic hydroxylation catalyzed by cytochrome P450.

Authors: J F Darbyshire; K R Iyer; J Grogan; K R Korzekwa; W F Trager
Journal: Drug Metab Dispos Date: 1996-09 Impact factor: 3.922

7. Putative active site template model for cytochrome P4502C9 (tolbutamide hydroxylase).

Authors: B C Jones; G Hawksworth; V A Horne; A Newlands; J Morsman; M S Tute; D A Smith
Journal: Drug Metab Dispos Date: 1996-02 Impact factor: 3.922

8. Oxygen economy of cytochrome P450: what is the origin of the mixed functionality as a dehydrogenase-oxidase enzyme compared with its normal function?

Authors: Devesh Kumar; Samuël P De Visser; Sason Shaik
Journal: J Am Chem Soc Date: 2004-04-28 Impact factor: 15.419

9. Re-engineering of CYP2C9 to probe acid-base substrate selectivity.

Authors: Guoying Tai; Leslie J Dickmann; Nicholas Matovic; James J DeVoss; Elizabeth M J Gillam; Allan E Rettie
Journal: Drug Metab Dispos Date: 2008-07-07 Impact factor: 3.922

10. Determinants of reactivity and selectivity in soluble epoxide hydrolase from quantum mechanics/molecular mechanics modeling.

Authors: Richard Lonsdale; Simon Hoyle; Daniel T Grey; Lars Ridder; Adrian J Mulholland
Journal: Biochemistry Date: 2012-02-10 Impact factor: 3.162

21 in total

1. Multiscale Methods in Drug Design Bridge Chemical and Biological Complexity in the Search for Cures.

Authors: Rommie E Amaro; Adrian J Mulholland
Journal: Nat Rev Chem Date: 2018-04-11 Impact factor: 34.035

2. Developing Structure-Activity Relationships for N-Nitrosamine Activity.

Authors: Kevin P Cross; David J Ponting
Journal: Comput Toxicol Date: 2021-09-08

3. Dynamics underlying hydroxylation selectivity of cytochrome P450cam.

Authors: Sashary Ramos; Claire C Mammoser; Katherine E Thibodeau; Megan C Thielges
Journal: Biophys J Date: 2021-02-03 Impact factor: 4.033

Review 4. Antihypertensive drugs metabolism: an update to pharmacokinetic profiles and computational approaches.

Authors: Aikaterini Zisaki; Ljubisa Miskovic; Vassily Hatzimanikatis
Journal: Curr Pharm Des Date: 2015 Impact factor: 3.116

5. With or without light: comparing the reaction mechanism of dark-operative protochlorophyllide oxidoreductase with the energetic requirements of the light-dependent protochlorophyllide oxidoreductase.

Authors: Pedro J Silva
Journal: PeerJ Date: 2014-09-02 Impact factor: 2.984

6. Multiscale Simulations on the Catalytic Plasticity of CYP76AH1.

Authors: Yufan Qiu; Hongjuan Diao; Ying Zheng; Ruibo Wu
Journal: Front Chem Date: 2021-06-02 Impact factor: 5.221

7. Quantum mechanical modeling: a tool for the understanding of enzyme reactions.

Authors: Gábor Náray-Szabó; Julianna Oláh; Balázs Krámos
Journal: Biomolecules Date: 2013-09-23

Review 8. Applications of density functional theory to iron-containing molecules of bioinorganic interest.

Authors: Hajime Hirao; Nandun Thellamurege; Xi Zhang
Journal: Front Chem Date: 2014-04-29 Impact factor: 5.221

9. Understanding a substrate's product regioselectivity in a family of enzymes: a case study of acetaminophen binding in cytochrome P450s.

Authors: Yue Yang; Sergio E Wong; Felice C Lightstone
Journal: PLoS One Date: 2014-02-03 Impact factor: 3.240

10. A multiscale approach to modelling drug metabolism by membrane-bound cytochrome P450 enzymes.

Authors: Richard Lonsdale; Sarah L Rouse; Mark S P Sansom; Adrian J Mulholland
Journal: PLoS Comput Biol Date: 2014-07-17 Impact factor: 4.779