Literature DB >> 27064579

Prediction of Stereochemistry using Q2MM.

Eric Hansen¹, Anthony R Rosales¹, Brandon Tutkowski¹, Per-Ola Norrby², Olaf Wiest^1,3.

Abstract

The standard method of screening ligands for selectivity in asymmetric, transition metal-catalyzed reactions requires experimental testing of hundreds of ligands from ligand libraries. This "trial and error" process is costly in terms of time as well as resources and, in general, is scientifically and intellectually unsatisfying as it reveals little about the underlying mechanism behind the selectivity. The accurate computational prediction of stereoselectivity in enantioselective catalysis requires adequate conformational sampling of the selectivity-determining transition state but has to be fast enough to compete with experimental screening techniques to be useful for the synthetic chemist. Although electronic structure calculations are accurate and general, they are too slow to allow for sampling or fast screening of ligand libraries. The combined requirements can be fulfilled by using appropriately fitted transition state force fields (TSFFs) that represent the transition state as a minimum and allow fast conformational sampling using Monte Carlo. Quantum-guided molecular mechanics (Q2MM) is an automated force field parametrization method that generates accurate, reaction-specific TSFFs by fitting the functional form of an arbitrary force field using only electronic structure calculations by minimization of an objective function. A key feature that distinguishes the Q2MM method from many other automated parametrization procedures is the use of the Hessian matrix in addition to geometric parameters and relative energies. This alleviates the known problems of overfitting of TSFFs. After validation of the TSFF by comparison to electronic structure results for a test set and available experimental data, the stereoselectivity of a reaction can be calculated by summation over the Boltzman-averaged relative energies of the conformations leading to the different stereoisomers. The Q2MM method has been applied successfully to perform virtual ligand screens on a range of transition metal-catalyzed reactions that are important from both an industrial and an academic perspective. In this Account, we provide an overview of the continued improvement of the prediction of stereochemistry using Q2MM-derived TSFFs using four examples from different stages of development: (i) Pd-catalyzed allylation, (ii) OsO4-catalyzed asymmetric dihydroxylation (AD) of alkenes, (iii) Rh-catalyzed hydrogenation of enamides, and (iv) Ru-catalyzed hydrogenation of ketones. In the current form, correlation coefficients of 0.8-0.9 between calculated and experimental ee values are typical for a wide range of substrate-ligand combinations, and suitable ligands can be predicted for a given substrate with ∼80% accuracy. Although the generation of a TSFF requires an initial effort and will therefore be most useful for widely used reactions that require frequent screening campaigns, the method allows for a rapid virtual screen of large ligand libraries to focus experimental efforts on the most promising substrate-ligand combinations.

Entities: Chemical Disease Species

Mesh：

Substances：

Year: 2016 PMID： 27064579 PMCID： PMC4879660 DOI： 10.1021/acs.accounts.6b00037

Source DB: PubMed Journal: Acc Chem Res ISSN： 0001-4842 Impact factor: 22.384

Introduction

Asymmetric catalysis is a major focus of synthetic organic chemistry. The experimental discovery of enantioselective catalysts typically involves trial-and-error approaches, thus necessitating many hours of bench work or specialized machinery to partially automate the process. This process provides only indirect insight into the origin of ligand performance and contributes little to the rational design of novel ligands. Computational approaches to enantioselective catalysis can engage in a productive interplay with experimental observations, making significant contributions by providing structural and mechanistic data rationalizing experimental data and quantitatively accurate predictions of selectivity before experimental ligand screening. The accurate calculation of ΔG⧧ is a daunting task. In contrast, the selectivity of a reaction depends only on the relative rates between the pathways leading to the different products, e.g., different enantiomers, as indicated by the solid red and blue pathways in Figure . Thus, only ΔΔG⧧ at the transition states is needed to predict selectivity, leading to significant error cancellation if the two relevant transition states are very similar, e.g., for diastereomeric transition states leading to enantiomeric products. The remainder of this account will focus on this case, and detailed analyses of each case can be found in the references cited.

Figure 1

Diastereomeric transition states leading to enantiomeric products.

Diastereomeric transition states leading to enantiomeric products. Multiple transition states (TSs), e.g., different conformations of the TSs, can lead to the same enantiomer (dotted lines in Figure ). These can be close enough in energy to the lowest energy TS to be relevant or even lower if the initial conformation chosen is not the lowest energy TS. In this case, the observed ee is determined by the Boltzmann distributions for each diastereomeric TS as shown in eq where i and j are summed over all the conformations for each diastereomeric TS. In practice, all ΔΔG⧧ are taken relative to an arbitrary reference energy, generally the lowest one for each diastereomer. Historically, only single conformations from limited manual conformational searches or educated guesses were considered in computational selectivity predictions. As the systems studied computationally approached the experimentally relevant ones, the number of conformations to be considered became unfeasible for quantum mechanics (QM) methods. This is particularly problematic for systems involving large, flexible ligands where it was shown that the experimentally observed selectivities cannot be explained by a single TS.[1] Furthermore, for predictions to be useful, they must be faster than experimental screens for possibly hundreds of ligands while still being quantitatively accurate. Despite significant advances in the speed of QM and QM/MM methods, such simulations are still intractable. A possible alternative to the problem of rapid conformational sampling is the use of fast molecular mechanics (MM) or force field (FF) methods. Although these are more commonly associated with the calculation of ground states, their use for the description of TSs has a long history. One of the first uses of an empirical FF was to describe the rotational barriers in biphenyls.[2] As early as 1929, the reaction of H and H2 was modeled by mixing the potential energy functions of the ground states.[3] This concept was generalized as the valence bond (VB) method,[4] which gave rise to a large number of related methods,[5,6] such as multiconfigurational MM (MCMM),[7] RFF,[8] SEAM,[9] ACE,[10] and most prominently empirical VB (EVB).[11] These methods share the common element of treating the transition state by an appropriate mix of the reactant and product PES as described by a classical FF but use different approaches to determine the interaction of two ground-state diabatic PES at the transition state, as shown in Figure .

Figure 2

Comparison of force field methods for TS modeling.

Comparison of force field methods for TS modeling. An alternative method is to model the TS not as an energetic maximum but as a minimum using transition state force fields (TSFFs). TSFFs have a 50-year history of explaining and predicting relative rates and selectivities in organic reactions.[12] Conceptually, TSFFs should be more accurate in describing the TS compared to an approximation by geometrically and electronically different ground state FFs, which are frequently only capable of describing the system at small distortions from the equilibrium structure. However, a TSFF necessitates the generation of new parameters specific to the TS of interest, and the PES of the reactant, product, and TS are discontinuous. Although the second point is less relevant to computational studies of selectivity as discussed above, a practical method for the prediction of stereochemistry requires a fast and ideally automated method for the generation of reaction-specific TSFFs.

The Quantum-Guided Molecular Mechanics (Q2MM) Method

Over the last two decades, we developed the Q2MM[13] method as a fast and accurate method for the parametrization of FFs. The basic idea of the Q2MM method is the automated fitting of the FF parameters using only electronic structure calculations, as described in Figure , by minimizing the objective function shown in eq .

Figure 3

Flowchart of the Q2MM method.

Flowchart of the Q2MM method. A prerequisite for the application of the Q2MM method is knowledge of the selectivity-determining step of a given reaction. The first step is then to determine reference data for a training set consisting of small model TSs. These small model TSs should cover the minimum structures needed for capturing the essential steric and electronic features of the reaction in the definition of the new parameters close to the reaction center. They should also avoid generating “noise” from the parts of the structure that will be represented by transferable ground state parameters from the underlying force field. For each of these TSs, a full geometry optimization and frequency calculation using an appropriate electronic structure method and standard quantum mechanical code is performed, which also yields the partial charges of the atoms. The second step is to decide for which atoms to generate new parameters and which functional form of the force field to use. Sometimes several different possibilities need to be fitted to obtain the best balance of capturing all relevant interactions with a minimum number of parameters. Most current applications of Q2MM concern organometallic catalysts for which no ground state FF parameters are available. Thus, new atom types must be added to describe the metals, and all FF parameters involving these new atom types must be fitted. As the majority of the atoms in the system will be described by standard FF parameters, high-quality parameters have to be available for the systems to be studied. For the small organic molecules under discussion here, the MM3* force field[14] was shown to be suitable, but Q2MM can be used to fit parameters for any force field. In the third step, the FF parameters are iteratively fit to reproduce the QM reference data by minimization of the objective functionwhere x° is the QM-derived reference data point, x is the corresponding FF calculated data point, and w is the weight. The weight is typically set to be the inverse of the acceptable error for a given data type (e.g., bond lengths).[15] Thus, a converged FF with N data points in its training set should converge to a penalty function value no greater than N. A combination of gradient-based and simplex optimization techniques is used to minimize the penalty function.[13] We have automated the fitting procedures outlined in Figure into code that works with a number of QM and MM programs, which is freely available from the authors. Although all bonded and nonbonded parameters of the force field need to be fit, we found it useful to start with the electrostatic parameters alone. Coulombic parameters are fit to reproduce RESP or CHELPG charges calculated using standard software packages. As Q2MM is designed to work closely with MM3*, existing van der Waals parameters can be used. A key difference between Q2MM and most traditional manual or automated methods for fitting system-specific FF parameters[16−19] is the use of the Hessian Matrix for the fitting of force constants of bonded parameters together with geometric data for reference values in the Q2MM method.[13,20,21] The Hessian matrix describes how the energy changes with respect to geometric distortions, which is important for a number of reasons. The Hessian matrix in Cartesian coordinates consists of 3N × 3N data points, where N is the number of atoms. This provides an abundance of reference data for FF optimizations, thus circumventing the previously criticized overfitting of parameters in TSFFs.[22] It also contains the information for an accurate description of the energetic penalty upon distortion from the equilibrium geometry that is expected to be relevant for bulky ligands often used in stereoselective catalysis. Finally, it provides a convenient means to invert the first order saddle point obtained from the QM calculations (Figure , left) to the minimum used in TSFFs (Figure , right).[23] Inverting the curvature of the PES along the reaction coordinate causes a geometry optimization to head toward the TS structure, allowing the use of standard geometry optimization techniques included in MM packages to locate TSs. Flipping this curvature is possible by decomposing the Hessian into its corresponding eigenvectors V and eigenvalues Sand replacing the negative eigenvalue of the TS with a large positive value or fitting directly to the eigenmodes and eigenvectors while ignoring the eigenmode with negative eigenvalues.[24]

Figure 4

Representation of the Hessian’s description of the PES about a TS (left) and minimum (right).

Representation of the Hessian’s description of the PES about a TS (left) and minimum (right). Bonded parameters are optimized following optimization of the nonbonded parameters. The reference, or “ideal”, bond and angle values are initially set to average values obtained from the training set and often change little throughout the optimization. Bond and angle force constants are initially set to fairly high standard values and are mainly influenced by the Hessian data during parametrization. Dihedral force constants are often fit to reproduce energies from QM torsional scans. However, this data may not be readily obtainable at the TS depending on the size and coordination of the system. Another option is to optimize dihedral force constants such that the MM FF reproduces the QM-derived Hessian and potentially the relative energies of different conformers from the training set. Fitting the dihedral parameters is typically one of the more problematic phases of the parametrization, as the choice of periodicity requires some expertise. In general, this should be done after an initial fitting of other parameters; torsional parameters can to some extent be seen as error corrections due to insufficient representation of through-space interactions in the force field but should not be allowed to take over the modeling of interactions that are more properly described by bond, angle, or nonbonded parameters. The same is true to an even larger extent for any cross-terms included in the force field.[24] The final step of the Q2MM method is validation of the TSFF by comparing the results for geometries, Hessian matrix values, and relative energies from optimizations using the TSFF for transition states obtained from electronic structure calculations that were not part of the training set. Once satisfactory agreement has been achieved, the TSFF can be used to run Monte Carlo (MC) conformational searches for the TS, leading to each of the stereoisomers as shown in Figure . The relative energies of the resulting conformations can be Boltzmann averaged to predict stereoselectivities using eq . After validation by comparison to experimentally known stereoselectivities, this can be used to rapidly predict ee’s and screen suitable ligands for a given catalyst/substrate combination.

Applications of Q2MM to Stereoselective Catalysis

Palladium-Mediated Allylation

The first applications of the predecessor of the Q2MM method to the prediction of stereoselectivity were qualitative predictions of palladium-catalyzed allylation (Figure ) using ground state FFs.[25,26] Several chiral bidentate ligands had been shown to be effective for enantioselective and regioselective catalysis of this reaction,[25] and MM parameters for metal–ligand interactions including palladium–nitrogen and η3-allylpalladium complexes were under development.[27] However, several competing reaction pathways with different selectivity determining steps dependent on the different substrates and ligands were thought to be operating, prompting two studies using ground state FFs for substituted phenanthroline ligands and several substrates to evaluate selectivity.

Figure 5

Catalytic cycle for the palladium-mediated allylation of nucleophiles.

Catalytic cycle for the palladium-mediated allylation of nucleophiles. The first study to explore the enantioselectivity of several substrates with chiral phenanthroline ligands is summarized in Table . The results rank the ligands in a qualitative fashion as high, low, or modest in enantioselectivity.[25] Although these results contributed to the development of the Q2MM method, these predictions had similar reliability to a chemist’s intuition of steric and electronic effects.

Table 1

Qualitative Predictions Using Only a Ground State FF

The second study combined MM with linear free energy relationships (LFER) between experimentally observed selectivities (ΔΔG⧧) and the ground state parameters of the allyl-Pd structures to improve the accuracy of the predictions.[26] Several achiral phenanthroline-derived ligands were included in the validation of the resulting FF to explore regioselectivity. This work emphasized steric interactions and thus included ligands and substrates with little variation in electronic structure. The focus on diethyl-methylmalonate and DMF cancelled effects of the nucleophile and solvent, respectively. The resulting predictions (Table ) were better than the earlier qualitative predictions. The inclusion of the LFER emphasized the need to examine TSs.

Table 2

Combined Ground State FF and LFER Energy Comparison (kJ/mol) of Nucleophilic Addition Using Diethyl Methylmalonate

These early studies provided several important insights into the further development of the Q2MM method. First, reactions with a well-defined rate- and selectivity-determining step are more suitable for reaction-specific FFs. Utilizing only general parameters of a ground state FF will usually provide only qualitative predictions at best. The performance of the combined MM and LFER, although accurate, requires several experiments with a range of selectivities for parametrization.

Osmium Tetroxide-Catalyzed Asymmetric Dihydroxylation (AD) of Alkenes

The first iterations of the Q2MM method attempted to address these problems by using the results from QM calculations for the parametrization of a TSFF. The osmium tetroxide AD reaction of alkenes (Figure )[28−30] has a wide scope, mild conditions, high yields, and excellent enantioselectivity. The intense debate over a concerted [3 + 2] or stepwise [2 + 2] mechanism was resolved in favor of the former by a combination of isotope effect experiments and DFT calculations.[31]

Figure 6

Osmium Tetroxide-Catalyzed Asymmetric Dihydroxylation (AD) of Alkenes.

Osmium Tetroxide-Catalyzed Asymmetric Dihydroxylation (AD) of Alkenes. The training set included several TS structures with simple model ligands of ammonia or trimethylamine along with various alkene substrates (Figure ). Validation of the results from the Q2MM-derived TSFF to QM calculations as well as experimental results (Figure ) show excellent agreement with an ∼2.5 kJ/mol mean unsigned error (MUE), which is considerably better than the “chemical accuracy” target of 4 kJ/mol. The TSFF was able to predict the correct major enantiomer, and in most cases, the predicted ee is close to the experimental values. Substrates where predictions deviated significantly from the experimental results typically show low ee’s (indicating low energy differences and hence large errors) and are often those that react slowly, possibly due to alternative pathways (e.g., nonselective background reactions without the chiral ligand).

Figure 7

Osmium complexes and alkenes used as the QM training set.

Figure 8

Selectivity comparison of Q2MM-derived FF and experimental results for AD reaction.

Osmium complexes and alkenes used as the QM training set. Selectivity comparison of Q2MM-derived FF and experimental results for AD reaction. The Q2MM-derived TSFF, together with additional experimental studies, allowed for the refinement of the original mnemonic for face-selectivity based on DHQD- or DHQ-derived ligands (Figure , top).[28,29] Analysis of the lowest energy conformations from the conformational searches using the TSFF provided an improved mnemonic (Figure , bottom) that gives better insights into the repulsive and attractive interactions responsible for stereoselectivity. Additionally, the TSFF also rationalizes the low stereoselectivity for certain substrates that cannot be described by the simple mnemonic.

Figure 9

(top) Original mnemonic to predict the selectivity of the Sharpless AD using (DHQD)2PHAL and (DHQ)2PHAL ligands. (bottom) Revised mnemonic derived from analysis with the Q2MM TSFF. Ovals and dotted circles represent steric bulk and attractive areas of the ligand, respectively. The AD reaction is also one of the few cases for which a direct comparison can be made between Q2MM and QM/MM methods.[32] Both methods were found to have similar accuracies, and each has its own strengths and weaknesses. Careful selection of the structures to be calculated in the QM/MM studies deemphasized the need for a large conformational search, but the number of cases that could be studied is still limited. Conversely, the effort needed for the generation of the Q2MM TSFF is considerable, but once available, it is orders of magnitude faster than the QM/MM calculations, enabling extensive sampling and/or high throughput virtual screening of substrates and ligands. Both the QM/MM and Q2MM approaches provided solutions to a well-known problem, the lack of dispersive attractive forces in DFT calculations. In both cases, the regions subjected to QM studies are small enough so that dispersion interactions could be ignored, whereas the attractive vdW interactions that are an important feature of the dihydroxylation reaction were captured using the accurate vdW potential in MM3.

Rhodium-Catalyzed Hydrogenation of Enamides

The rhodium-catalyzed hydrogenation of enamides is a reaction of great synthetic importance to both academia and industry for the synthesis of natural and unnatural amino acids.[33,34] A generalized example is shown in Figure . The reaction is known to be highly sensitive to both ligand and substrate structure, necessitating time- and resource-intensive experimental screens to identify highly enantioselective combinations. The reaction is therefore a prime candidate for the development of a Q2MM TSFF, which would allow a fast virtual screening of ligands to identify promising candidates among the more than 200 commercially available bisphosphines used for this reaction.

Figure 10

Rh-Catalyzed hydrogenation of enamides.

Rh-Catalyzed hydrogenation of enamides. The mechanism of the reaction had been elucidated previously[35−38] and is summarized in Figure , but the stereoselecting step was not unambiguously identified. A distinctive feature of this mechanism is the “anti-lock-and-key” principle, meaning that the major square planar conformation frequently leads to the minor product enantiomer.[35,39,40] The observed enantioselectivity is based on the reactivity of the substrate–catalyst complexes with hydrogen to generate the product, emphasizing the need for an analysis of the competing TSs.

Figure 11

Overall mechanism of rhodium-catalyzed hydrogenation of enamides.

Overall mechanism of rhodium-catalyzed hydrogenation of enamides. On the basis of DFT results for the stereodetermining step of the mechanism,[41] we developed a Q2MM TSFF.[42] The QM training set consisted of four structurally different chiral and achiral bidentate phosphine ligands (ZDMP, DMPE, (R,R)-MeDuPHOS, and (R,R)-Me-BPE) and the substrate α-formamidoacrylonitrile, Figure , which was used previously in computational studies of the reaction.[36−38]

Figure 12

Ligands and substrate used in the QM training set.

Ligands and substrate used in the QM training set. In total, nine structures were optimized at the B3LYP/lacvp** level of theory for use as the training set. Using the functional form of MM3*, new atom types were added, and the necessary parameters were optimized using the automated procedure outlined in Figures and 4 to reproduce the electronic and geometric structures of the TS calculated by DFT. A comparison of the QM and MM data verified that the developed TSFF could accurately reproduce the training set.[42] After the initial validation toward the training set, 13 ligands and seven substrates were chosen for a virtual screening.[43] The ligands A–M and substrates 1–7, shown in Figure , were chosen to represent structural and electronic diversity, including different types of chirality, substrate geometries, and different protecting groups, and for the availability of experimental results under similar conditions in the literature.[43]

Figure 13

Ligands A–M and substrates 1–7 used in virtual screening and selectivity comparison of Q2MM-derived FF and experimental results for Rh-catalyzed hydrogenation of enamides.

Ligands A–M and substrates 1–7 used in virtual screening and selectivity comparison of Q2MM-derived FF and experimental results for Rh-catalyzed hydrogenation of enamides. The correlation of the Q2MM ee predictions with the known experimental ee has an R2 of 0.90 as shown in Figure , indicating that the FF was correctly modeling the reaction in the vast majority of cases.

Figure 14

Selectivity comparison of Q2MM-derived FF and experimental results for Rh-catalyzed hydrogenation of enamides.

Selectivity comparison of Q2MM-derived FF and experimental results for Rh-catalyzed hydrogenation of enamides. Most importantly, highly selective substrate/ligand combinations are rapidly identified with an ∼80% accuracy, e.g., of five selected ligands, four will actually give a high selectivity for a given substrate. This accuracy, which is much higher than in virtual screens used in medicinal chemistry, provides the experimental chemist with a rapid tool to focus efforts on a very small number of ligands for a given substrate. There are two types of outliers. As was the case in the earlier studies, the largest deviations occur for reactions with low stereoselectivity due to the small energy values involved. As these cases are unlikely to be of interest to the experimental chemist, this is acceptable. There are a small number of false positives, i.e., ligands that are predicted to be highly selective but are experimentally known to have low selectivity. Interestingly, these consistently involve the bulkiest ligands tested, such as B, E, and G, suggesting a possible dissociation of the ligand or substrate.[44]

Ruthenium-Catalyzed Hydrogenation of Ketones

After several methodological improvements to the Q2MM procedure, it was applied to the ruthenium-catalyzed hydrogenation reaction of ketones, which occurs readily using the Noyori catalyst (Figure ).[45] The selectivity and rate-determining transition state was elucidated previously and is shown in Figure .[46] Hydrogenation of the ketone is concerted with hydrogen delivery mediated by the ruthenium and a dative nitrogen. The reaction is attractive for application of the Q2MM method for a variety of reasons. First, it is synthetically important in that it has the potential to be both chemo- and stereoselective in the reduction of ketones to alcohols. Chiral alcohols are a common motif in natural products as well as drugs and drug-like molecules. Although there are many ways to selectively reduce a ketone, very few provide a catalytic approach that uses inexpensive hydrogen gas. Second, several mechanistic studies have elucidated a single step that determines stereoselectivity. Third, the combinatorics of two different chiral ligands leads to a number of possible combinations too large to be explored experimentally. The use of the Q2MM method to perform a high throughput screen has the potential to focus an experimental screen on manageable numbers while exploring a wide chemical space.

Figure 15

Hydrogenation of a ketone with a Noyori catalyst.

Figure 16

TS depicting a reparameterized reaction center.

Hydrogenation of a ketone with a Noyori catalyst. TS depicting a reparameterized reaction center. The QM training set for parametrization included several combinations of the diamine ligands and substrates with a common phosphine ligand as is shown in Figure . All calculations were completed using the B3LYP/lacvp* level of theory. The TS structures were used to parametrize the MM3* FF for atoms involved in the reaction center shown in Figure . The internal validation toward the QM data demonstrated the success of the automated parametrization for the reproduction of geometries and vibrational modes.

Figure 17

Substrates and ligands used for the QM training set.

Substrates and ligands used for the QM training set. Although the training set used simplified ligands and substrates, the independent test set, calculated using the same QM method, included more complex systems such as BINAP-type and chiral diamine ligands. The conformational space of these more complex ligands was explored using the TSFF and located conformations that were subjected to QM calculations at the same level of theory. The stereoselectivity based on the Boltzmann averaging was compared to the one obtained by using only the lowest energy conformers of the two diasteromers. Comparison of both ΔΔE⧧ methods produces similar predictions, but the MUE of the latter was significantly larger than using the Boltzmann averaging, highlighting the importance of higher energy conformations. The validation by comparison to experimental data gave good overall agreement between experimental and calculated ee with an MUE of only 2.7 kJ/mol (Figure ), showcasing the improvements made to the Q2MM method in this more complex reaction. A further inspection of energy contributions of a ligand–substrate combination can assist in elucidating the major contribution of selectivity. One such example would be the calculation of the (S)-TolBINAP/(R)-DMAPEN/(E)-chalcone complex. Most of the energy differences between the two diastereomeric TSs are derived from vdW forces, especially with respect to edge–face interactions of the aromatic rings. This highlights that many ligand–substrate combinations have unique interactions that cannot be predicted by general rules.

Figure 18

Selectivity comparison of Q2MM-derived FF and experimental results for Ru-catalyzed hydrogenations of ketones.

Conclusions

For the past two decades, the Q2MM method has been developed and successfully applied to perform the virtual screening of ligands in enantioselective reactions.[47,48] Although the parametrization of the reaction-specific TSFF requires an upfront investment to obtain the underlying QM data and fit and validate the force field, the speed and accuracy of the computational predictions make Q2MM-derived TSFFs an attractive complement to experimental screens of substrate–ligand combinations for commonly used transition metal-catalyzed transformations. To this end, the Q2MM code and the validated force fields are freely available to the community via github.com/Q2MM/q2mm. The ultimate goal of the method is to enable bench chemists and other scientists who may not have specific experience in computational methods to easily use Q2MM to make quick, useful predictions that greatly decrease the number of experiments necessary for the identification of suitable ligands, thus increasing the efficiency of experimental approaches. The incorporation of the existing TSFF discussed here as well as additional ones into easy-to-use graphical interfaces with predefined ligand libraries can make a predictive virtual ligand screen an integral part of protocol development in enantioselective catalysis. The concepts outlined in this Account are, in principle, also transferrable to other types of selectivity (such as regioselectivity) and other problems where computationally expensive calculations have to be repeated many times to achieve adequate sampling, such as QM/MM molecular dynamics on enzymes. Force field methods cannot generally be used to calculate energies of isomers that differ in which parameters are used, such as regioisomers, but the force field could be used to explore the conformational space of each isomer (within an isoparametric set), and the energy difference in nonisoparametric comparisons could be calibrated using a few DFT calculations or by comparison to selected experimental data.[23] Current developments of Q2MM will investigate the applicability of the methodology to these problems.

16 in total

1. Ligand and substrate effects on the mechanism of rhodium-catalyzed hydrogenation of enamides.

Authors: Patrick J Donoghue; Paul Helquist; Olaf Wiest
Journal: J Org Chem Date: 2007-02-02 Impact factor: 4.354

2. Prediction of enantioselectivity in rhodium catalyzed hydrogenations.

Authors: Patrick J Donoghue; Paul Helquist; Per-Ola Norrby; Olaf Wiest
Journal: J Am Chem Soc Date: 2009-01-21 Impact factor: 15.419

3. Asymmetric catalysis: science and opportunities (Nobel lecture).

Authors: Ryoji Noyori
Journal: Angew Chem Int Ed Engl Date: 2002-06-17 Impact factor: 15.336

4. Toward a computational tool predicting the stereochemical outcome of asymmetric reactions: development of the molecular mechanics-based program ACE and application to asymmetric epoxidation reactions.

Authors: Nathanael Weill; Christopher R Corbeil; Joris W De Schutter; Nicolas Moitessier
Journal: J Comput Chem Date: 2011-07-06 Impact factor: 3.376

5. Development of a True Transition State Force Field from Quantum Mechanical Calculations.

Authors: Ádám Madarász; Dénes Berta; Robert S Paton
Journal: J Chem Theory Comput Date: 2016-03-24 Impact factor: 6.006

6. Unravelling the mechanism of the asymmetric hydrogenation of acetophenone by [RuX2(diphosphine)(1,2-diamine)] catalysts.

Authors: Pavel A Dub; Neil J Henson; Richard L Martin; John C Gordon
Journal: J Am Chem Soc Date: 2014-02-21 Impact factor: 15.419

7. Development of a Q2MM Force Field for the Asymmetric Rhodium Catalyzed Hydrogenation of Enamides.

Authors: Patrick J Donoghue; Paul Helquist; Per-Ola Norrby; Olaf Wiest
Journal: J Chem Theory Comput Date: 2008-08 Impact factor: 6.006

8. AUTOMATED FORCE FIELD PARAMETERIZATION FOR NON-POLARIZABLE AND POLARIZABLE ATOMIC MODELS BASED ON AB INITIO TARGET DATA.

Authors: Lei Huang; Benoît Roux
Journal: J Chem Theory Comput Date: 2013-08-13 Impact factor: 6.006

9. Sampling of Transition States for Predicting Diastereoselectivity Using Automated Search Method-Aqueous Lanthanide-Catalyzed Mukaiyama Aldol Reaction.

Authors: Miho Hatanaka; Satoshi Maeda; Keiji Morokuma
Journal: J Chem Theory Comput Date: 2013-06-25 Impact factor: 6.006

10. Stereoselectivity in Asymmetric Catalysis: The Case of Ruthenium-Catalyzed Ketone Hydrogenation.

Authors: Elaine Limé; Michelle D Lundholm; Aaron Forbes; Olaf Wiest; Paul Helquist; Per-Ola Norrby
Journal: J Chem Theory Comput Date: 2014-06-10 Impact factor: 6.006

15 in total

Review 1. Expanding the medicinal chemistry synthetic toolbox.

Authors: Jonas Boström; Dean G Brown; Robert J Young; György M Keserü
Journal: Nat Rev Drug Discov Date: 2018-08-24 Impact factor: 84.694

2. Hydrogenation catalyst generates cyclic peptide stereocentres in sequence.

Authors: Diane N Le; Eric Hansen; Hasan A Khan; Byoungmoo Kim; Olaf Wiest; Vy M Dong
Journal: Nat Chem Date: 2018-07-30 Impact factor: 24.427

3. Application of Q2MM to predictions in stereoselective synthesis.

Authors: Anthony R Rosales; Taylor R Quinn; Jessica Wahlers; Anna Tomberg; Xin Zhang; Paul Helquist; Olaf Wiest; Per-Ola Norrby
Journal: Chem Commun (Camb) Date: 2018-07-24 Impact factor: 6.222

4. Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction.

Authors: Elliot H E Farrar; Matthew N Grayson
Journal: Chem Sci Date: 2022-06-14 Impact factor: 9.969