Literature DB >> 24882917

Expansion of the Ligand Knowledge Base for Chelating P,P-Donor Ligands (LKB-PP).

Jesús Jover1, Natalie Fey1, Jeremy N Harvey1, Guy C Lloyd-Jones1, A Guy Orpen1, Gareth J J Owen-Smith1, Paul Murray2, David R J Hose2, Robert Osborne2, Mark Purdie2.   

Abstract

We have expanded the ligand knowledge base for bidentate P,P- and P,N-donor ligands (LKB-PP, Organometallics2008, 31, 1372-1383) by 208 ligands and introduced an additional steric descriptor (nHe8). This expanded knowledge base now captures information on 334 bidentate ligands and has been processed with principal component analysis (PCA) of the descriptors to produce a detailed map of bidentate ligand space, which better captures ligand variation and has been used for the analysis of ligand properties.

Entities:  

Year:  2012        PMID: 24882917      PMCID: PMC4034078          DOI: 10.1021/om300312t

Source DB:  PubMed          Journal:  Organometallics        ISSN: 0276-7333            Impact factor:   3.876


Chelating ligands with phosphorus and nitrogen donor atoms are important in synthetic organometallic chemistry, as they provide a number of features (donor, substituents, backbone) that can be varied in order to modify transition-metal complexes, allowing the fine tuning of complex properties.[1,2] In homogeneous catalysis, such modifications can affect both the rate and selectivity of a reaction, and the quantitative prediction of ligand effects is of particular interest.[2] We have recently reported the development of ligand knowledge bases (LKBs) for a number of different ligand classes, including monodentate ligands with P-[3,4] and C-donor[5] atoms and bidentate ligands with P,P- and P,N-donors.[6] These knowledge bases rely on parameters or descriptors (the terms are used interchangeably here), calculated with a standard DFT approach, which capture the properties of ligands in a range of compounds. A number of applications for such databases to ligand-driven catalyst optimization can be envisaged, ranging from the illustration of ligand similarities in so-called maps of ligand space[7] to regression models on observed response data.[3,4,6] Comprehensive sampling in screening experiments can be facilitated by using such ligand property maps,[4] derived by principal component analysis (PCA) of the descriptors, and the projection of screening results onto maps can identify “hot spots” of activity due to ligand similarity.[4] More detailed data analysis in terms of descriptors can help to interpret which ligand properties are contributing to observed behavior but can also lead to regression models suitable for making predictions about other ligands.[3,4,6] Training data for such models can be derived from a range of sources, e.g. yield, selectivity, reaction rates,[8] spectroscopic data, and structural features (XRD), as well as computed properties and reactivities.[9] Our initial LKB-PP[6] accounted for 108 different P,P- and P,N-ligands, capturing some of the trends in ligand electronic and steric features. However, some ligand types were not sampled well or have yet to be accessed experimentally, including, for example, bidentate ligands with electron-withdrawing substituents. Uneven sampling can hamper both ligand selection/test set design and data analysis, making it harder to identify outliers and, if for example the availability, stability, and cost of ligands are of concern, select suitable alternatives. Furthermore, the steric parameter described,[6] He8_wedge, does not allow for structural changes in response to steric hindrance and is likely to overestimate the size of ligands with flexible backbones. We have recently demonstrated how substituent and backbone effects on the properties of bidentate P,P-donor ligands can be mapped (LKB-PPscreen).[10] This led to the addition of 22 ligands to LKB-PP but also emphasized the need for further ligand variation in the database. Here we report the expansion of LKB-PP by a further 208 ligands, as well as the introduction of a new steric measure (nHe8), which better captures changes of the ligand bite angle in response to steric hindrance. The ligand descriptors have been processed to produce a map of bidentate ligand space for 334 ligands,[11] evaluating the properties of many ligands for the first time, and this expanded LKB-PP should be useful for ligand screening and subsequent data analysis, as well as prediction of ligand effects.

Ligands and Descriptors

This expansion of the LKB-PP considers a wider range of backbones and substituents, aimed at making the database more relevant to ligand design and experimental screening. Many of the ligands selected have been used and/or characterized experimentally, but for subsets of ligands a more systematic evaluation of backbone and substituent effects has also been undertaken, albeit not on the scale of other work reported, which used faster calculations at lower levels of theory.[10,12] Alkyl and aryl substituents are much more common for bidentate P,P-donor ligands than other substituents (OR/OAr, NR2/NAr2, Hal), and this continues to be reflected in the ligand set considered here. Details of all ligands, including their unique numbers as used in plots and data tables, have been included in the Supporting Information as Table S1. The descriptors used have been selected to capture different coordination environments as reported previously.[6] In brief, these have been derived from calculations on the isolated ligand in both free and chelating conformations, the tetrahedral zinc complex [ZnCl2{LL}] (LL represents bidentate ligands with P,P- and P,N-donor atoms) and the square-planar palladium(II) complex [PdCl2{LL}]. Steric descriptors, discussed in greater detail below, have been determined after optimization of [He8·{LL}] systems. In addition, ligands have been truncated as described previously[6] to calculate frontier molecular orbital energies and proton affinities for each donor atom (see Table S2 in the Supporting Information for details). Scheme 1 illustrates the complexes considered. A range of descriptors have been extracted from these calculations, which are detailed in Table S3 of the Supporting Information and may be summarized as follows:
Scheme 1

Complexes and Ligand Truncation Used in LKB-PP

See Figure 1 for details of [He8·{LL}].

adduct binding energies for tetrahedral [ZnCl2{LL}] and square-planar [PdCl2{LL}] complexes ligand bite angles in the metal complexes structural parameters describing geometry changes of ligands upon complexation and the metal–ligand bond lengths, as well as the geometry of the metal fragments. metal fragment charges measures of steric bulk (He8_wedge and nHe8 steric descriptors) frontier molecular orbital energies and ligand proton affinities for each donor atom, derived from truncated ligand fragments (see Table S2 for truncations used)

Complexes and Ligand Truncation Used in LKB-PP

See Figure 1 for details of [He8·{LL}].
Figure 1

[He8·{LL}] system used for the calculation of steric descriptors.

Steric Descriptors

The steric descriptor developed for LKB-PP,[6] He8_wedge, is calculated as the interaction energy between a ligand in its chelating conformation (from optimization of a zinc complex, [ZnCl2{LL}]) and a rigid wedge of eight helium atoms, intended to simulate the steric bulk of other ligands in an octahedral coordination environment. Figure 1 illustrates the [He8·{LL}] system used, and further details of the calculation setup can be found in the Supporting Information. The He8_wedge descriptor tends to overestimate the size of some large but “compressible” ligands. This arises because the bite angle[13] adopted in the zinc complex, ∠P–Zn–P, correlates highly[6] with the rather large bite angles adopted in the absence of a coordinating metal, as reported by van Leeuwen et al.[14] [He8·{LL}] system used for the calculation of steric descriptors. A second steric descriptor, nHe8, derived from the same starting geometry (Figure 1) as He8_wedge, has therefore been introduced to LKB-PP to observe the flexible response of ligands to steric hindrance. Instead of completely freezing the positions of donor atoms when calculating the interaction energy between the ligand and the helium wedge, their position is relaxed, subject to the constraint that their distance to the dummy “metal” atom (X) is constant (X–D = 2.28 Å for P and 2.00 Å for N). This allows bite angle changes. While He8_wedge thus contains steric information about the ligand fixed at a large bite angle, adopted when there is limited steric hindrance on coordination, the nHe8 descriptor (which is more computationally demanding) allows for relaxation of the backbone and donor atom positions in response to a sterically more crowded environment and this is closer to measuring ligand size in octahedral complexes. In five cases of large and conformationally flexible ligands (Scheme 2), the nHe8 optimizations failed to converge and we have instead estimated the nHe8 parameters from a linear regression model (see the Supporting Information). The relationship between these two steric measures is illustrated in Figure 2.
Scheme 2

Ligands for Which nHe8 Descriptor Was Estimated

Figure 2

Comparison of steric parameters, He8_wedge and nHe8. (The line indicates He8_wedge = nHe8.)

Comparison of steric parameters, He8_wedge and nHe8. (The line indicates He8_wedge = nHe8.) Ligands have larger values for He8_wedge than for nHe8 because fewer constraints are used in the optimization. The difference between them becomes more pronounced for larger ligands (Figure 2). Consideration of both steric parameters can thus better capture the compromise between ligand size and flexibility implicit in different coordination environments.

Map of Ligand Space

The most intuitive and accessible display of multiple ligand parameters can be achieved by processing the data to produce a two- or three-dimensional representation of the database, essentially mapping ligand properties. This can be achieved conveniently by processing the data with principal component analysis of the correlation matrix.[15] This statistical approach derives linear combinations of the original descriptors, so-called principal components (PCs), which “distill” the information content of the original database into fewer variables. The detailed composition of PCs can be affected by changes in the ligand or descriptor set, as the approach is sensitive to outliers,[3] but some tentative chemical interpretation of principal components can be achieved where a large number of ligands has been considered,[4] making the expanded LKB-PP more amenable to such analysis. Figure 3 shows a plot of the first two principal components, which already captures 59% of the information content (variance) in the data set, considering only ligands with P,P-donors. Further PCs can be considered to improve this, and details of PC3 and PC4 are shown in Table 1, while further details of the analysis and plots with all ligands identified can be found in the Supporting Information (Table S4, Figure S2). Here we have focused on ligands with two phosphorus(III) donor atoms, but the Supporting Information also includes PCA results for the full set of bidentate ligands, including 24 ligands with P,N-donors (Table S5, Figure S3).
Figure 3

Principal component score plot (PC1 and PC2) for P,P-donor ligands, capturing 59% of the variation in the data set. See the text for a detailed discussion.

Table 1

Descriptor Loadings for PC1–PC4a

descriptorPC1PC2PC3PC4
cumulative contribn, %39.259.467.074.1
EHOMO_P10.1630.259–0.328 
EHOMO_P20.1610.265–0.324 
ELUMO_P1 0.152 0.585
ELUMO_P2 0.148 0.577
PAP10.1850.248–0.291 
PAP20.1840.255–0.284 
He8_wedge0.261  0.174
nHe80.278   
BE(Zn) 0.3180.223 
Zn–Cl 0.157  
∠P1–Zn–P20.251 0.287 
ΔP1–R(Zn)0.187 –0.169 
ΔP2–R(Zn)0.189   
ΔR–P1–R(Zn)–0.156   
ΔR–P2–R(Zn)–0.1480.149  
ΔZn–P1 –0.301–0.2260.229
ΔZn–P2 –0.316–0.1840.236
Q(Zn) –0.273–0.321 
BE(Pd)–0.1850.259  
Pd–Cl 0.303  
∠P1–Pd–P20.238 0.347 
ΔP1–R(Pd)0.250   
ΔP2–R(Pd)0.246   
ΔR–P1–R(Pd)–0.234  0.191
ΔR–P2–R(Pd)–0.235 –0.1640.174
ΔPd–P10.278   
ΔPd–P20.277   
Q(Pd)0.195–0.2350.206 

Values <|0.145| are not shown; see Table S3 for full data and Table S1 for descriptor definitions (Supporting Information).

Principal component score plot (PC1 and PC2) for P,P-donor ligands, capturing 59% of the variation in the data set. See the text for a detailed discussion. Values <|0.145| are not shown; see Table S3 for full data and Table S1 for descriptor definitions (Supporting Information). Inspection of the ligand map in Figure 3 suggests that ligands cluster according to their substituents, generating a chemically intuitive representation of changes in ligand properties. Substituent electronic effects are captured mostly by PC2, with halide and perfluorinated alkyl substituents showing negative values of PC2, whereas aryl and alkyl groups give rise to more positive ligand scores on this axis. PC1 shows high loadings for steric descriptors and ligand bite angles, suggesting that backbone and ligand size effects are captured in this direction. This can be illustrated further by color-coding the PC plot according to the number of atoms in the ligand backbone (Figure 4). Short backbones occur mainly at negative PC1, whereas positive values along this axis are clearly associated with five atoms in the backbone. Longer backbones (six and seven atoms) appear scattered throughout ligand space, which is likely due to their increased flexibility, allowing greater responsiveness to both coordination environment and steric hindrance. Inspection of the descriptor loadings shown in Table 1 can be used to further substantiate these interpretations (see the Supporting Information for details).
Figure 4

Principal component score plot (PC1 and PC2) for P,P-donor ligands colored according to number of atoms in the backbone.

Principal component score plot (PC1 and PC2) for P,P-donor ligands colored according to number of atoms in the backbone. This expanded ligand set captures a wider range of ligand properties, and many ligands have been characterized within a coherent data framework for the first time. Nevertheless, there are still areas of ligand space with few examples. Ligands with positive PC1 and negative PC2 scores, i.e. electron-withdrawing substituents and four or five atoms in the backbone, remain undersampled, further supporting the observation that variation in bidentate ligand properties is currently achieved mainly by changes to the ligand backbone and not to substituents. A more complete exploration of different substituents, illustrated in related LKBs,[4,10] is likely to provide access to interesting chemical properties and increase ligand diversity, making such ligands a potential target for novel ligand designs.[7]

Summary and Outlook

The ligand knowledge base for bidentate P,P- and P,N-donor ligands has been expanded to capture the steric and electronic properties of 334 ligands. In addition, a new steric parameter has been introduced to this LKB-PP, capturing steric effects when the ligand is able to respond to steric hindrance by reducing its bite angle. A map of bidentate ligand space has been derived from principal component analysis of the resulting database. This map illustrates that sampling of ligand space has been improved but also highlights that electron-withdrawing ligands with medium-sized backbones (four to five atoms) are not represented well in comparison with monodentate analogues[4] and so might be interesting targets for novel ligand designs. While we have demonstrated possible applications of LKBs for data projection and regression models with both an earlier version of this database[6] and for monodentate P-donor ligands (LKB-P),[4] the dearth of suitable experimental data restricts our scope for similar analysis in this case. However, this expanded LKB-PP quantifies the properties of many additional ligands and can be used to select ligands for catalyst screening and optimization, either by visual inspection or using experimental design (DoE) approaches. In addition, ligand effects on the resulting screening data can be interpreted in detail where suitable ligand descriptors are available, and prediction for novel/untested ligands can be attempted from such models. We have recently demonstrated, using proximity on such a ligand map as an indicator of catalytic potential, how novel ligand designs can be assessed and refined with this approach.[7] We are thus optimistic that capturing ligand properties on a quantitative scale can contribute to the design and discovery of improved catalysts, as well as the optimization of industrial processes, both vital steps toward the rational and in silico design of better catalysts.

Computational Details

Density functional theory calculations were performed in Jaguar[16] and used the standard Becke–Perdew (BP86) density functional.[17] The triple-ζ form of the standard Los Alamos ECP basis set (LACV3P) as implemented in Jaguar was used on the transition-metal atoms, employing the 6-31G* basis for all other atoms. Molecular mechanics (MM) conformational searches used the default MMX force field in PCModel.[18] Principal component analyses were performed in SIMCA-P+.[19] Details of nondefault program settings have been included in the Supporting Information.
  13 in total

1.  Stable fluorophosphines: predicted and realized ligands for catalysis.

Authors:  Natalie Fey; Michael Garland; Jonathan P Hopewell; Claire L McMullin; Sergio Mastroianni; A Guy Orpen; Paul G Pringle
Journal:  Angew Chem Int Ed Engl       Date:  2011-11-11       Impact factor: 15.336

2.  Development of a ligand knowledge base, part 1: computational descriptors for phosphorus donor ligands.

Authors:  Natalie Fey; Athanassios C Tsipis; Stephanie E Harris; Jeremy N Harvey; A Guy Orpen; Ralph A Mansson
Journal:  Chemistry       Date:  2005-12-16       Impact factor: 5.236

3.  Statistical modeling of a ligand knowledge base.

Authors:  Ralph A Mansson; Alan H Welsh; Natalie Fey; A Guy Orpen
Journal:  J Chem Inf Model       Date:  2006 Nov-Dec       Impact factor: 4.956

4.  Counterintuitive kinetics in Tsuji-Trost allylation: ion-pair partitioning and implications for asymmetric catalysis.

Authors:  Louise A Evans; Natalie Fey; Jeremy N Harvey; David Hose; Guy C Lloyd-Jones; Paul Murray; A Guy Orpen; Robert Osborne; Gareth J J Owen-Smith; Mark Purdie
Journal:  J Am Chem Soc       Date:  2008-10-08       Impact factor: 15.419

5.  A ligand knowledge base for carbenes (LKB-C): maps of ligand space.

Authors:  Natalie Fey; Mairi F Haddow; Jeremy N Harvey; Claire L McMullin; A Guy Orpen
Journal:  Dalton Trans       Date:  2009-08-14       Impact factor: 4.390

6.  Density-functional approximation for the correlation energy of the inhomogeneous electron gas.

Authors: 
Journal:  Phys Rev B Condens Matter       Date:  1986-06-15

7.  Density-functional exchange-energy approximation with correct asymptotic behavior.

Authors: 
Journal:  Phys Rev A Gen Phys       Date:  1988-09-15

8.  Rational design of diphosphorus ligands--a route to superior catalysts.

Authors:  Jason A Gillespie; Deborah L Dodds; Paul C J Kamer
Journal:  Dalton Trans       Date:  2010-01-04       Impact factor: 4.390

9.  The contribution of computational studies to organometallic catalysis: descriptors, mechanisms and models.

Authors:  Natalie Fey
Journal:  Dalton Trans       Date:  2009-11-04       Impact factor: 4.390

10.  Bite angle effects of diphosphines in C-C and C-X bond forming cross coupling reactions.

Authors:  Mandy-Nicole Birkholz; Zoraida Freixa; Piet W N M van Leeuwen
Journal:  Chem Soc Rev       Date:  2009-02-10       Impact factor: 54.564

View more
  6 in total

1.  Iterative Supervised Principal Component Analysis Driven Ligand Design for Regioselective Ti-Catalyzed Pyrrole Synthesis.

Authors:  Xin Yi See; Xuelan Wen; T Alexander Wheeler; Channing K Klein; Jason D Goodpaster; Benjamin R Reiner; Ian A Tonks
Journal:  ACS Catal       Date:  2020-11-05       Impact factor: 13.084

2.  Lost in chemical space? Maps to support organometallic catalysis.

Authors:  Natalie Fey
Journal:  Chem Cent J       Date:  2015-06-18       Impact factor: 4.215

3.  On the relation between carbonyl stretching frequencies and the donor power of chelating diphosphines in nickel dicarbonyl complexes.

Authors:  Marco Fusè; Isabella Rimoldi; Edoardo Cesarotti; Sergio Rampino; Vincenzo Barone
Journal:  Phys Chem Chem Phys       Date:  2017-03-29       Impact factor: 3.676

4.  A reactivity model for oxidative addition to palladium enables quantitative predictions for catalytic cross-coupling reactions.

Authors:  Jingru Lu; Sofia Donnecke; Irina Paci; David C Leitch
Journal:  Chem Sci       Date:  2022-02-28       Impact factor: 9.825

5.  In Situ Studies of Arylboronic Acids/Esters and R3SiCF3 Reagents: Kinetics, Speciation, and Dysfunction at the Carbanion-Ate Interface.

Authors:  Andrés García-Domínguez; Andrew G Leach; Guy C Lloyd-Jones
Journal:  Acc Chem Res       Date:  2022-04-18       Impact factor: 24.466

6.  Quantum-mechanical transition-state model combined with machine learning provides catalyst design features for selective Cr olefin oligomerization.

Authors:  Steven M Maley; Doo-Hyun Kwon; Nick Rollins; Johnathan C Stanley; Orson L Sydora; Steven M Bischof; Daniel H Ess
Journal:  Chem Sci       Date:  2020-08-21       Impact factor: 9.825

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.