Literature DB >> 34327040

Iterative Supervised Principal Component Analysis Driven Ligand Design for Regioselective Ti-Catalyzed Pyrrole Synthesis.

Xin Yi See1, Xuelan Wen1, T Alexander Wheeler1, Channing K Klein1, Jason D Goodpaster1, Benjamin R Reiner1, Ian A Tonks1.   

Abstract

The rational design of catalysts remains a challenging endeavor within the broader chemical community owing to the myriad variables that can affect key bond-forming events. Designing selective catalysts for any reaction requires an efficient strategy for discovering predictive structure-activity relationships. Herein, we describe the use of iterative supervised principal component analysis (ISPCA) in de novo catalyst design. The regioselective synthesis of 2,5-dimethyl-1,3,4-triphenyl-1H-pyrrole (C) via a Ti-catalyzed formal [2 + 2 +1] cycloaddition of phenylpropyne and azobenzene was targeted as a proof of principle. The initial reaction conditions led to an unselective mixture of all possible pyrrole regioisomers. ISPCA was conducted on a training set of catalysts, and their performance was regressed against the scores from the top three principal components. Component loadings from this PCA space and k-means clustering were used to inform the design of new test catalysts. The selectivity of a prospective test set was predicted in silico using the ISPCA model, and optimal candidates were synthesized and tested experimentally. This data-driven predictive-modeling workflow was iterated, and after only three generations the catalytic selectivity was improved from 0.5 (statistical mixture of products) to over 11 (>90% C) by incorporating 2,6-dimethyl-4-(pyrrolidin-1-yl)pyridine as a ligand. The origin of catalyst selectivity was probed by examining ISPCA variable loadings in combination with DFT modeling, revealing that ligand lability plays an important role in selectivity. A parallel catalyst search using multivariate linear regression (MLR), a popular approach in catalysis informatics, was also conducted in order to compare these strategies in a hypothetical catalyst scouting campaign. ISPCA appears to be more robust and predictive than MLR when sparse training sets are used that are representative of the data available during the early search for an optimal catalyst. The successful development of a highly selective catalyst without resorting to long, stochastic screening processes demonstrates the inherent power of ISPCA in de novo catalyst design and should motivate the general use of ISPCA in reaction development.

Entities:  

Keywords:  DFT; catalyst prediction; iterative supervised principal component analysis; pyrrole; selectivity; titanium

Year:  2020        PMID: 34327040      PMCID: PMC8318334          DOI: 10.1021/acscatal.0c03939

Source DB:  PubMed          Journal:  ACS Catal            Impact factor:   13.084


  46 in total

1.  Computing organic stereoselectivity - from concepts to quantitative calculations and predictions.

Authors:  Qian Peng; Fernanda Duarte; Robert S Paton
Journal:  Chem Soc Rev       Date:  2016-11-07       Impact factor: 54.564

2.  Comment on "Predicting reaction performance in C-N cross-coupling using machine learning".

Authors:  Kangway V Chuang; Michael J Keiser
Journal:  Science       Date:  2018-11-16       Impact factor: 47.728

3.  Pursuit of Noncovalent Interactions for Strategic Site-Selective Catalysis.

Authors:  F Dean Toste; Matthew S Sigman; Scott J Miller
Journal:  Acc Chem Res       Date:  2017-03-21       Impact factor: 22.384

4.  Predicting reaction performance in C-N cross-coupling using machine learning.

Authors:  Derek T Ahneman; Jesús G Estrada; Shishi Lin; Spencer D Dreher; Abigail G Doyle
Journal:  Science       Date:  2018-02-15       Impact factor: 47.728

5.  Searching for Hidden Descriptors in the Metal-Ligand Bond through Statistical Analysis of Density Functional Theory (DFT) Results.

Authors:  Oier Lakuntza; Maria Besora; Feliu Maseras
Journal:  Inorg Chem       Date:  2018-11-16       Impact factor: 5.165

6.  Deoxyfluorination with Sulfonyl Fluorides: Navigating Reaction Space with Machine Learning.

Authors:  Matthew K Nielsen; Derek T Ahneman; Orestes Riera; Abigail G Doyle
Journal:  J Am Chem Soc       Date:  2018-04-03       Impact factor: 15.419

Review 7.  Synthetic approaches to the lamellarins--a comprehensive review.

Authors:  Dennis Imbri; Johannes Tauber; Till Opatz
Journal:  Mar Drugs       Date:  2014-12-18       Impact factor: 5.118

8.  Expansion of the Ligand Knowledge Base for Chelating P,P-Donor Ligands (LKB-PP).

Authors:  Jesús Jover; Natalie Fey; Jeremy N Harvey; Guy C Lloyd-Jones; A Guy Orpen; Gareth J J Owen-Smith; Paul Murray; David R J Hose; Robert Osborne; Mark Purdie
Journal:  Organometallics       Date:  2012-07-30       Impact factor: 3.876

Review 9.  Evolving Concept of Activity Cliffs.

Authors:  Dagmar Stumpfe; Huabin Hu; Jürgen Bajorath
Journal:  ACS Omega       Date:  2019-08-26

10.  Predictive Multivariate Linear Regression Analysis Guides Successful Catalytic Enantioselective Minisci Reactions of Diazines.

Authors:  Jolene P Reid; Rupert S J Proctor; Matthew S Sigman; Robert J Phipps
Journal:  J Am Chem Soc       Date:  2019-11-21       Impact factor: 15.419

View more
  1 in total

1.  Ti-Catalyzed and -Mediated Oxidative Amination Reactions.

Authors:  Ian A Tonks
Journal:  Acc Chem Res       Date:  2021-08-22       Impact factor: 24.466

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.