Literature DB >> 17125199

Statistical modeling of a ligand knowledge base.

Ralph A Mansson1, Alan H Welsh, Natalie Fey, A Guy Orpen.   

Abstract

A range of different statistical models has been fitted to experimental data for the Tolman electronic parameter (TEP) based on a large set of calculated descriptors in a prototype ligand knowledge base (LKB) of phosphorus(III) donor ligands. The models have been fitted by ordinary least squares using subsets of descriptors, principal component regression, and partial least squares which use variables derived from the complete set of descriptors, least angle regression, and the least absolute shrinkage and selection operator. None of these methods is robust against outliers, so we also applied a robust estimation procedure to the linear regression model. Criteria for model evaluation and comparison have been discussed, highlighting the importance of resampling methods for assessing the robustness of models and the scope for making predictions in chemically intuitive models. For the ligands covered by this LKB, ordinary least squares models of descriptor subsets provide a good representation of the data, while partial least squares, principal component regression, and least angle regression models are less suitable for our dual aims of prediction and interpretation. A linear regression model with robustly fitted parameters achieves the best model performance over all classes of models fitted to TEP data, and the weightings assigned to ligands during the robust estimation procedure are chemically intuitive. The increased model complexity when compared to the ordinary least squares linear model is justified by the reduced influence of individual ligands on the model parameters and predictions of new ligands. Robust linear regression models therefore represent the best compromise for achieving statistical robustness in simple, chemically meaningful models.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 17125199     DOI: 10.1021/ci600212t

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  3 in total

1.  Parameterization of phosphine ligands reveals mechanistic pathways and predicts reaction outcomes.

Authors:  Zachary L Niemeyer; Anat Milo; David P Hickey; Matthew S Sigman
Journal:  Nat Chem       Date:  2016-05-16       Impact factor: 24.427

2.  Iterative Supervised Principal Component Analysis Driven Ligand Design for Regioselective Ti-Catalyzed Pyrrole Synthesis.

Authors:  Xin Yi See; Xuelan Wen; T Alexander Wheeler; Channing K Klein; Jason D Goodpaster; Benjamin R Reiner; Ian A Tonks
Journal:  ACS Catal       Date:  2020-11-05       Impact factor: 13.084

3.  Expansion of the Ligand Knowledge Base for Chelating P,P-Donor Ligands (LKB-PP).

Authors:  Jesús Jover; Natalie Fey; Jeremy N Harvey; Guy C Lloyd-Jones; A Guy Orpen; Gareth J J Owen-Smith; Paul Murray; David R J Hose; Robert Osborne; Mark Purdie
Journal:  Organometallics       Date:  2012-07-30       Impact factor: 3.876

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.