Literature DB >> 33671348

Enhancing Carbon Acid pKa Prediction by Augmentation of Sparse Experimental Datasets with Accurate AIBL (QM) Derived Values.

Jeffrey Plante1, Beth A Caine2, Paul L A Popelier2,3.   

Abstract

The prediction of the aqueous pKa of carbon acids by Quantitative Structure Property Relationship or cheminformatics-based methods is a rather arduous problem. Primarily, there are insufficient high-quality experimental data points measured in homogeneous conditions to allow for a good global model to be generated. In our computationally efficient pKa prediction method, we generate an atom-type feature vector, called a distance spectrum, from the assigned ionisation atom, and learn coefficients for those atom-types that show the impact each atom-type has on the pKa of the ionisable centre. In the current work, we augment our dataset with pKa values from a series of high performing local models derived from the Ab Initio Bond Lengths method (AIBL). We find that, in distilling the knowledge available from multiple models into one general model, the prediction error for an external test set is reduced compared to that using literature experimental data alone.

Entities:  

Keywords:  ab initio; bond length; carbon acid; pKa prediction

Mesh:

Substances:

Year:  2021        PMID: 33671348      PMCID: PMC7922142          DOI: 10.3390/molecules26041048

Source DB:  PubMed          Journal:  Molecules        ISSN: 1420-3049            Impact factor:   4.411


  21 in total

1.  Novel methods for the prediction of logP, pK(a), and logD.

Authors:  Li Xing; Robert C Glen
Journal:  J Chem Inf Comput Sci       Date:  2002 Jul-Aug

2.  Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies.

Authors:  Katja Hansen; Grégoire Montavon; Franziska Biegler; Siamac Fazli; Matthias Rupp; Matthias Scheffler; O Anatole von Lilienfeld; Alexandre Tkatchenko; Klaus-Robert Müller
Journal:  J Chem Theory Comput       Date:  2013-07-30       Impact factor: 6.006

3.  Surrogate data--a secure way to share corporate data.

Authors:  Igor V Tetko; Ruben Abagyan; Tudor I Oprea
Journal:  J Comput Aided Mol Des       Date:  2005-11-03       Impact factor: 3.686

4.  An expert system to predict the forced degradation of organic molecules.

Authors:  Alexis D C Parenty; William G Button; Martin A Ott
Journal:  Mol Pharm       Date:  2013-07-18       Impact factor: 4.939

5.  Linear free-energy relationships between a single gas-phase ab initio equilibrium bond length and experimental pKa values in aqueous solution.

Authors:  Ibon Alkorta; Paul L A Popelier
Journal:  Chemphyschem       Date:  2014-11-07       Impact factor: 3.102

6.  pKa prediction from an ab initio bond length: part 3--benzoic acids and anilines.

Authors:  A P Harding; P L A Popelier
Journal:  Phys Chem Chem Phys       Date:  2011-05-13       Impact factor: 3.676

7.  Comparison of nine programs predicting pK(a) values of pharmaceutical substances.

Authors:  Chenzhong Liao; Marc C Nicklaus
Journal:  J Chem Inf Model       Date:  2009-12       Impact factor: 4.956

8.  ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost.

Authors:  J S Smith; O Isayev; A E Roitberg
Journal:  Chem Sci       Date:  2017-02-08       Impact factor: 9.825

9.  JPlogP: an improved logP predictor trained using predicted data.

Authors:  Jeffrey Plante; Stephane Werner
Journal:  J Cheminform       Date:  2018-12-14       Impact factor: 5.514

10.  Accurate and transferable multitask prediction of chemical properties with an atoms-in-molecules neural network.

Authors:  Roman Zubatyuk; Justin S Smith; Jerzy Leszczynski; Olexandr Isayev
Journal:  Sci Adv       Date:  2019-08-09       Impact factor: 14.136

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.