Literature DB >> 35359246

Comparison of logP and logD correction models trained with public and proprietary data sets.

Ignacio Aliagas1, Alberto Gobbi2, Man-Ling Lee2, Benjamin D Sellers2.   

Abstract

In drug discovery, partition and distribution coefficients, logP and logD for octanol/water, are widely used as metrics of the lipophilicity of molecules, which in turn have a strong influence on the bioactivity and bioavailability of potential drugs. There are a variety of established methods, mostly fragment or atom-based, to calculate logP while logD prediction generally relies on calculated logP and pKa for the estimation of neutral and ionized populations at a given pH. Algorithms such as ClogP have limitations generally leading to systematic errors for chemically related molecules while pKa estimation is generally more difficult due to the interplay of electronic, inductive and conjugation effects for ionizable moieties. We propose an integrated machine learning QSAR modeling approach to predict logD by training the model with experimental data while using ClogP and pKa predicted by commercial software as model descriptors. By optimizing the loss function for the ClogD calculated by the software, we build a correction model that incorporates both descriptors from the software and available experimental logD data. Additionally, we calculate logP from the logD model using the software predicted pKa's. Here, we have trained models using publicly or commercial available logD data to show that this approach can improve on commercial software predictions of lipophilicity. When applied to other logD data sets, this approach extends the domain of applicability of logD and logP predictions over commercial software. Performance of these models favorably compare with models built with a larger set of proprietary logD data.
© 2022. The Author(s), under exclusive licence to Springer Nature Switzerland AG.

Entities:  

Keywords:  BioByte; ChEMBL; ClogD; ClogP; Distribution coefficient; LogD; LogP; Machine learning; Partition coefficient; QSAR models; pKa

Mesh:

Substances:

Year:  2022        PMID: 35359246     DOI: 10.1007/s10822-022-00450-9

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  28 in total

1.  Novel methods for the prediction of logP, pK(a), and logD.

Authors:  Li Xing; Robert C Glen
Journal:  J Chem Inf Comput Sci       Date:  2002 Jul-Aug

2.  A comparison of physiochemical property profiles of development and marketed oral drugs.

Authors:  Mark C Wenlock; Rupert P Austin; Patrick Barton; Andrew M Davis; Paul D Leeson
Journal:  J Med Chem       Date:  2003-03-27       Impact factor: 7.446

3.  Dependence of hydrophobicity of apolar molecules on their molecular volume.

Authors:  A Leo; C Hansch; P Y Jow
Journal:  J Med Chem       Date:  1976-05       Impact factor: 7.446

Review 4.  Alternative measures of lipophilicity: from octanol-water partitioning to IAM retention.

Authors:  Costas Giaginis; Anna Tsantili-Kakoulidou
Journal:  J Pharm Sci       Date:  2008-08       Impact factor: 3.534

5.  Lipophilicity in drug discovery.

Authors:  Michael J Waring
Journal:  Expert Opin Drug Discov       Date:  2010-03       Impact factor: 6.098

6.  COSMOsar3D: molecular field analysis based on local COSMO σ-profiles.

Authors:  Andreas Klamt; Michael Thormann; Karin Wichmann; Paolo Tosco
Journal:  J Chem Inf Model       Date:  2012-07-26       Impact factor: 4.956

7.  Assessing the accuracy of octanol-water partition coefficient predictions in the SAMPL6 Part II log P Challenge.

Authors:  Mehtap Işık; Teresa Danielle Bergazin; Thomas Fox; Andrea Rizzi; John D Chodera; David L Mobley
Journal:  J Comput Aided Mol Des       Date:  2020-02-27       Impact factor: 3.686

8.  Effects of different buffer species on partition coefficients of drugs used in quantitative structure-activity relationships.

Authors:  P H Wang; E J Lien
Journal:  J Pharm Sci       Date:  1980-06       Impact factor: 3.534

Review 9.  Partitioning and lipophilicity in quantitative structure-activity relationships.

Authors:  J C Dearden
Journal:  Environ Health Perspect       Date:  1985-09       Impact factor: 9.031

10.  Open-source QSAR models for pKa prediction using multiple machine learning approaches.

Authors:  Kamel Mansouri; Neal F Cariello; Alexandru Korotcov; Valery Tkachenko; Chris M Grulke; Catherine S Sprankle; David Allen; Warren M Casey; Nicole C Kleinstreuer; Antony J Williams
Journal:  J Cheminform       Date:  2019-09-18       Impact factor: 5.514

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.