Literature DB >> 19937825

Large-scale evaluation of log P predictors: local corrections may compensate insufficient accuracy and need of experimentally testing every other compound.

Igor V Tetko1, Gennadiy I Poda, Claude Ostermann, Raimund Mannhold.   

Abstract

A large variety of log P calculation methods failed to produce sufficient accuracy in log P prediction for two in-house datasets of more than 96000 compounds contrary to their significantly better performances on public datasets. The minimum Root Mean Squared Error (RMSE) of 1.02 and 0.65 were calculated for the Pfizer and Nycomed datasets, respectively, in the 'out-of-box' implementation. Importantly, the use of local corrections (LC) implemented in the ALOGPS program based on experimental in-house log P data significantly reduced the RMSE to 0.59 and 0.48 for the Pfizer and Nycomed datasets, respectively, instantly without retraining the model. Moreover, more than 60% of molecules predicted with the highest confidence in each set had a mean absolute error (MAE) less than 0.33 log units that is only ca. 10% higher than the estimated variation in experimental log P measurements for the Pfizer dataset. Therefore, following this retrospective analysis, we suggest that the use of the predicted log P values with high confidence may eliminate the need of experimentally testing every other compound. This strategy could reduce the cost of measurements for pharmaceutical companies by a factor of 2, increase the confidence in prediction at the analog design stage of drug discovery programs, and could be extended to other ADMET properties.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19937825     DOI: 10.1002/cbdv.200900075

Source DB:  PubMed          Journal:  Chem Biodivers        ISSN: 1612-1872            Impact factor:   2.408


  5 in total

1.  Online chemical modeling environment (OCHEM): web platform for data storage, model development and publishing of chemical information.

Authors:  Iurii Sushko; Sergii Novotarskyi; Robert Körner; Anil Kumar Pandey; Matthias Rupp; Wolfram Teetz; Stefan Brandmaier; Ahmed Abdelaziz; Volodymyr V Prokopenko; Vsevolod Y Tanchuk; Roberto Todeschini; Alexandre Varnek; Gilles Marcou; Peter Ertl; Vladimir Potemkin; Maria Grishina; Johann Gasteiger; Christof Schwab; Igor I Baskin; Vladimir A Palyulin; Eugene V Radchenko; William J Welsh; Vladyslav Kholodovych; Dmitriy Chekmarev; Artem Cherkasov; Joao Aires-de-Sousa; Qing-You Zhang; Andreas Bender; Florian Nigsch; Luc Patiny; Antony Williams; Valery Tkachenko; Igor V Tetko
Journal:  J Comput Aided Mol Des       Date:  2011-06-10       Impact factor: 3.686

2.  Robustness in experimental design: A study on the reliability of selection approaches.

Authors:  Stefan Brandmaier; Igor V Tetko
Journal:  Comput Struct Biotechnol J       Date:  2013-06-30       Impact factor: 7.271

3.  Nucleolipids of Canonical Purine ß-d-Ribo-Nucleosides: Synthesis and Cytostatic/Cytotoxic Activities Toward Human and Rat Glioblastoma Cells.

Authors:  Christine Knies; Katharina Hammerbacher; Gabriel A Bonaterra; Ralf Kinscherf; Helmut Rosemeyer
Journal:  ChemistryOpen       Date:  2015-12-20       Impact factor: 2.911

4.  Highly Hydrophilic and Lipophilic Derivatives of Bile Salts.

Authors:  M Pilar Vázquez-Tato; Julio A Seijas; Francisco Meijide; Francisco Fraga; Santiago de Frutos; Javier Miragaya; Juan Ventura Trillo; Aida Jover; Victor H Soto; José Vázquez Tato
Journal:  Int J Mol Sci       Date:  2021-06-22       Impact factor: 5.923

5.  The development of models to predict melting and pyrolysis point data associated with several hundred thousand compounds mined from PATENTS.

Authors:  Igor V Tetko; Daniel M Lowe; Antony J Williams
Journal:  J Cheminform       Date:  2016-01-22       Impact factor: 5.514

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.