Literature DB >> 20536191

Assessing synthetic accessibility of chemical compounds using machine learning methods.

Yevgeniy Podolyan1, Michael A Walters, George Karypis.   

Abstract

With de novo rational drug design, scientists can rapidly generate a very large number of potentially biologically active probes. However, many of them may be synthetically infeasible and, therefore, of limited value to drug developers. On the other hand, most of the tools for synthetic accessibility evaluation are very slow and can process only a few molecules per minute. In this study, we present two approaches to quickly predict the synthetic accessibility of chemical compounds by utilizing support vector machines operating on molecular descriptors. The first approach, RSsvm, is designed to identify the compounds that can be synthesized using a specific set of reactions and starting materials and builds its model by training on the compounds identified as synthetically accessible or not by retrosynthetic analysis. The second approach, DRsvm, is designed to provide a more general assessment of synthetic accessibility that is not tied to any set of reactions or starting materials. The training set compounds for this approach are selected from a diverse library based on the number of other similar compounds within the same library. Both approaches have been shown to perform very well in their corresponding areas of applicability with the RSsvm achieving a receiver operator characteristic score of 0.952 in cross-validation experiments and the DRsvm achieving a score of 0.888 on an independent set of compounds. Our implementations can successfully process thousands of compounds per minute.

Mesh:

Substances:

Year:  2010        PMID: 20536191     DOI: 10.1021/ci900301v

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  7 in total

1.  SCRIPDB: a portal for easy access to syntheses, chemicals and reactions in patents.

Authors:  Abraham Heifets; Igor Jurisica
Journal:  Nucleic Acids Res       Date:  2011-11-08       Impact factor: 16.971

2.  Neural Networks for the Prediction of Organic Chemistry Reactions.

Authors:  Jennifer N Wei; David Duvenaud; Alán Aspuru-Guzik
Journal:  ACS Cent Sci       Date:  2016-10-14       Impact factor: 14.553

3.  Nonpher: computational method for design of hard-to-synthesize structures.

Authors:  Milan Voršilák; Daniel Svozil
Journal:  J Cheminform       Date:  2017-03-20       Impact factor: 5.514

Review 4.  In silico Strategies to Support Fragment-to-Lead Optimization in Drug Discovery.

Authors:  Lauro Ribeiro de Souza Neto; José Teófilo Moreira-Filho; Bruno Junior Neves; Rocío Lucía Beatriz Riveros Maidana; Ana Carolina Ramos Guimarães; Nicholas Furnham; Carolina Horta Andrade; Floriano Paes Silva
Journal:  Front Chem       Date:  2020-02-18       Impact factor: 5.221

Review 5.  Miscellaneous Topics in Computer-Aided Drug Design: Synthetic Accessibility and GPU Computing, and Other Topics.

Authors:  Yoshifumi Fukunishi; Tadaaki Mashimo; Kiyotaka Misoo; Yoshinori Wakabayashi; Toshiaki Miyaki; Seiji Ohta; Mayu Nakamura; Kazuyoshi Ikeda
Journal:  Curr Pharm Des       Date:  2016       Impact factor: 3.116

6.  SAVI, in silico generation of billions of easily synthesizable compounds through expert-system type rules.

Authors:  Hitesh Patel; Wolf-Dietrich Ihlenfeldt; Philip N Judson; Yurii S Moroz; Yuri Pevzner; Megan L Peach; Victorien Delannée; Nadya I Tarasova; Marc C Nicklaus
Journal:  Sci Data       Date:  2020-11-11       Impact factor: 6.444

7.  An automatic pipeline for the design of irreversible derivatives identifies a potent SARS-CoV-2 Mpro inhibitor.

Authors:  Daniel Zaidman; Paul Gehrtz; Mihajlo Filep; Daren Fearon; Ronen Gabizon; Alice Douangamath; Jaime Prilusky; Shirly Duberstein; Galit Cohen; C David Owen; Efrat Resnick; Claire Strain-Damerell; Petra Lukacik; Haim Barr; Martin A Walsh; Frank von Delft; Nir London
Journal:  Cell Chem Biol       Date:  2021-06-25       Impact factor: 8.116

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.