Literature DB >> 34126827

Assigning confidence to molecular property prediction.

AkshatKumar Nigam1,2, Robert Pollice1,2, Matthew F D Hurley3, Riley J Hickman1,2, Matteo Aldeghi1,2,4, Naruki Yoshikawa1,2, Seyone Chithrananda2, Vincent A Voelz3, Alán Aspuru-Guzik1,2,4,5.   

Abstract

Introduction: Computational modeling has rapidly advanced over the last decades. Recently, machine learning has emerged as a powerful and cost-effective strategy to learn from existing datasets and perform predictions on unseen molecules. Accordingly, the explosive rise of data-driven techniques raises an important question: What confidence can be assigned to molecular property predictions and what techniques can be used?Areas covered: The authors discuss popular strategies for predicting molecular properties, their corresponding uncertainty sources and methods to quantify uncertainty. First, the authors' considerations for assessing confidence begin with dataset bias and size, data-driven property prediction and feature design. Next, the authors discuss property simulation via computations of binding affinity in detail. Lastly, they investigate how these uncertainties propagate to generative models, as they are usually coupled with property predictors.Expert opinion: Computational techniques are paramount to reduce the prohibitive cost of brute-force experimentation during exploration. The authors believe that assessing uncertainty in property prediction models is essential whenever closed-loop drug design campaigns relying on high-throughput virtual screening are deployed. Accordingly, considering sources of uncertainty leads to better-informed validations, more reliable predictions and more realistic expectations of the entire workflow. Overall, this increases confidence in the predictions and, ultimately, accelerates drug design.

Entities:  

Keywords:  Neural networks; artificial intelligence; deep learning; docking; drug discovery; generative models; model uncertainty estimation; molecular dynamics

Mesh:

Year:  2021        PMID: 34126827      PMCID: PMC9449894          DOI: 10.1080/17460441.2021.1925247

Source DB:  PubMed          Journal:  Expert Opin Drug Discov        ISSN: 1746-0441            Impact factor:   7.050


  86 in total

1.  Building Force Fields: An Automatic, Systematic, and Reproducible Approach.

Authors:  Lee-Ping Wang; Todd J Martinez; Vijay S Pande
Journal:  J Phys Chem Lett       Date:  2014-05-16       Impact factor: 6.475

2.  Accurate and reliable prediction of relative ligand binding potency in prospective drug discovery by way of a modern free-energy calculation protocol and force field.

Authors:  Lingle Wang; Yujie Wu; Yuqing Deng; Byungchan Kim; Levi Pierce; Goran Krilov; Dmitry Lupyan; Shaughnessy Robinson; Markus K Dahlgren; Jeremy Greenwood; Donna L Romero; Craig Masse; Jennifer L Knight; Thomas Steinbrecher; Thijs Beuming; Wolfgang Damm; Ed Harder; Woody Sherman; Mark Brewer; Ron Wester; Mark Murcko; Leah Frye; Ramy Farid; Teng Lin; David L Mobley; William L Jorgensen; Bruce J Berne; Richard A Friesner; Robert Abel
Journal:  J Am Chem Soc       Date:  2015-02-12       Impact factor: 15.419

Review 3.  Transfer Learning for Drug Discovery.

Authors:  Chenjing Cai; Shiwei Wang; Youjun Xu; Weilin Zhang; Ke Tang; Qi Ouyang; Luhua Lai; Jianfeng Pei
Journal:  J Med Chem       Date:  2020-07-24       Impact factor: 7.446

4.  PubChemQC PM6: Data Sets of 221 Million Molecules with Optimized Molecular Geometries and Electronic Properties.

Authors:  Maho Nakata; Tomomi Shimazaki; Masatomo Hashimoto; Toshiyuki Maeda
Journal:  J Chem Inf Model       Date:  2020-10-26       Impact factor: 4.956

5.  Are Protein Force Fields Getting Better? A Systematic Benchmark on 524 Diverse NMR Measurements.

Authors:  Kyle A Beauchamp; Yu-Shan Lin; Rhiju Das; Vijay S Pande
Journal:  J Chem Theory Comput       Date:  2012-03-12       Impact factor: 6.006

6.  ZINC20-A Free Ultralarge-Scale Chemical Database for Ligand Discovery.

Authors:  John J Irwin; Khanh G Tang; Jennifer Young; Chinzorig Dandarchuluun; Benjamin R Wong; Munkhzul Khurelbaatar; Yurii S Moroz; John Mayfield; Roger A Sayle
Journal:  J Chem Inf Model       Date:  2020-10-29       Impact factor: 4.956

7.  Challenges Encountered Applying Equilibrium and Nonequilibrium Binding Free Energy Calculations.

Authors:  Hannah M Baumann; Vytautas Gapsys; Bert L de Groot; David L Mobley
Journal:  J Phys Chem B       Date:  2021-04-27       Impact factor: 2.991

8.  PubChem's BioAssay Database.

Authors:  Yanli Wang; Jewen Xiao; Tugba O Suzek; Jian Zhang; Jiyao Wang; Zhigang Zhou; Lianyi Han; Karen Karapetyan; Svetlana Dracheva; Benjamin A Shoemaker; Evan Bolton; Asta Gindulyte; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2011-12-02       Impact factor: 16.971

9.  ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost.

Authors:  J S Smith; O Isayev; A E Roitberg
Journal:  Chem Sci       Date:  2017-02-08       Impact factor: 9.825

10.  Optimization of Molecules via Deep Reinforcement Learning.

Authors:  Zhenpeng Zhou; Steven Kearnes; Li Li; Richard N Zare; Patrick Riley
Journal:  Sci Rep       Date:  2019-07-24       Impact factor: 4.379

View more
  5 in total

1.  Theoretical studies on triplet-state driven dissociation of formaldehyde by quasi-classical molecular dynamics simulation on machine-learning potential energy surface.

Authors:  Shichen Lin; Daoling Peng; Weitao Yang; Feng Long Gu; Zhenggang Lan
Journal:  J Chem Phys       Date:  2021-12-07       Impact factor: 3.488

2.  Stacking Gaussian processes to improve [Formula: see text] predictions in the SAMPL7 challenge.

Authors:  Robert M Raddi; Vincent A Voelz
Journal:  J Comput Aided Mol Des       Date:  2021-08-07       Impact factor: 4.179

3.  A Novel Molecular Representation Learning for Molecular Property Prediction with a Multiple SMILES-Based Augmentation.

Authors:  Chunyan Li; Jihua Feng; Shihu Liu; Junfeng Yao
Journal:  Comput Intell Neurosci       Date:  2022-01-28

4.  Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design.

Authors:  AkshatKumar Nigam; Robert Pollice; Alán Aspuru-Guzik
Journal:  Digit Discov       Date:  2022-05-03

Review 5.  On scientific understanding with artificial intelligence.

Authors:  Mario Krenn; Robert Pollice; Si Yue Guo; Matteo Aldeghi; Alba Cervera-Lierta; Pascal Friederich; Gabriel Dos Passos Gomes; Florian Häse; Adrian Jinich; AkshatKumar Nigam; Zhenpeng Yao; Alán Aspuru-Guzik
Journal:  Nat Rev Phys       Date:  2022-10-11
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.