Literature DB >> 32266895

Are 2D fingerprints still valuable for drug discovery?

Kaifu Gao1, Duc Duy Nguyen1, Vishnu Sresht2, Alan M Mathiowetz2, Meihua Tu2, Guo-Wei Wei3.   

Abstract

Recently, molecular fingerprints extracted from three-dimensional (3D) structures using advanced mathematics, such as algebraic topology, differential geometry, and graph theory have been paired with efficient machine learning, especially deep learning algorithms to outperform other methods in drug discovery applications and competitions. This raises the question of whether classical 2D fingerprints are still valuable in computer-aided drug discovery. This work considers 23 datasets associated with four typical problems, namely protein-ligand binding, toxicity, solubility and partition coefficient to assess the performance of eight 2D fingerprints. Advanced machine learning algorithms including random forest, gradient boosted decision tree, single-task deep neural network and multitask deep neural network are employed to construct efficient 2D-fingerprint based models. Additionally, appropriate consensus models are built to further enhance the performance of 2D-fingerprint-based methods. It is demonstrated that 2D-fingerprint-based models perform as well as the state-of-the-art 3D structure-based models for the predictions of toxicity, solubility, partition coefficient and protein-ligand binding affinity based on only ligand information. However, 3D structure-based models outperform 2D fingerprint-based methods in complex-based protein-ligand binding affinity predictions.

Entities:  

Mesh:

Year:  2020        PMID: 32266895      PMCID: PMC7224332          DOI: 10.1039/d0cp00305k

Source DB:  PubMed          Journal:  Phys Chem Chem Phys        ISSN: 1463-9076            Impact factor:   3.676


  45 in total

1.  Random forest: a classification and regression tool for compound classification and QSAR modeling.

Authors:  Vladimir Svetnik; Andy Liaw; Christopher Tong; J Christopher Culberson; Robert P Sheridan; Bradley P Feuston
Journal:  J Chem Inf Comput Sci       Date:  2003 Nov-Dec

Review 2.  Electrotopological state atom (E-state) index in drug design, QSAR, property prediction and toxicity assessment.

Authors:  Kunal Roy; Indrani Mitra
Journal:  Curr Comput Aided Drug Des       Date:  2012-06       Impact factor: 1.606

3.  Comparative Assessment of Scoring Functions: The CASF-2016 Update.

Authors:  Minyi Su; Qifan Yang; Yu Du; Guoqin Feng; Zhihai Liu; Yan Li; Renxiao Wang
Journal:  J Chem Inf Model       Date:  2018-12-11       Impact factor: 4.956

4.  Predictive Multitask Deep Neural Network Models for ADME-Tox Properties: Learning from Large Data Sets.

Authors:  Jan Wenzel; Hans Matter; Friedemann Schmidt
Journal:  J Chem Inf Model       Date:  2019-01-24       Impact factor: 4.956

5.  Rigidity Strengthening: A Mechanism for Protein-Ligand Binding.

Authors:  Duc D Nguyen; Tian Xiao; Menglun Wang; Guo-Wei Wei
Journal:  J Chem Inf Model       Date:  2017-07-12       Impact factor: 4.956

6.  DG-GL: Differential geometry-based geometric learning of molecular datasets.

Authors:  Duc Duy Nguyen; Guo-Wei Wei
Journal:  Int J Numer Method Biomed Eng       Date:  2019-02-07       Impact factor: 2.747

Review 7.  Machine learning in chemoinformatics and drug discovery.

Authors:  Yu-Chen Lo; Stefano E Rensi; Wen Torng; Russ B Altman
Journal:  Drug Discov Today       Date:  2018-05-08       Impact factor: 7.851

8.  D3R grand challenge 2015: Evaluation of protein-ligand pose and affinity predictions.

Authors:  Symon Gathiaka; Shuai Liu; Michael Chiu; Huanwang Yang; Jeanne A Stuckey; You Na Kang; Jim Delproposto; Ginger Kubish; James B Dunbar; Heather A Carlson; Stephen K Burley; W Patrick Walters; Rommie E Amaro; Victoria A Feher; Michael K Gilson
Journal:  J Comput Aided Mol Des       Date:  2016-09-30       Impact factor: 3.686

9.  D3R grand challenge 4: blind prediction of protein-ligand poses, affinity rankings, and relative binding free energies.

Authors:  Conor D Parks; Zied Gaieb; Michael Chiu; Huanwang Yang; Chenghua Shao; W Patrick Walters; Johanna M Jansen; Georgia McGaughey; Richard A Lewis; Scott D Bembenek; Michael K Ameriks; Tara Mirzadegan; Stephen K Burley; Rommie E Amaro; Michael K Gilson
Journal:  J Comput Aided Mol Des       Date:  2020-01-23       Impact factor: 3.686

10.  Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening.

Authors:  Zixuan Cang; Lin Mu; Guo-Wei Wei
Journal:  PLoS Comput Biol       Date:  2018-01-08       Impact factor: 4.475

View more
  14 in total

1.  Comparative analysis of molecular fingerprints in prediction of drug combination effects.

Authors:  B Zagidullin; Z Wang; Y Guan; E Pitkänen; J Tang
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

2.  An Algorithm Framework for Drug-Induced Liver Injury Prediction Based on Genetic Algorithm and Ensemble Learning.

Authors:  Bowei Yan; Xiaona Ye; Jing Wang; Junshan Han; Lianlian Wu; Song He; Kunhong Liu; Xiaochen Bo
Journal:  Molecules       Date:  2022-05-12       Impact factor: 4.927

3.  Proteome-Informed Machine Learning Studies of Cocaine Addiction.

Authors:  Kaifu Gao; Dong Chen; Alfred J Robison; Guo-Wei Wei
Journal:  J Phys Chem Lett       Date:  2021-11-09       Impact factor: 6.888

4.  Extracting Predictive Representations from Hundreds of Millions of Molecules.

Authors:  Dong Chen; Jiaxin Zheng; Guo-Wei Wei; Feng Pan
Journal:  J Phys Chem Lett       Date:  2021-11-01       Impact factor: 6.888

5.  Boosting Tree-Assisted Multitask Deep Learning for Small Scientific Datasets.

Authors:  Jian Jiang; Rui Wang; Menglun Wang; Kaifu Gao; Duc Duy Nguyen; Guo-Wei Wei
Journal:  J Chem Inf Model       Date:  2020-02-03       Impact factor: 4.956

6.  Generative Network Complex for the Automated Generation of Drug-like Molecules.

Authors:  Kaifu Gao; Duc Duy Nguyen; Meihua Tu; Guo-Wei Wei
Journal:  J Chem Inf Model       Date:  2020-08-07       Impact factor: 4.956

7.  Pushing the limits of solubility prediction via quality-oriented data selection.

Authors:  Murat Cihan Sorkun; J M Vianney A Koelman; Süleyman Er
Journal:  iScience       Date:  2020-12-17

8.  Application of Networking Approaches to Assess the Chemical Diversity, Biogeography, and Pharmaceutical Potential of Verongiida Natural Products.

Authors:  James Lever; Robert Brkljača; Colin Rix; Sylvia Urban
Journal:  Mar Drugs       Date:  2021-10-18       Impact factor: 5.118

9.  Similarity-Based Virtual Screen Using Enhanced Siamese Deep Learning Methods.

Authors:  Mohammed Khaldoon Altalib; Naomie Salim
Journal:  ACS Omega       Date:  2022-02-03

10.  Dowker complex based machine learning (DCML) models for protein-ligand binding affinity prediction.

Authors:  Xiang Liu; Huitao Feng; Jie Wu; Kelin Xia
Journal:  PLoS Comput Biol       Date:  2022-04-06       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.