Literature DB >> 30693661

DG-GL: Differential geometry-based geometric learning of molecular datasets.

Duc Duy Nguyen1, Guo-Wei Wei1,2,3.   

Abstract

MOTIVATION: Despite its great success in various physical modeling, differential geometry (DG) has rarely been devised as a versatile tool for analyzing large, diverse, and complex molecular and biomolecular datasets because of the limited understanding of its potential power in dimensionality reduction and its ability to encode essential chemical and biological information in differentiable manifolds.
RESULTS: We put forward a differential geometry-based geometric learning (DG-GL) hypothesis that the intrinsic physics of three-dimensional (3D) molecular structures lies on a family of low-dimensional manifolds embedded in a high-dimensional data space. We encode crucial chemical, physical, and biological information into 2D element interactive manifolds, extracted from a high-dimensional structural data space via a multiscale discrete-to-continuum mapping using differentiable density estimators. Differential geometry apparatuses are utilized to construct element interactive curvatures in analytical forms for certain analytically differentiable density estimators. These low-dimensional differential geometry representations are paired with a robust machine learning algorithm to showcase their descriptive and predictive powers for large, diverse, and complex molecular and biomolecular datasets. Extensive numerical experiments are carried out to demonstrate that the proposed DG-GL strategy outperforms other advanced methods in the predictions of drug discovery-related protein-ligand binding affinity, drug toxicity, and molecular solvation free energy.
AVAILABILITY AND IMPLEMENTATION: http://weilab.math.msu.edu/DG-GL/ Contact: wei@math.msu.edu.
© 2019 John Wiley & Sons, Ltd.

Entities:  

Keywords:  biomolecular data; drug discovery; geometric data analysis; machine learning

Mesh:

Year:  2019        PMID: 30693661      PMCID: PMC6598676          DOI: 10.1002/cnm.3179

Source DB:  PubMed          Journal:  Int J Numer Method Biomed Eng        ISSN: 2040-7939            Impact factor:   2.747


  63 in total

1.  Random forest: a classification and regression tool for compound classification and QSAR modeling.

Authors:  Vladimir Svetnik; Andy Liaw; Christopher Tong; J Christopher Culberson; Robert P Sheridan; Bradley P Feuston
Journal:  J Chem Inf Comput Sci       Date:  2003 Nov-Dec

2.  Multiscale weighted colored graphs for protein flexibility and rigidity analysis.

Authors:  David Bramer; Guo-Wei Wei
Journal:  J Chem Phys       Date:  2018-02-07       Impact factor: 3.488

3.  The interpretation of protein structures: estimation of static accessibility.

Authors:  B Lee; F M Richards
Journal:  J Mol Biol       Date:  1971-02-14       Impact factor: 5.469

4.  Differential geometry based multiscale models.

Authors:  Guo-Wei Wei
Journal:  Bull Math Biol       Date:  2010-02-19       Impact factor: 1.758

5.  Multiscale multiphysics and multidomain models--flexibility and rigidity.

Authors:  Kelin Xia; Kristopher Opron; Guo-Wei Wei
Journal:  J Chem Phys       Date:  2013-11-21       Impact factor: 3.488

6.  Geometric modeling of subcellular structures, organelles, and multiprotein complexes.

Authors:  Xin Feng; Kelin Xia; Yiying Tong; Guo-Wei Wei
Journal:  Int J Numer Method Biomed Eng       Date:  2012-11-21       Impact factor: 2.747

7.  pKa predictions for proteins, RNAs, and DNAs with the Gaussian dielectric function using DelPhi pKa.

Authors:  Lin Wang; Lin Li; Emil Alexov
Journal:  Proteins       Date:  2015-10-16

8.  Generalized flexibility-rigidity index.

Authors:  Duc Duy Nguyen; Kelin Xia; Guo-Wei Wei
Journal:  J Chem Phys       Date:  2016-06-21       Impact factor: 3.488

9.  Analysis and prediction of protein folding energy changes upon mutation by element specific persistent homology.

Authors:  Zixuan Cang; Guo-Wei Wei
Journal:  Bioinformatics       Date:  2017-11-15       Impact factor: 6.937

10.  istar: a web platform for large-scale protein-ligand docking.

Authors:  Hongjian Li; Kwong-Sak Leung; Pedro J Ballester; Man-Hon Wong
Journal:  PLoS One       Date:  2014-01-24       Impact factor: 3.240

View more
  11 in total

1.  Generative network complex (GNC) for drug discovery.

Authors:  Christopher Grow; Kaifu Gao; Duc Duy Nguyen; Guo-Wei Wei
Journal:  Commun Inf Syst       Date:  2019

2.  MathDL: mathematical deep learning for D3R Grand Challenge 4.

Authors:  Duc Duy Nguyen; Kaifu Gao; Menglun Wang; Guo-Wei Wei
Journal:  J Comput Aided Mol Des       Date:  2019-11-16       Impact factor: 3.686

3.  AGL-Score: Algebraic Graph Learning Score for Protein-Ligand Binding Scoring, Ranking, Docking, and Screening.

Authors:  Duc Duy Nguyen; Guo-Wei Wei
Journal:  J Chem Inf Model       Date:  2019-07-01       Impact factor: 4.956

4.  Extracting Predictive Representations from Hundreds of Millions of Molecules.

Authors:  Dong Chen; Jiaxin Zheng; Guo-Wei Wei; Feng Pan
Journal:  J Phys Chem Lett       Date:  2021-11-01       Impact factor: 6.888

Review 5.  A review of mathematical representations of biomolecular data.

Authors:  Duc Duy Nguyen; Zixuan Cang; Guo-Wei Wei
Journal:  Phys Chem Chem Phys       Date:  2020-02-26       Impact factor: 3.676

6.  Are 2D fingerprints still valuable for drug discovery?

Authors:  Kaifu Gao; Duc Duy Nguyen; Vishnu Sresht; Alan M Mathiowetz; Meihua Tu; Guo-Wei Wei
Journal:  Phys Chem Chem Phys       Date:  2020-04-29       Impact factor: 3.676

7.  Boosting Tree-Assisted Multitask Deep Learning for Small Scientific Datasets.

Authors:  Jian Jiang; Rui Wang; Menglun Wang; Kaifu Gao; Duc Duy Nguyen; Guo-Wei Wei
Journal:  J Chem Inf Model       Date:  2020-02-03       Impact factor: 4.956

8.  AweGNN: Auto-parametrized weighted element-specific graph neural networks for molecules.

Authors:  Timothy Szocinski; Duc Duy Nguyen; Guo-Wei Wei
Journal:  Comput Biol Med       Date:  2021-05-12       Impact factor: 6.698

9.  Molecular Design Learned from the Natural Product Porphyra-334: Molecular Generation via Chemical Variational Autoencoder versus Database Mining via Similarity Search, A Comparative Study.

Authors:  Yuki Harada; Makoto Hatakeyama; Shuichi Maeda; Qi Gao; Kenichi Koizumi; Yuki Sakamoto; Yuuki Ono; Shinichiro Nakamura
Journal:  ACS Omega       Date:  2022-03-02

10.  Unveiling the molecular mechanism of SARS-CoV-2 main protease inhibition from 137 crystal structures using algebraic topology and deep learning.

Authors:  Duc Duy Nguyen; Kaifu Gao; Jiahui Chen; Rui Wang; Guo-Wei Wei
Journal:  Chem Sci       Date:  2020-09-30       Impact factor: 9.825

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.