Literature DB >> 32067019

A review of mathematical representations of biomolecular data.

Duc Duy Nguyen1, Zixuan Cang1, Guo-Wei Wei2.   

Abstract

Recently, machine learning (ML) has established itself in various worldwide benchmarking competitions in computational biology, including Critical Assessment of Structure Prediction (CASP) and Drug Design Data Resource (D3R) Grand Challenges. However, the intricate structural complexity and high ML dimensionality of biomolecular datasets obstruct the efficient application of ML algorithms in the field. In addition to data and algorithm, an efficient ML machinery for biomolecular predictions must include structural representation as an indispensable component. Mathematical representations that simplify the biomolecular structural complexity and reduce ML dimensionality have emerged as a prime winner in D3R Grand Challenges. This review is devoted to the recent advances in developing low-dimensional and scalable mathematical representations of biomolecules in our laboratory. We discuss three classes of mathematical approaches, including algebraic topology, differential geometry, and graph theory. We elucidate how the physical and biological challenges have guided the evolution and development of these mathematical apparatuses for massive and diverse biomolecular data. We focus the performance analysis on protein-ligand binding predictions in this review although these methods have had tremendous success in many other applications, such as protein classification, virtual screening, and the predictions of solubility, solvation free energies, toxicity, partition coefficients, protein folding stability changes upon mutation, etc.

Entities:  

Mesh:

Year:  2020        PMID: 32067019      PMCID: PMC7081943          DOI: 10.1039/c9cp06554g

Source DB:  PubMed          Journal:  Phys Chem Chem Phys        ISSN: 1463-9076            Impact factor:   3.676


  103 in total

1.  LigandFit: a novel method for the shape-directed rapid docking of ligands to protein active sites.

Authors:  C M Venkatachalam; X Jiang; T Oldfield; M Waldman
Journal:  J Mol Graph Model       Date:  2003-01       Impact factor: 2.518

2.  Geometric and potential driving formation and evolution of biomolecular surfaces.

Authors:  P W Bates; Zhan Chen; Yuhui Sun; Guo-Wei Wei; Shan Zhao
Journal:  J Math Biol       Date:  2008-10-22       Impact factor: 2.259

3.  STRUM: structure-based prediction of protein stability changes upon single-point mutation.

Authors:  Lijun Quan; Qiang Lv; Yang Zhang
Journal:  Bioinformatics       Date:  2016-06-17       Impact factor: 6.937

4.  Integration of element specific persistent homology and machine learning for protein-ligand binding affinity prediction.

Authors:  Zixuan Cang; Guo-Wei Wei
Journal:  Int J Numer Method Biomed Eng       Date:  2017-08-16       Impact factor: 2.747

5.  Coarse grained normal mode analysis vs. refined Gaussian Network Model for protein residue-level structural fluctuations.

Authors:  Jun-Koo Park; Robert Jernigan; Zhijun Wu
Journal:  Bull Math Biol       Date:  2013-01-08       Impact factor: 1.758

6.  DG-GL: Differential geometry-based geometric learning of molecular datasets.

Authors:  Duc Duy Nguyen; Guo-Wei Wei
Journal:  Int J Numer Method Biomed Eng       Date:  2019-02-07       Impact factor: 2.747

7.  Geometric modeling of subcellular structures, organelles, and multiprotein complexes.

Authors:  Xin Feng; Kelin Xia; Yiying Tong; Guo-Wei Wei
Journal:  Int J Numer Method Biomed Eng       Date:  2012-11-21       Impact factor: 2.747

8.  Variational multiscale models for charge transport.

Authors:  Guo-Wei Wei; Qiong Zheng; Zhan Chen; Kelin Xia
Journal:  SIAM Rev Soc Ind Appl Math       Date:  2012-11-08       Impact factor: 10.780

9.  Coarse-grained models reveal functional dynamics--I. Elastic network models--theories, comparisons and perspectives.

Authors:  Lee-Wei Yang; Choon-Peng Chng
Journal:  Bioinform Biol Insights       Date:  2008-03-04

10.  KFC Server: interactive forecasting of protein interaction hot spots.

Authors:  Steven J Darnell; Laura LeGault; Julie C Mitchell
Journal:  Nucleic Acids Res       Date:  2008-06-06       Impact factor: 16.971

View more
  17 in total

1.  MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors.

Authors:  Robson P Bonidia; Douglas S Domingues; Danilo S Sanches; André C P L F de Carvalho
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

2.  HOMOTOPY CONTINUATION FOR THE SPECTRA OF PERSISTENT LAPLACIANS.

Authors:  Xiaoqi Wei; Guo-Wei Wei
Journal:  Found Data Sci       Date:  2021-12

Review 3.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

4.  Are 2D fingerprints still valuable for drug discovery?

Authors:  Kaifu Gao; Duc Duy Nguyen; Vishnu Sresht; Alan M Mathiowetz; Meihua Tu; Guo-Wei Wei
Journal:  Phys Chem Chem Phys       Date:  2020-04-29       Impact factor: 3.676

5.  Site-Level Bioactivity of Small-Molecules from Deep-Learned Representations of Quantum Chemistry.

Authors:  Kathryn Sarullo; Matthew K Matlock; S Joshua Swamidass
Journal:  J Phys Chem A       Date:  2020-10-21       Impact factor: 2.781

6.  Persistent spectral-based machine learning (PerSpect ML) for protein-ligand binding affinity prediction.

Authors:  Zhenyu Meng; Kelin Xia
Journal:  Sci Adv       Date:  2021-05-07       Impact factor: 14.136

7.  AweGNN: Auto-parametrized weighted element-specific graph neural networks for molecules.

Authors:  Timothy Szocinski; Duc Duy Nguyen; Guo-Wei Wei
Journal:  Comput Biol Med       Date:  2021-05-12       Impact factor: 6.698

8.  Identifying metal binding amino acids based on backbone geometries as a tool for metalloprotein engineering.

Authors:  Hoang Nguyen; Jesse Kleingardner
Journal:  Protein Sci       Date:  2021-04-20       Impact factor: 6.993

9.  Mathematical artificial intelligence design of mutation-proof COVID-19 monoclonal antibodies.

Authors:  Jiahui Chen; Guo-Wei Wei
Journal:  ArXiv       Date:  2022-04-20

10.  Unveiling the molecular mechanism of SARS-CoV-2 main protease inhibition from 137 crystal structures using algebraic topology and deep learning.

Authors:  Duc Duy Nguyen; Kaifu Gao; Jiahui Chen; Rui Wang; Guo-Wei Wei
Journal:  Chem Sci       Date:  2020-09-30       Impact factor: 9.825

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.