Literature DB >> 33973773

UMAP as a Dimensionality Reduction Tool for Molecular Dynamics Simulations of Biomacromolecules: A Comparison Study.

Francesco Trozzi1, Xinlei Wang2, Peng Tao1.   

Abstract

Proteins are the molecular machines of life. The multitude of possible conformations that proteins can adopt determines their free-energy landscapes. However, the inherently high dimensionality of a protein free-energy landscape poses a challenge to deciphering how proteins perform their functions. For this reason, dimensionality reduction is an active field of research for molecular biologists. The uniform manifold approximation and projection (UMAP) is a dimensionality reduction method based on a fuzzy topological analysis of data. In the present study, the performance of UMAP is compared with that of other popular dimensionality reduction methods such as t-distributed stochastic neighbor embedding (t-SNE), principal component analysis (PCA), and time-structure independent components analysis (tICA) in the context of analyzing molecular dynamics simulations of the circadian clock protein VIVID. A good dimensionality reduction method should accurately represent the data structure on the projected components. The comparison of the raw high-dimensional data with the projections obtained using different dimensionality reduction methods based on various metrics showed that UMAP has superior performance when compared with linear reduction methods (PCA and tICA) and has competitive performance and scalable computational cost.

Entities:  

Mesh:

Year:  2021        PMID: 33973773      PMCID: PMC8356557          DOI: 10.1021/acs.jpcb.1c02081

Source DB:  PubMed          Journal:  J Phys Chem B        ISSN: 1520-5207            Impact factor:   2.991


  46 in total

Review 1.  From protein structure to function.

Authors:  C A Orengo; A E Todd; J M Thornton
Journal:  Curr Opin Struct Biol       Date:  1999-06       Impact factor: 6.809

2.  Large-amplitude nonlinear motions in proteins.

Authors: 
Journal:  Phys Rev Lett       Date:  1992-04-27       Impact factor: 9.161

Review 3.  CHARMM: the biomolecular simulation program.

Authors:  B R Brooks; C L Brooks; A D Mackerell; L Nilsson; R J Petrella; B Roux; Y Won; G Archontis; C Bartels; S Boresch; A Caflisch; L Caves; Q Cui; A R Dinner; M Feig; S Fischer; J Gao; M Hodoscek; W Im; K Kuczera; T Lazaridis; J Ma; V Ovchinnikov; E Paci; R W Pastor; C B Post; J Z Pu; M Schaefer; B Tidor; R M Venable; H L Woodcock; X Wu; W Yang; D M York; M Karplus
Journal:  J Comput Chem       Date:  2009-07-30       Impact factor: 3.376

Review 4.  Long-timescale molecular dynamics simulations of protein structure and function.

Authors:  John L Klepeis; Kresten Lindorff-Larsen; Ron O Dror; David E Shaw
Journal:  Curr Opin Struct Biol       Date:  2009-04-08       Impact factor: 6.809

5.  Algorithmic dimensionality reduction for molecular structure analysis.

Authors:  W Michael Brown; Shawn Martin; Sara N Pollock; Evangelos A Coutsias; Jean-Paul Watson
Journal:  J Chem Phys       Date:  2008-08-14       Impact factor: 3.488

6.  Application of nonlinear dimensionality reduction to characterize the conformational landscape of small peptides.

Authors:  Hernán Stamati; Cecilia Clementi; Lydia E Kavraki
Journal:  Proteins       Date:  2010-02-01

7.  MSMBuilder: Statistical Models for Biomolecular Dynamics.

Authors:  Matthew P Harrigan; Mohammad M Sultan; Carlos X Hernández; Brooke E Husic; Peter Eastman; Christian R Schwantes; Kyle A Beauchamp; Robert T McGibbon; Vijay S Pande
Journal:  Biophys J       Date:  2017-01-10       Impact factor: 4.033

8.  tICA-Metadynamics: Accelerating Metadynamics by Using Kinetically Selected Collective Variables.

Authors:  Mohammad M Sultan; Vijay S Pande
Journal:  J Chem Theory Comput       Date:  2017-05-09       Impact factor: 6.006

9.  Exploring protein dynamics space: the dynasome as the missing link between protein structure and function.

Authors:  Ulf Hensen; Tim Meyer; Jürgen Haas; René Rex; Gert Vriend; Helmut Grubmüller
Journal:  PLoS One       Date:  2012-05-11       Impact factor: 3.240

10.  High-resolution mapping of protein sequence-function relationships.

Authors:  Douglas M Fowler; Carlos L Araya; Sarel J Fleishman; Elizabeth H Kellogg; Jason J Stephany; David Baker; Stanley Fields
Journal:  Nat Methods       Date:  2010-08-15       Impact factor: 28.547

View more
  11 in total

1.  A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity.

Authors:  Scott Bowler; Georgios Papoutsoglou; Aristides Karanikas; Ioannis Tsamardinos; Michael J Corley; Lishomwa C Ndhlovu
Journal:  Sci Rep       Date:  2022-10-19       Impact factor: 4.996

2.  Size-and-Shape Space Gaussian Mixture Models for Structural Clustering of Molecular Dynamics Trajectories.

Authors:  Heidi Klem; Glen M Hocky; Martin McCullagh
Journal:  J Chem Theory Comput       Date:  2022-04-28       Impact factor: 6.578

3.  Using bi-dimensional representations to understand patterns in COVID-19 blood exam data.

Authors:  Vitor P Bezzan; Cleber D Rocco
Journal:  Inform Med Unlocked       Date:  2021-12-30

4.  Explore Protein Conformational Space With Variational Autoencoder.

Authors:  Hao Tian; Xi Jiang; Francesco Trozzi; Sian Xiao; Eric C Larson; Peng Tao
Journal:  Front Mol Biosci       Date:  2021-11-12

5.  Single-Cell Atlas of Adult Testis in Protogynous Hermaphroditic Orange-Spotted Grouper, Epinephelus coioides.

Authors:  Xi Wu; Yang Yang; Chaoyue Zhong; Tong Wang; Yanhong Deng; Hengjin Huang; Haoran Lin; Zining Meng; Xiaochun Liu
Journal:  Int J Mol Sci       Date:  2021-11-22       Impact factor: 5.923

6.  Circulating NAD+ Metabolism-Derived Genes Unveils Prognostic and Peripheral Immune Infiltration in Amyotrophic Lateral Sclerosis.

Authors:  Cheng Li; Yu Zhu; Wenzhi Chen; Menghua Li; Mi Yang; Ziyang Shen; Yiyi Zhou; Lulu Wang; Huan Wang; Shu Li; Jiacheng Ma; Mengni Gong; Renshi Xu
Journal:  Front Cell Dev Biol       Date:  2022-01-28

7.  Single-Cell Transcriptome and Network Analyses Unveil Key Transcription Factors Regulating Mesophyll Cell Development in Maize.

Authors:  Shentong Tao; Peng Liu; Yining Shi; Yilong Feng; Jingjing Gao; Lifen Chen; Aicen Zhang; Xuejiao Cheng; Hairong Wei; Tao Zhang; Wenli Zhang
Journal:  Genes (Basel)       Date:  2022-02-20       Impact factor: 4.096

8.  Mechanistic Insights into Enzyme Catalysis from Explaining Machine-Learned Quantum Mechanical and Molecular Mechanical Minimum Energy Pathways.

Authors:  Zilin Song; Francesco Trozzi; Hao Tian; Chao Yin; Peng Tao
Journal:  ACS Phys Chem Au       Date:  2022-05-18

Review 9.  Collective variable discovery in the age of machine learning: reality, hype and everything in between.

Authors:  Soumendranath Bhakat
Journal:  RSC Adv       Date:  2022-09-02       Impact factor: 4.036

10.  Cuproptosis-Related genes in the prognosis of colorectal cancer and their correlation with the tumor microenvironment.

Authors:  Weiqiang Wu; Jingqing Dong; Yang Lv; Dongmin Chang
Journal:  Front Genet       Date:  2022-09-28       Impact factor: 4.772

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.