Literature DB >> 35483073

Size-and-Shape Space Gaussian Mixture Models for Structural Clustering of Molecular Dynamics Trajectories.

Heidi Klem1, Glen M Hocky2, Martin McCullagh3.   

Abstract

Determining the optimal number and identity of structural clusters from an ensemble of molecular configurations continues to be a challenge. Recent structural clustering methods have focused on the use of internal coordinates due to the innate rotational and translational invariance of these features. The vast number of possible internal coordinates necessitates a feature space supervision step to make clustering tractable but yields a protocol that can be system type-specific. Particle positions offer an appealing alternative to internal coordinates but suffer from a lack of rotational and translational invariance, as well as a perceived insensitivity to regions of structural dissimilarity. Here, we present a method, denoted shape-GMM, that overcomes the shortcomings of particle positions using a weighted maximum likelihood alignment procedure. This alignment strategy is then built into an expectation maximization Gaussian mixture model (GMM) procedure to capture metastable states in the free-energy landscape. The resulting algorithm distinguishes between a variety of different structures, including those indistinguishable by root-mean-square displacement and pairwise distances, as demonstrated on several model systems. Shape-GMM results on an extensive simulation of the fast-folding HP35 Nle/Nle mutant protein support a four-state folding/unfolding mechanism, which is consistent with previous experimental results and provides kinetic details comparable to previous state-of-the art clustering approaches, as measured by the VAMP-2 score. Currently, training of shape-GMMs is recommended for systems (or subsystems) that can be represented by ≲200 particles and ≲100k configurations to estimate high-dimensional covariance matrices and balance computational expense. Once a shape-GMM is trained, it can be used to predict the cluster identities of millions of configurations.

Entities:  

Mesh:

Year:  2022        PMID: 35483073      PMCID: PMC9228201          DOI: 10.1021/acs.jctc.1c01290

Source DB:  PubMed          Journal:  J Chem Theory Comput        ISSN: 1549-9618            Impact factor:   6.578


  48 in total

1.  Simple few-state models reveal hidden complexity in protein folding.

Authors:  Kyle A Beauchamp; Robert McGibbon; Yu-Shan Lin; Vijay S Pande
Journal:  Proc Natl Acad Sci U S A       Date:  2012-07-09       Impact factor: 11.205

2.  Clustering Molecular Dynamics Trajectories: 1. Characterizing the Performance of Different Clustering Algorithms.

Authors:  Jianyin Shao; Stephen W Tanner; Nephi Thompson; Thomas E Cheatham
Journal:  J Chem Theory Comput       Date:  2007-11       Impact factor: 6.006

3.  High-resolution x-ray crystal structures of the villin headpiece subdomain, an ultrafast folding protein.

Authors:  Thang K Chiu; Jan Kubelka; Regine Herbst-Irmer; William A Eaton; James Hofrichter; David R Davies
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-13       Impact factor: 11.205

4.  Sub-microsecond protein folding.

Authors:  Jan Kubelka; Thang K Chiu; David R Davies; William A Eaton; James Hofrichter
Journal:  J Mol Biol       Date:  2006-03-31       Impact factor: 5.469

5.  Dynamical coring of Markov state models.

Authors:  Daniel Nagel; Anna Weber; Benjamin Lickert; Gerhard Stock
Journal:  J Chem Phys       Date:  2019-03-07       Impact factor: 3.488

6.  Molecular Mechanism Behind the Fast Folding/Unfolding Transitions of Villin Headpiece Subdomain: Hierarchy and Heterogeneity.

Authors:  Toshifumi Mori; Shinji Saito
Journal:  J Phys Chem B       Date:  2016-11-03       Impact factor: 2.991

7.  Protein folding kinetics and thermodynamics from atomistic simulation.

Authors:  Stefano Piana; Kresten Lindorff-Larsen; David E Shaw
Journal:  Proc Natl Acad Sci U S A       Date:  2012-07-20       Impact factor: 11.205

8.  Principal component analysis of molecular dynamics: on the use of Cartesian vs. internal coordinates.

Authors:  Florian Sittel; Abhinav Jain; Gerhard Stock
Journal:  J Chem Phys       Date:  2014-07-07       Impact factor: 3.488

9.  Robust Density-Based Clustering To Identify Metastable Conformational States of Proteins.

Authors:  Florian Sittel; Gerhard Stock
Journal:  J Chem Theory Comput       Date:  2016-04-21       Impact factor: 6.006

10.  Sapphire-Based Clustering.

Authors:  Francesco Cocina; Andreas Vitalis; Amedeo Caflisch
Journal:  J Chem Theory Comput       Date:  2020-09-24       Impact factor: 6.006

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.