Literature DB >> 29253336

A Minimum Variance Clustering Approach Produces Robust and Interpretable Coarse-Grained Models.

Brooke E Husic1, Keri A McKiernan1, Hannah K Wayment-Steele1, Mohammad M Sultan1, Vijay S Pande1.   

Abstract

Markov state models (MSMs) are a powerful framework for the analysis of molecular dynamics data sets, such as protein folding simulations, because of their straightforward construction and statistical rigor. The coarse-graining of MSMs into an interpretable number of macrostates is a crucial step for connecting theoretical results with experimental observables. Here we present the minimum variance clustering approach (MVCA) for the coarse-graining of MSMs into macrostate models. The method utilizes agglomerative clustering with Ward's minimum variance objective function, and the similarity of the microstate dynamics is determined using the Jensen-Shannon divergence between the corresponding rows in the MSM transition probability matrix. We first show that MVCA produces intuitive results for a simple tripeptide system and is robust toward long-duration statistical artifacts. MVCA is then applied to two protein folding simulations of the same protein in different force fields to demonstrate that a different number of macrostates is appropriate for each model, revealing a misfolded state present in only one of the simulations. Finally, we show that the same method can be used to analyze a data set containing many MSMs from simulations in different force fields by aggregating them into groups and quantifying their dynamical similarity in the context of force field parameter choices. The minimum variance clustering approach with the Jensen-Shannon divergence provides a powerful tool to group dynamics by similarity, both among model states and among dynamical models themselves.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29253336     DOI: 10.1021/acs.jctc.7b01004

Source DB:  PubMed          Journal:  J Chem Theory Comput        ISSN: 1549-9618            Impact factor:   6.006


  4 in total

1.  Galerkin approximation of dynamical quantities using trajectory data.

Authors:  Erik H Thiede; Dimitrios Giannakis; Aaron R Dinner; Jonathan Weare
Journal:  J Chem Phys       Date:  2019-06-28       Impact factor: 3.488

2.  Conformational analysis of replica exchange MD: Temperature-dependent Markov networks for FF amyloid peptides.

Authors:  Brajesh Narayan; Colm Herbert; Ye Yuan; Brian J Rodriguez; Bernard R Brooks; Nicolae-Viorel Buchete
Journal:  J Chem Phys       Date:  2018-08-21       Impact factor: 3.488

3.  Deep learning the structural determinants of protein biochemical properties by comparing structural ensembles with DiffNets.

Authors:  Michael D Ward; Maxwell I Zimmerman; Artur Meller; Moses Chung; S J Swamidass; Gregory R Bowman
Journal:  Nat Commun       Date:  2021-05-21       Impact factor: 14.919

4.  Simultaneous coherent structure coloring facilitates interpretable clustering of scientific data by amplifying dissimilarity.

Authors:  Brooke E Husic; Kristy L Schlueter-Kuck; John O Dabiri
Journal:  PLoS One       Date:  2019-03-13       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.