Literature DB >> 18662925

Mixture models for protein structure ensembles.

Michael Hirsch1, Michael Habeck.   

Abstract

MOTIVATION: Protein structure ensembles provide important insight into the dynamics and function of a protein and contain information that is not captured with a single static structure. However, it is not clear a priori to what extent the variability within an ensemble is caused by internal structural changes. Additional variability results from overall translations and rotations of the molecule. And most experimental data do not provide information to relate the structures to a common reference frame. To report meaningful values of intrinsic dynamics, structural precision, conformational entropy, etc., it is therefore important to disentangle local from global conformational heterogeneity.
RESULTS: We consider the task of disentangling local from global heterogeneity as an inference problem. We use probabilistic methods to infer from the protein ensemble missing information on reference frames and stable conformational sub-states. To this end, we model a protein ensemble as a mixture of Gaussian probability distributions of either entire conformations or structural segments. We learn these models from a protein ensemble using the expectation-maximization algorithm. Our first model can be used to find multiple conformers in a structure ensemble. The second model partitions the protein chain into locally stable structural segments or core elements and less structured regions typically found in loops. Both models are simple to implement and contain only a single free parameter: the number of conformers or structural segments. Our models can be used to analyse experimental ensembles, molecular dynamics trajectories and conformational change in proteins. AVAILABILITY: The Python source code for protein ensemble analysis is available from the authors upon request.

Mesh:

Substances:

Year:  2008        PMID: 18662925     DOI: 10.1093/bioinformatics/btn396

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  A robust method for quantitative identification of ordered cores in an ensemble of biomolecular structures by non-linear multi-dimensional scaling using inter-atomic distance variance matrix.

Authors:  Naohiro Kobayashi
Journal:  J Biomol NMR       Date:  2014-01-03       Impact factor: 2.835

2.  Markov dynamic models for long-timescale protein motion.

Authors:  Tsung-Han Chiang; David Hsu; Jean-Claude Latombe
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

3.  Ensemble-based evaluation for protein structure models.

Authors:  Michal Jamroz; Andrzej Kolinski; Daisuke Kihara
Journal:  Bioinformatics       Date:  2016-06-15       Impact factor: 6.937

4.  A graph-based algorithm for detecting rigid domains in protein structures.

Authors:  Truong Khanh Linh Dang; Thach Nguyen; Michael Habeck; Mehmet Gültas; Stephan Waack
Journal:  BMC Bioinformatics       Date:  2021-02-12       Impact factor: 3.169

5.  ENCORE: Software for Quantitative Ensemble Comparison.

Authors:  Matteo Tiberti; Elena Papaleo; Tone Bengtsen; Wouter Boomsma; Kresten Lindorff-Larsen
Journal:  PLoS Comput Biol       Date:  2015-10-27       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.