Literature DB >> 26203626

A Data-Driven Evolutionary Algorithm for Mapping Multibasin Protein Energy Landscapes.

Rudy Clausen1, Amarda Shehu1,2,3.   

Abstract

Evidence is emerging that many proteins involved in proteinopathies are dynamic molecules switching between stable and semistable structures to modulate their function. A detailed understanding of the relationship between structure and function in such molecules demands a comprehensive characterization of their conformation space. Currently, only stochastic optimization methods are capable of exploring conformation spaces to obtain sample-based representations of associated energy surfaces. These methods have to address the fundamental but challenging issue of balancing computational resources between exploration (obtaining a broad view of the space) and exploitation (going deep in the energy surface). We propose a novel algorithm that strikes an effective balance by employing concepts from evolutionary computation. The algorithm leverages deposited crystal structures of wildtype and variant sequences of a protein to define a reduced, low-dimensional search space from where to rapidly draw samples. A multiscale technique maps samples to local minima of the all-atom energy surface of a protein under investigation. Several novel algorithmic strategies are employed to avoid premature convergence to particular minima and obtain a broad view of a possibly multibasin energy surface. Analysis of applications on different proteins demonstrates the broad utility of the algorithm to map multibasin energy landscapes and advance modeling of multibasin proteins. In particular, applications on wildtype and variant sequences of proteins involved in proteinopathies demonstrate that the algorithm makes an important first step toward understanding the impact of sequence mutations on misfunction by providing the energy landscape as the intermediate explanatory link between protein sequence and function.

Entities:  

Keywords:  dimensionality reduction; evolutionary algorithm; multibasin energy landscape; multiscale modeling; protein structure modeling

Mesh:

Substances:

Year:  2015        PMID: 26203626     DOI: 10.1089/cmb.2015.0107

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

Review 1.  Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics.

Authors:  Tatiana Maximova; Ryan Moffatt; Buyong Ma; Ruth Nussinov; Amarda Shehu
Journal:  PLoS Comput Biol       Date:  2016-04-28       Impact factor: 4.475

2.  Maintaining and Enhancing Diversity of Sampled Protein Conformations in Robotics-Inspired Methods.

Authors:  Jayvee R Abella; Mark Moll; Lydia E Kavraki
Journal:  J Comput Biol       Date:  2017-10-16       Impact factor: 1.479

3.  Computing energy landscape maps and structural excursions of proteins.

Authors:  Emmanuel Sapin; Daniel B Carr; Kenneth A De Jong; Amarda Shehu
Journal:  BMC Genomics       Date:  2016-08-18       Impact factor: 3.969

4.  From mutations to mechanisms and dysfunction via computation and mining of protein energy landscapes.

Authors:  Wanli Qiao; Nasrin Akhter; Xiaowen Fang; Tatiana Maximova; Erion Plaku; Amarda Shehu
Journal:  BMC Genomics       Date:  2018-09-24       Impact factor: 3.969

5.  Evaluating Autoencoder-Based Featurization and Supervised Learning for Protein Decoy Selection.

Authors:  Fardina Fathmiul Alam; Taseef Rahman; Amarda Shehu
Journal:  Molecules       Date:  2020-03-04       Impact factor: 4.411

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.