| Literature DB >> 35874311 |
Mrinal Shekhar1, Genki Terashi2, Chitrak Gupta3,4, Daipayan Sarkar2,3, Gaspard Debussche5, Nicholas J Sisco3,6, Jonathan Nguyen3,4, Arup Mondal7, John Vant3,4, Petra Fromme3,4, Wade D Van Horn3,6, Emad Tajkhorshid1, Daisuke Kihara2,8, Ken Dill9, Alberto Perez7, Abhishek Singharoy3,4.
Abstract
Cryo-electron microscopy (EM) requires molecular modeling to refine structural details from data. Ensemble models arrive at low free-energy molecular structures, but are computationally expensive and limited to resolving only small proteins that cannot be resolved by cryo-EM. Here, we introduce CryoFold - a pipeline of molecular dynamics simulations that determines ensembles of protein structures directly from sequence by integrating density data of varying sparsity at 3-5 Å resolution with coarse-grained topological knowledge of the protein folds. We present six examples showing its broad applicability for folding proteins between 72 to 2000 residues, including large membrane and multi-domain systems, and results from two EMDB competitions. Driven by data from a single state, CryoFold discovers ensembles of common low-energy models together with rare low-probability structures that capture the equilibrium distribution of proteins constrained by the density maps. Many of these conformations, unseen by traditional methods, are experimentally validated and functionally relevant. We arrive at a set of best practices for data-guided protein folding that are controlled using a Python GUI.Entities:
Year: 2021 PMID: 35874311 PMCID: PMC9302471 DOI: 10.1016/j.matt.2021.09.004
Source DB: PubMed Journal: Matter ISSN: 2590-2385