| Literature DB >> 28706065 |
Gabriel J Rocklin1, Tamuka M Chidyausiku1,2, Inna Goreshnik1, Alex Ford1,2, Scott Houliston3,4, Alexander Lemak3, Lauren Carter1, Rashmi Ravichandran1, Vikram K Mulligan1, Aaron Chevalier1, Cheryl H Arrowsmith3,4,5, David Baker6,7.
Abstract
Proteins fold into unique native structures stabilized by thousands of weak interactions that collectively overcome the entropic cost of folding. Although these forces are "encoded" in the thousands of known protein structures, "decoding" them is challenging because of the complexity of natural proteins that have evolved for function, not stability. We combined computational protein design, next-generation gene synthesis, and a high-throughput protease susceptibility assay to measure folding and stability for more than 15,000 de novo designed miniproteins, 1000 natural proteins, 10,000 point mutants, and 30,000 negative control sequences. This analysis identified more than 2500 stable designed proteins in four basic folds-a number sufficient to enable us to systematically examine how sequence determines folding and stability in uncharted protein space. Iteration between design and experiment increased the design success rate from 6% to 47%, produced stable proteins unlike those found in nature for topologies where design was initially unsuccessful, and revealed subtle contributions to stability as designs became increasingly optimized. Our approach achieves the long-standing goal of a tight feedback cycle between computation and experiment and has the potential to transform computational protein design into a data-driven science.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28706065 PMCID: PMC5568797 DOI: 10.1126/science.aan0693
Source DB: PubMed Journal: Science ISSN: 0036-8075 Impact factor: 47.728