| Literature DB >> 32971019 |
Alexey Strokach1, David Becerra2, Carles Corbi-Verge2, Albert Perez-Riba2, Philip M Kim3.
Abstract
Protein structure and function is determined by the arrangement of the linear sequence of amino acids in 3D space. We show that a deep graph neural network, ProteinSolver, can precisely design sequences that fold into a predetermined shape by phrasing this challenge as a constraint satisfaction problem (CSP), akin to Sudoku puzzles. We trained ProteinSolver on over 70,000,000 real protein sequences corresponding to over 80,000 structures. We show that our method rapidly designs new protein sequences and benchmark them in silico using energy-based scores, molecular dynamics, and structure prediction methods. As a proof-of-principle validation, we use ProteinSolver to generate sequences that match the structure of serum albumin, then synthesize the top-scoring design and validate it in vitro using circular dichroism. ProteinSolver is freely available at http://design.proteinsolver.org and https://gitlab.com/ostrokach/proteinsolver. A record of this paper's transparent peer review process is included in the Supplemental Information.Entities:
Keywords: constraint satisfaction problem; deep learning; graph neural networks; inverse protein folding; protein design; protein optimization
Year: 2020 PMID: 32971019 DOI: 10.1016/j.cels.2020.08.016
Source DB: PubMed Journal: Cell Syst ISSN: 2405-4712 Impact factor: 10.304