Literature DB >> 31636460

Unified rational protein engineering with sequence-based deep representation learning.

Ethan C Alley1,2, Grigory Khimulya, Surojit Biswas1,3, Mohammed AlQuraishi4, George M Church5,6.   

Abstract

Rational protein engineering requires a holistic understanding of protein function. Here, we apply deep learning to unlabeled amino-acid sequences to distill the fundamental features of a protein into a statistical representation that is semantically rich and structurally, evolutionarily and biophysically grounded. We show that the simplest models built on top of this unified representation (UniRep) are broadly applicable and generalize to unseen regions of sequence space. Our data-driven approach predicts the stability of natural and de novo designed proteins, and the quantitative function of molecularly diverse mutants, competitively with the state-of-the-art methods. UniRep further enables two orders of magnitude efficiency improvement in a protein engineering task. UniRep is a versatile summary of fundamental protein features that can be applied across protein engineering informatics.

Entities:  

Mesh:

Year:  2019        PMID: 31636460      PMCID: PMC7067682          DOI: 10.1038/s41592-019-0598-1

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


  43 in total

Review 1.  Methods for the directed evolution of proteins.

Authors:  Michael S Packer; David R Liu
Journal:  Nat Rev Genet       Date:  2015-06-09       Impact factor: 53.242

Review 2.  Computational protein design: a review.

Authors:  Ivan Coluzza
Journal:  J Phys Condens Matter       Date:  2017-01-31       Impact factor: 2.333

3.  Navigating the protein fitness landscape with Gaussian processes.

Authors:  Philip A Romero; Andreas Krause; Frances H Arnold
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-31       Impact factor: 11.205

Review 4.  The coming of age of de novo protein design.

Authors:  Po-Ssu Huang; Scott E Boyken; David Baker
Journal:  Nature       Date:  2016-09-15       Impact factor: 49.962

5.  Programming molecular self-assembly of intrinsically disordered proteins containing sequences of low complexity.

Authors:  Joseph R Simon; Nick J Carroll; Michael Rubinstein; Ashutosh Chilkoti; Gabriel P López
Journal:  Nat Chem       Date:  2017-01-30       Impact factor: 24.427

6.  Improving catalytic function by ProSAR-driven enzyme evolution.

Authors:  Richard J Fox; S Christopher Davis; Emily C Mundorff; Lisa M Newman; Vesna Gavrilovic; Steven K Ma; Loleta M Chung; Charlene Ching; Sarena Tam; Sheela Muley; John Grate; John Gruber; John C Whitman; Roger A Sheldon; Gjalt W Huisman
Journal:  Nat Biotechnol       Date:  2007-02-18       Impact factor: 54.908

Review 7.  Exploring protein fitness landscapes by directed evolution.

Authors:  Philip A Romero; Frances H Arnold
Journal:  Nat Rev Mol Cell Biol       Date:  2009-12       Impact factor: 94.444

8.  Global analysis of protein folding using massively parallel design, synthesis, and testing.

Authors:  Gabriel J Rocklin; Tamuka M Chidyausiku; Inna Goreshnik; Alex Ford; Scott Houliston; Alexander Lemak; Lauren Carter; Rashmi Ravichandran; Vikram K Mulligan; Aaron Chevalier; Cheryl H Arrowsmith; David Baker
Journal:  Science       Date:  2017-07-14       Impact factor: 47.728

9.  Engineering an allosteric transcription factor to respond to new ligands.

Authors:  Noah D Taylor; Alexander S Garruss; Rocco Moretti; Sum Chan; Mark A Arbing; Duilio Cascio; Jameson K Rogers; Farren J Isaacs; Sriram Kosuri; David Baker; Stanley Fields; George M Church; Srivatsan Raman
Journal:  Nat Methods       Date:  2015-12-21       Impact factor: 28.547

10.  Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization.

Authors:  Claire N Bedbrook; Kevin K Yang; Austin J Rice; Viviana Gradinaru; Frances H Arnold
Journal:  PLoS Comput Biol       Date:  2017-10-23       Impact factor: 4.475

View more
  105 in total

Review 1.  Generative chemistry: drug discovery with deep learning generative models.

Authors:  Yuemin Bian; Xiang-Qun Xie
Journal:  J Mol Model       Date:  2021-02-04       Impact factor: 1.810

Review 2.  Genetically Encodable Fluorescent and Bioluminescent Biosensors Light Up Signaling Networks.

Authors:  Xin Zhou; Sohum Mehta; Jin Zhang
Journal:  Trends Biochem Sci       Date:  2020-07-10       Impact factor: 13.807

3.  Density Peak clustering of protein sequences associated to a Pfam clan reveals clear similarities and interesting differences with respect to manual family annotation.

Authors:  Alessandro Laio; Marco Punta; Elena Tea Russo
Journal:  BMC Bioinformatics       Date:  2021-03-12       Impact factor: 3.169

4.  Interpretable detection of novel human viruses from genome sequencing data.

Authors:  Jakub M Bartoszewicz; Anja Seidel; Bernhard Y Renard
Journal:  NAR Genom Bioinform       Date:  2021-02-01

Review 5.  A guide to machine learning for biologists.

Authors:  Joe G Greener; Shaun M Kandathil; Lewis Moffat; David T Jones
Journal:  Nat Rev Mol Cell Biol       Date:  2021-09-13       Impact factor: 94.444

Review 6.  Learning Strategies in Protein Directed Evolution.

Authors:  Xavier F Cadet; Jean Christophe Gelly; Aster van Noord; Frédéric Cadet; Carlos G Acevedo-Rocha
Journal:  Methods Mol Biol       Date:  2022

7.  SidechainNet: An all-atom protein structure dataset for machine learning.

Authors:  Jonathan Edward King; David Ryan Koes
Journal:  Proteins       Date:  2021-07-12

8.  Deep Learning for Protein-Protein Interaction Site Prediction.

Authors:  Arian R Jamasb; Ben Day; Cătălina Cangea; Pietro Liò; Tom L Blundell
Journal:  Methods Mol Biol       Date:  2021

Review 9.  Synthetic biology in the clinic: engineering vaccines, diagnostics, and therapeutics.

Authors:  Xiao Tan; Justin H Letendre; James J Collins; Wilson W Wong
Journal:  Cell       Date:  2021-02-10       Impact factor: 41.582

10.  Identification of Sub-Golgi protein localization by use of deep representation learning features.

Authors:  Zhibin Lv; Pingping Wang; Quan Zou; Qinghua Jiang
Journal:  Bioinformatics       Date:  2020-12-26       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.