Literature DB >> 33828272

Low-N protein engineering with data-efficient deep learning.

Surojit Biswas1,2, Grigory Khimulya3, Ethan C Alley4, Kevin M Esvelt4, George M Church5,6.   

Abstract

Protein engineering has enormous academic and industrial potential. However, it is limited by the lack of experimental assays that are consistent with the design goal and sufficiently high throughput to find rare, enhanced variants. Here we introduce a machine learning-guided paradigm that can use as few as 24 functionally assayed mutant sequences to build an accurate virtual fitness landscape and screen ten million sequences via in silico directed evolution. As demonstrated in two dissimilar proteins, GFP from Aequorea victoria (avGFP) and E. coli strain TEM-1 β-lactamase, top candidates from a single round are diverse and as active as engineered mutants obtained from previous high-throughput efforts. By distilling information from natural protein sequence landscapes, our model learns a latent representation of 'unnaturalness', which helps to guide search away from nonfunctional sequence neighborhoods. Subsequent low-N supervision then identifies improvements to the activity of interest. In sum, our approach enables efficient use of resource-intensive high-fidelity assays without sacrificing throughput, and helps to accelerate engineered proteins into the fermenter, field and clinic.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33828272     DOI: 10.1038/s41592-021-01100-y

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


  1 in total
  1 in total
  34 in total

Review 1.  Learning Strategies in Protein Directed Evolution.

Authors:  Xavier F Cadet; Jean Christophe Gelly; Aster van Noord; Frédéric Cadet; Carlos G Acevedo-Rocha
Journal:  Methods Mol Biol       Date:  2022

Review 2.  Progress and challenges for the machine learning-based design of fit-for-purpose monoclonal antibodies.

Authors:  Rahmad Akbar; Habib Bashour; Puneet Rawat; Philippe A Robert; Eva Smorodina; Tudor-Stefan Cotet; Karine Flem-Karlsen; Robert Frank; Brij Bhushan Mehta; Mai Ha Vu; Talip Zengin; Jose Gutierrez-Marcos; Fridtjof Lund-Johansen; Jan Terje Andersen; Victor Greiff
Journal:  MAbs       Date:  2022 Jan-Dec       Impact factor: 5.857

3.  Natural Evolution Provides Strong Hints about Laboratory Evolution of Designer Enzymes.

Authors:  Wen Jun Xie; Arieh Warshel
Journal:  Proc Natl Acad Sci U S A       Date:  2022-07-28       Impact factor: 12.779

4.  Recombineering and MAGE.

Authors:  Timothy M Wannier; Peter N Ciaccia; Andrew D Ellington; Gabriel T Filsinger; Farren J Isaacs; Kamyab Javanmardi; Michaela A Jones; Aditya M Kunjapur; Akos Nyerges; Csaba Pal; Max G Schubert; George M Church
Journal:  Nat Rev Methods Primers       Date:  2021-01-14

5.  Synthetic Biology Meets Machine Learning.

Authors:  Brendan Fu-Long Sieow; Ryan De Sotto; Zhi Ren Darren Seet; In Young Hwang; Matthew Wook Chang
Journal:  Methods Mol Biol       Date:  2023

6.  Heterogeneity of the GFP fitness landscape and data-driven protein design.

Authors:  Louisa Gonzalez Somermeyer; Aubin Fleiss; Alexander S Mishin; Nina G Bozhanova; Anna A Igolkina; Jens Meiler; Maria-Elisenda Alaball Pujol; Ekaterina V Putintseva; Karen S Sarkisyan; Fyodor A Kondrashov
Journal:  Elife       Date:  2022-05-05       Impact factor: 8.713

Review 7.  Deep Learning Concepts and Applications for Synthetic Biology.

Authors:  William A V Beardall; Guy-Bart Stan; Mary J Dunlop
Journal:  GEN Biotechnol       Date:  2022-08-18

8.  Relation Between the Number of Peaks and the Number of Reciprocal Sign Epistatic Interactions.

Authors:  Raimundo Saona; Fyodor A Kondrashov; Ksenia A Khudiakova
Journal:  Bull Math Biol       Date:  2022-06-17       Impact factor: 3.871

Review 9.  Differentiable biology: using deep learning for biophysics-based and data-driven modeling of molecular mechanisms.

Authors:  Mohammed AlQuraishi; Peter K Sorger
Journal:  Nat Methods       Date:  2021-10-04       Impact factor: 28.547

10.  Deep representation learning improves prediction of LacI-mediated transcriptional repression.

Authors:  Alexander S Garruss; Katherine M Collins; George M Church
Journal:  Proc Natl Acad Sci U S A       Date:  2021-07-06       Impact factor: 12.779

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.