Literature DB >> 31308553

Machine-learning-guided directed evolution for protein engineering.

Kevin K Yang1, Zachary Wu1, Frances H Arnold2.   

Abstract

Protein engineering through machine-learning-guided directed evolution enables the optimization of protein functions. Machine-learning approaches predict how sequence maps to function in a data-driven manner without requiring a detailed model of the underlying physics or biological pathways. Such methods accelerate directed evolution by learning from the properties of characterized variants and using that information to select sequences that are likely to exhibit improved properties. Here we introduce the steps required to build machine-learning sequence-function models and to use those models to guide engineering, making recommendations at each stage. This review covers basic concepts relevant to the use of machine learning for protein engineering, as well as the current literature and applications of this engineering paradigm. We illustrate the process with two case studies. Finally, we look to future opportunities for machine learning to enable the discovery of unknown protein functions and uncover the relationship between protein sequence and function.

Mesh:

Substances:

Year:  2019        PMID: 31308553     DOI: 10.1038/s41592-019-0496-6

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


  100 in total

1.  An enzyme-based biosensor for monitoring and engineering protein stability in vivo.

Authors:  Chang Ren; Xin Wen; Jun Mencius; Shu Quan
Journal:  Proc Natl Acad Sci U S A       Date:  2021-03-30       Impact factor: 11.205

Review 2.  Biotechnology and bioengineering of pullulanase: state of the art and perspectives.

Authors:  Pei Xu; Shi-Yu Zhang; Zhi-Gang Luo; Min-Hua Zong; Xiao-Xi Li; Wen-Yong Lou
Journal:  World J Microbiol Biotechnol       Date:  2021-02-06       Impact factor: 3.312

Review 3.  Biosynthetic Approaches towards the Design of Artificial Hydrogen-Evolution Catalysts.

Authors:  Pallavi Prasad; Dhanashree Selvan; Saumen Chakraborty
Journal:  Chemistry       Date:  2020-08-26       Impact factor: 5.236

4.  Assessing the Risks Posed by the Convergence of Artificial Intelligence and Biotechnology.

Authors:  John T O'Brien; Cassidy Nelson
Journal:  Health Secur       Date:  2020 May/Jun

Review 5.  A guide to machine learning for biologists.

Authors:  Joe G Greener; Shaun M Kandathil; Lewis Moffat; David T Jones
Journal:  Nat Rev Mol Cell Biol       Date:  2021-09-13       Impact factor: 94.444

6.  Minimum epistasis interpolation for sequence-function relationships.

Authors:  Juannan Zhou; David M McCandlish
Journal:  Nat Commun       Date:  2020-04-14       Impact factor: 14.919

Review 7.  Ru(II)-diimine complexes and cytochrome P450 working hand-in-hand.

Authors:  Celine Eidenschenk; Lionel Cheruzel
Journal:  J Inorg Biochem       Date:  2020-09-12       Impact factor: 4.155

8.  Inferring Protein Sequence-Function Relationships with Large-Scale Positive-Unlabeled Learning.

Authors:  Hyebin Song; Bennett J Bremer; Emily C Hinds; Garvesh Raskutti; Philip A Romero
Journal:  Cell Syst       Date:  2020-11-18       Impact factor: 10.304

9.  Integration of Proteomics and Other Omics Data.

Authors:  Mengyun Wu; Yu Jiang; Shuangge Ma
Journal:  Methods Mol Biol       Date:  2021

10.  Utilizing graph machine learning within drug discovery and development.

Authors:  Thomas Gaudelet; Ben Day; Arian R Jamasb; Jyothish Soman; Cristian Regep; Gertrude Liu; Jeremy B R Hayter; Richard Vickers; Charles Roberts; Jian Tang; David Roblin; Tom L Blundell; Michael M Bronstein; Jake P Taylor-King
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.