Literature DB >> 35482186

Machine Learning-driven Protein Library Design: A Path Toward Smarter Libraries.

Mehrsa Mardikoraem1,2, Daniel Woldring3,4.   

Abstract

Proteins are small yet valuable biomolecules that play a versatile role in therapeutics and diagnostics. The intricate sequence-structure-function paradigm in the realm of proteins opens the possibility for directly mapping amino acid sequence to function. However, the rugged nature of the protein fitness landscape and an astronomical number of possible mutations even for small proteins make navigating this system a daunting task. Moreover, the scarcity of functional proteins and the ease with which deleterious mutations are introduced, due to complex epistatic relationships, compound the existing challenges. This highlights the need for auxiliary tools in current techniques such as rational design and directed evolution. To that end, the state-of-the-art machine learning can offer time and cost efficiency in finding high fitness proteins, circumventing unnecessary wet-lab experiments. In the context of improving library design, machine learning provides valuable insights via its unique features such as high adaptation to complex systems, multi-tasking, and parallelism, and the ability to capture hidden trends in input data. Finally, both the advancements in computational resources and the rapidly increasing number of sequences in protein databases will allow more promising and detailed insights delivered from machine learning to protein library design. In this chapter, fundamental concepts and a method for machine learning-driven library design leveraging deep sequencing datasets will be discussed. We elaborate on (1) basic knowledge about machine learning algorithms, (2) the benefit of machine learning in library design, and (3) methodology for implementing machine learning in library design.
© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.

Entities:  

Keywords:  Deep learning; Directed evolution; Library design; Machine learning

Mesh:

Substances:

Year:  2022        PMID: 35482186     DOI: 10.1007/978-1-0716-2285-8_5

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  38 in total

Review 1.  Cell signaling by receptor tyrosine kinases.

Authors:  J Schlessinger
Journal:  Cell       Date:  2000-10-13       Impact factor: 41.582

2.  Virtual screening using molecular simulations.

Authors:  Tianyi Yang; Johnny C Wu; Chunli Yan; Yuanfeng Wang; Ray Luo; Michael B Gonzales; Kevin N Dalby; Pengyu Ren
Journal:  Proteins       Date:  2011-04-12

3.  A novel approach to prediction of the 3-dimensional structures of protein backbones by neural networks.

Authors:  H Bohr; J Bohr; S Brunak; R M Cotterill; H Fredholm; B Lautrup; S B Petersen
Journal:  FEBS Lett       Date:  1990-02-12       Impact factor: 4.124

Review 4.  Bone morphogenetic proteins: multifunctional regulators of vertebrate development.

Authors:  B L Hogan
Journal:  Genes Dev       Date:  1996-07-01       Impact factor: 11.361

Review 5.  Protein engineering and the development of generic biosensors.

Authors:  H W Hellinga; J S Marvin
Journal:  Trends Biotechnol       Date:  1998-04       Impact factor: 19.536

Review 6.  Machine-learning-guided directed evolution for protein engineering.

Authors:  Kevin K Yang; Zachary Wu; Frances H Arnold
Journal:  Nat Methods       Date:  2019-07-15       Impact factor: 28.547

Review 7.  Deep sequencing methods for protein engineering and design.

Authors:  Emily E Wrenbeck; Matthew S Faber; Timothy A Whitehead
Journal:  Curr Opin Struct Biol       Date:  2016-11-22       Impact factor: 6.809

8.  A novel affinity protein selection system based on staphylococcal cell surface display and flow cytometry.

Authors:  Nina Kronqvist; John Löfblom; Andreas Jonsson; Henrik Wernérus; Stefan Ståhl
Journal:  Protein Eng Des Sel       Date:  2008-01-31       Impact factor: 1.650

Review 9.  Regulation, Signaling, and Physiological Functions of G-Proteins.

Authors:  Viktoriya Syrovatkina; Kamela O Alegre; Raja Dey; Xin-Yun Huang
Journal:  J Mol Biol       Date:  2016-08-08       Impact factor: 5.469

10.  Prediction of membrane transport proteins and their substrate specificities using primary sequence information.

Authors:  Nitish K Mishra; Junil Chang; Patrick X Zhao
Journal:  PLoS One       Date:  2014-06-26       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.