Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Bayesian optimization with evolutionary and structure-based regularization for directed protein evolution.

Literature DB >> 34210336

Bayesian optimization with evolutionary and structure-based regularization for directed protein evolution.

Trevor S Frisby¹, Christopher James Langmead².

Abstract

BACKGROUND: Directed evolution (DE) is a technique for protein engineering that involves iterative rounds of mutagenesis and screening to search for sequences that optimize a given property, such as binding affinity to a specified target. Unfortunately, the underlying optimization problem is under-determined, and so mutations introduced to improve the specified property may come at the expense of unmeasured, but nevertheless important properties (ex. solubility, thermostability, etc). We address this issue by formulating DE as a regularized Bayesian optimization problem where the regularization term reflects evolutionary or structure-based constraints.
RESULTS: We applied our approach to DE to three representative proteins, GB1, BRCA1, and SARS-CoV-2 Spike, and evaluated both evolutionary and structure-based regularization terms. The results of these experiments demonstrate that: (i) structure-based regularization usually leads to better designs (and never hurts), compared to the unregularized setting; (ii) evolutionary-based regularization tends to be least effective; and (iii) regularization leads to better designs because it effectively focuses the search in certain areas of sequence space, making better use of the experimental budget. Additionally, like previous work in Machine learning assisted DE, we find that our approach significantly reduces the experimental burden of DE, relative to model-free methods.
CONCLUSION: Introducing regularization into a Bayesian ML-assisted DE framework alters the exploratory patterns of the underlying optimization routine, and can shift variant selections towards those with a range of targeted and desirable properties. In particular, we find that structure-based regularization often improves variant selection compared to unregularized approaches, and never hurts.

Entities: Chemical Disease Gene Mutation Species

Keywords: Active learning; Bayesian optimization; Directed evolution; Gaussian process regression; Protein design; Protein language model; Rational design; Regularization

Year: 2021 PMID： 34210336 DOI： 10.1186/s13015-021-00195-4

Source DB: PubMed Journal: Algorithms Mol Biol ISSN： 1748-7188 Impact factor: 1.405

19 in total

Review 1. Epistasis in protein evolution.

Authors: Tyler N Starr; Joseph W Thornton
Journal: Protein Sci Date: 2016-02-28 Impact factor: 6.725

2. Evolution of stability in a cold-active enzyme elicits specificity relaxation and highlights substrate-related effects on temperature adaptation.

Authors: Pietro Gatti-Lafranconi; Antonino Natalello; Sascha Rehm; Silvia Maria Doglia; Jürgen Pleiss; Marina Lotti
Journal: J Mol Biol Date: 2009-10-20 Impact factor: 5.469

Review 3. Machine-learning-guided directed evolution for protein engineering.

Authors: Kevin K Yang; Zachary Wu; Frances H Arnold
Journal: Nat Methods Date: 2019-07-15 Impact factor: 28.547

Review 4. The de novo design of protein structures.

Authors: J S Richardson; D C Richardson
Journal: Trends Biochem Sci Date: 1989-07 Impact factor: 13.807

5. Hidden Markov models in computational biology. Applications to protein modeling.

Authors: A Krogh; M Brown; I S Mian; K Sjölander; D Haussler
Journal: J Mol Biol Date: 1994-02-04 Impact factor: 5.469

Review 6. Exploring protein fitness landscapes by directed evolution.

Authors: Philip A Romero; Frances H Arnold
Journal: Nat Rev Mol Cell Biol Date: 2009-12 Impact factor: 94.444

Review 7. Teaching old enzymes new tricks: engineering and evolution of glycosidases and glycosyl transferases for improved glycoside synthesis.

Authors: Fathima Aidha Shaikh; Stephen G Withers
Journal: Biochem Cell Biol Date: 2008-04 Impact factor: 3.626

Bayesian optimization with evolutionary and structure-based regularization for directed protein evolution.

Review 1. Epistasis in protein evolution.

2. Evolution of stability in a cold-active enzyme elicits specificity relaxation and highlights substrate-related effects on temperature adaptation.

Review 3. Machine-learning-guided directed evolution for protein engineering.

Review 4. The de novo design of protein structures.

5. Hidden Markov models in computational biology. Applications to protein modeling.

Review 6. Exploring protein fitness landscapes by directed evolution.

Review 7. Teaching old enzymes new tricks: engineering and evolution of glycosidases and glycosyl transferases for improved glycoside synthesis.

8. Selection of phage antibodies by binding affinity. Mimicking affinity maturation.

9. Evolution of a designed retro-aldolase leads to complete active site remodeling.

10. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences.

1. AMaLa: Analysis of Directed Evolution Experiments via Annealed Mutational Approximated Landscape.