Literature DB >> 29234997

Learning epistatic interactions from sequence-activity data to predict enantioselectivity.

Julian Zaugg1, Yosephine Gumulya2, Alpeshkumar K Malde2, Mikael Bodén3.   

Abstract

Enzymes with a high selectivity are desirable for improving economics of chemical synthesis of enantiopure compounds. To improve enzyme selectivity mutations are often introduced near the catalytic active site. In this compact environment epistatic interactions between residues, where contributions to selectivity are non-additive, play a significant role in determining the degree of selectivity. Using support vector machine regression models we map mutations to the experimentally characterised enantioselectivities for a set of 136 variants of the epoxide hydrolase from the fungus Aspergillus niger (AnEH). We investigate whether the influence a mutation has on enzyme selectivity can be accurately predicted through linear models, and whether prediction accuracy can be improved using higher-order counterparts. Comparing linear and polynomial degree = 2 models, mean Pearson coefficients (r) from [Formula: see text]-fold cross-validation increase from 0.84 to 0.91 respectively. Equivalent models tested on interaction-minimised sequences achieve values of [Formula: see text] and [Formula: see text]. As expected, testing on a simulated control data set with no interactions results in no significant improvements from higher-order models. Additional experimentally derived AnEH mutants are tested with linear and polynomial degree = 2 models, with values increasing from [Formula: see text] to [Formula: see text] respectively. The study demonstrates that linear models perform well, however the representation of epistatic interactions in predictive models improves identification of selectivity-enhancing mutations. The improvement is attributed to higher-order kernel functions that represent epistatic interactions between residues.

Entities:  

Keywords:  Aspergillus niger; Bioinformatics; Epoxide hydrolase; Fitness; Machine learning; Non-additive; Support vector machine

Mesh:

Substances:

Year:  2017        PMID: 29234997     DOI: 10.1007/s10822-017-0090-x

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  56 in total

1.  A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.

Authors:  S Whelan; N Goldman
Journal:  Mol Biol Evol       Date:  2001-05       Impact factor: 16.240

2.  How mutational epistasis impairs predictability in protein evolution and design.

Authors:  Charlotte M Miton; Nobuhiko Tokuriki
Journal:  Protein Sci       Date:  2016-01-22       Impact factor: 6.725

Review 3.  Epistasis in protein evolution.

Authors:  Tyler N Starr; Joseph W Thornton
Journal:  Protein Sci       Date:  2016-02-28       Impact factor: 6.725

4.  Characterization of the Enantioselective Properties of the Quinohemoprotein Alcohol Dehydrogenase of Acetobacter pasteurianus LMG 1635. 1. Different Enantiomeric Ratios of Whole Cells and Purified Enzyme in the Kinetic Resolution of Racemic Glycidol.

Authors:  S S Machado; U Wandel; J A Jongejan; A J Straathof; J A Duine
Journal:  Biosci Biotechnol Biochem       Date:  1999       Impact factor: 2.043

5.  A comprehensive analysis of the thermodynamic events involved in ligand-receptor binding using CoRIA and its variants.

Authors:  Jitender Verma; Vijay M Khedkar; Arati S Prabhu; Santosh A Khedkar; Alpeshkumar K Malde; Evans C Coutinho
Journal:  J Comput Aided Mol Des       Date:  2008-01-25       Impact factor: 3.686

6.  A diverse family of thermostable cytochrome P450s created by recombination of stabilizing fragments.

Authors:  Yougen Li; D Allan Drummond; Andrew M Sawayama; Christopher D Snow; Jesse D Bloom; Frances H Arnold
Journal:  Nat Biotechnol       Date:  2007-08-26       Impact factor: 54.908

7.  Navigating the protein fitness landscape with Gaussian processes.

Authors:  Philip A Romero; Andreas Krause; Frances H Arnold
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-31       Impact factor: 11.205

8.  Protein redesign by learning from data.

Authors:  Bastiaan A van den Berg; Marcel J T Reinders; Jan-Metske van der Laan; Johannes A Roubos; Dick de Ridder
Journal:  Protein Eng Des Sel       Date:  2014-07-30       Impact factor: 1.650

9.  New Concepts for Increasing the Efficiency in Directed Evolution of Stereoselective Enzymes.

Authors:  Zhoutong Sun; Ylva Wikmark; Jan-E Bäckvall; Manfred T Reetz
Journal:  Chemistry       Date:  2016-02-23       Impact factor: 5.236

10.  Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization.

Authors:  Claire N Bedbrook; Kevin K Yang; Austin J Rice; Viviana Gradinaru; Frances H Arnold
Journal:  PLoS Comput Biol       Date:  2017-10-23       Impact factor: 4.475

View more
  1 in total

1.  Learned protein embeddings for machine learning.

Authors:  Kevin K Yang; Zachary Wu; Claire N Bedbrook; Frances H Arnold
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.