Literature DB >> 29030274

Predicting novel substrates for enzymes with minimal experimental effort with active learning.

Dante A Pertusi1, Matthew E Moura1, James G Jeffryes2, Siddhant Prabhu1, Bradley Walters Biggs1, Keith E J Tyo3.   

Abstract

Enzymatic substrate promiscuity is more ubiquitous than previously thought, with significant consequences for understanding metabolism and its application to biocatalysis. This realization has given rise to the need for efficient characterization of enzyme promiscuity. Enzyme promiscuity is currently characterized with a limited number of human-selected compounds that may not be representative of the enzyme's versatility. While testing large numbers of compounds may be impractical, computational approaches can exploit existing data to determine the most informative substrates to test next, thereby more thoroughly exploring an enzyme's versatility. To demonstrate this, we used existing studies and tested compounds for four different enzymes, developed support vector machine (SVM) models using these datasets, and selected additional compounds for experiments using an active learning approach. SVMs trained on a chemically diverse set of compounds were discovered to achieve maximum accuracies of ~80% using ~33% fewer compounds than datasets based on all compounds tested in existing studies. Active learning-selected compounds for testing resolved apparent conflicts in the existing training data, while adding diversity to the dataset. The application of these algorithms to wide arrays of metabolic enzymes would result in a library of SVMs that can predict high-probability promiscuous enzymatic reactions and could prove a valuable resource for the design of novel metabolic pathways.
Copyright © 2017 International Metabolic Engineering Society. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Active learning; Enzyme promiscuity; Machine learning

Mesh:

Year:  2017        PMID: 29030274      PMCID: PMC7055960          DOI: 10.1016/j.ymben.2017.09.016

Source DB:  PubMed          Journal:  Metab Eng        ISSN: 1096-7176            Impact factor:   9.783


  39 in total

1.  Active learning with support vector machines in the drug discovery process.

Authors:  Manfred K Warmuth; Jun Liao; Gunnar Rätsch; Michael Mathieson; Santosh Putta; Christian Lemmen
Journal:  J Chem Inf Comput Sci       Date:  2003 Mar-Apr

Review 2.  Enzyme promiscuity: a mechanistic and evolutionary perspective.

Authors:  Olga Khersonsky; Dan S Tawfik
Journal:  Annu Rev Biochem       Date:  2010       Impact factor: 23.643

3.  Ligand-based models for the isoform specificity of cytochrome P450 3A4, 2D6, and 2C9 substrates.

Authors:  Lothar Terfloth; Bruno Bienfait; Johann Gasteiger
Journal:  J Chem Inf Model       Date:  2007-07-03       Impact factor: 4.956

4.  The subtle benefits of being promiscuous: adaptive evolution potentiated by enzyme promiscuity.

Authors:  Mark A Depristo
Journal:  HFSP J       Date:  2007-07-10

5.  Inhibitory cross-talk upon introduction of a new metabolic pathway into an existing metabolic network.

Authors:  Juhan Kim; Shelley D Copley
Journal:  Proc Natl Acad Sci U S A       Date:  2012-09-14       Impact factor: 11.205

Review 6.  Metabolite damage and its repair or pre-emption.

Authors:  Carole L Linster; Emile Van Schaftingen; Andrew D Hanson
Journal:  Nat Chem Biol       Date:  2013-02       Impact factor: 15.040

7.  Characterizing and predicting carboxylic acid reductase activity for diversifying bioaldehyde production.

Authors:  Matthew Moura; Dante Pertusi; Stephen Lenzini; Namita Bhan; Linda J Broadbelt; Keith E J Tyo
Journal:  Biotechnol Bioeng       Date:  2015-11-10       Impact factor: 4.530

8.  Generation of an atlas for commodity chemical production in Escherichia coli and a novel pathway prediction algorithm, GEM-Path.

Authors:  Miguel A Campodonico; Barbara A Andrews; Juan A Asenjo; Bernhard O Palsson; Adam M Feist
Journal:  Metab Eng       Date:  2014-07-28       Impact factor: 9.783

9.  Open Babel: An open chemical toolbox.

Authors:  Noel M O'Boyle; Michael Banck; Craig A James; Chris Morley; Tim Vandermeersch; Geoffrey R Hutchison
Journal:  J Cheminform       Date:  2011-10-07       Impact factor: 5.514

10.  Pybel: a Python wrapper for the OpenBabel cheminformatics toolkit.

Authors:  Noel M O'Boyle; Chris Morley; Geoffrey R Hutchison
Journal:  Chem Cent J       Date:  2008-03-09       Impact factor: 4.215

View more
  5 in total

1.  Biochemical control systems for small molecule damage in plants.

Authors:  M Hüdig; J Schmitz; M K M Engqvist; V G Maurino
Journal:  Plant Signal Behav       Date:  2018-06-26

Review 2.  Specifics of Metabolite-Protein Interactions and Their Computational Analysis and Prediction.

Authors:  Dirk Walther
Journal:  Methods Mol Biol       Date:  2023

3.  Data-driven discovery of cardiolipin-selective small molecules by computational active learning.

Authors:  Bernadette Mohr; Kirill Shmilovich; Isabel S Kleinwächter; Dirk Schneider; Andrew L Ferguson; Tristan Bereau
Journal:  Chem Sci       Date:  2022-03-02       Impact factor: 9.969

4.  A semi-supervised machine learning framework for microRNA classification.

Authors:  Mohsen Sheikh Hassani; James R Green
Journal:  Hum Genomics       Date:  2019-10-22       Impact factor: 4.639

Review 5.  Key Enzymes in Fatty Acid Synthesis Pathway for Bioactive Lipids Biosynthesis.

Authors:  Xiao-Yan Zhuang; Yong-Hui Zhang; An-Feng Xiao; Ai-Hui Zhang; Bai-Shan Fang
Journal:  Front Nutr       Date:  2022-02-23
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.