Literature DB >> 27007080

Semisupervised Gaussian Process for Automated Enzyme Search.

Joseph Mellor1,2, Ioana Grigoras3, Pablo Carbonell4, Jean-Loup Faulon1,3,4,5.   

Abstract

Synthetic biology is today harnessing the design of novel and greener biosynthesis routes for the production of added-value chemicals and natural products. The design of novel pathways often requires a detailed selection of enzyme sequences to import into the chassis at each of the reaction steps. To address such design requirements in an automated way, we present here a tool for exploring the space of enzymatic reactions. Given a reaction and an enzyme the tool provides a probability estimate that the enzyme catalyzes the reaction. Our tool first considers the similarity of a reaction to known biochemical reactions with respect to signatures around their reaction centers. Signatures are defined based on chemical transformation rules by using extended connectivity fingerprint descriptors. A semisupervised Gaussian process model associated with the similar known reactions then provides the probability estimate. The Gaussian process model uses information about both the reaction and the enzyme in providing the estimate. These estimates were validated experimentally by the application of the Gaussian process model to a newly identified metabolite in Escherichia coli in order to search for the enzymes catalyzing its associated reactions. Furthermore, we show with several pathway design examples how such ability to assign probability estimates to enzymatic reactions provides the potential to assist in bioengineering applications, providing experimental validation to our proposed approach. To the best of our knowledge, the proposed approach is the first application of Gaussian processes dealing with biological sequences and chemicals, the use of a semisupervised Gaussian process framework is also novel in the context of machine learning applied to bioinformatics. However, the ability of an enzyme to catalyze a reaction depends on the affinity between the substrates of the reaction and the enzyme. This affinity is generally quantified by the Michaelis constant KM. Therefore, we also demonstrate using Gaussian process regression to predict KM given a substrate-enzyme pair.

Entities:  

Keywords:  Gaussian process regression; enzyme kinetics; enzyme screening; metabolic engineering; reaction fingerprint; semisupervised Gaussian process

Mesh:

Substances:

Year:  2016        PMID: 27007080     DOI: 10.1021/acssynbio.5b00294

Source DB:  PubMed          Journal:  ACS Synth Biol        ISSN: 2161-5063            Impact factor:   5.110


  11 in total

Review 1.  Computational Approaches to Design and Test Plant Synthetic Metabolic Pathways.

Authors:  Anika Küken; Zoran Nikoloski
Journal:  Plant Physiol       Date:  2019-01-15       Impact factor: 8.340

2.  SpeedyGenesXL: an Automated, High-Throughput Platform for the Preparation of Bespoke Ultralarge Variant Libraries for Directed Evolution.

Authors:  Joanna C Sadler; Neil Swainston; Mark S Dunstan; Andrew Currin; Douglas B Kell
Journal:  Methods Mol Biol       Date:  2022

3.  Synthetic Biology Meets Machine Learning.

Authors:  Brendan Fu-Long Sieow; Ryan De Sotto; Zhi Ren Darren Seet; In Young Hwang; Matthew Wook Chang
Journal:  Methods Mol Biol       Date:  2023

4.  Boost-RS: boosted embeddings for recommender systems and its application to enzyme-substrate interaction prediction.

Authors:  Xinmeng Li; Li-Ping Liu; Soha Hassoun
Journal:  Bioinformatics       Date:  2022-05-13       Impact factor: 6.931

5.  Molecular structures enumeration and virtual screening in the chemical space with RetroPath2.0.

Authors:  Mathilde Koch; Thomas Duigou; Pablo Carbonell; Jean-Loup Faulon
Journal:  J Cheminform       Date:  2017-12-19       Impact factor: 5.514

6.  SYNBIOCHEM Synthetic Biology Research Centre, Manchester - A UK foundry for fine and speciality chemicals production.

Authors:  Le Feuvre Ra; Carbonell P; Currin A; Dunstan M; Fellows D; Jervis Aj; Rattray Njw; Robinson Cj; Swainston N; Vinaixa M; Williams A; Yan C; Barran P; Breitling R; Chen Gg; Faulon Jl; Goble C; Goodacre R; Kell Db; Micklefield J; Scrutton Ns; Shapira P; Takano E; Turner Nj
Journal:  Synth Syst Biotechnol       Date:  2016-08-25

7.  DEEPre: sequence-based enzyme EC number prediction by deep learning.

Authors:  Yu Li; Sheng Wang; Ramzan Umarov; Bingqing Xie; Ming Fan; Lihua Li; Xin Gao
Journal:  Bioinformatics       Date:  2018-03-01       Impact factor: 6.937

8.  Machine learning applied to enzyme turnover numbers reveals protein structural correlates and improves metabolic models.

Authors:  David Heckmann; Colton J Lloyd; Nathan Mih; Yuanchi Ha; Daniel C Zielinski; Zachary B Haiman; Abdelmoneim Amer Desouki; Martin J Lercher; Bernhard O Palsson
Journal:  Nat Commun       Date:  2018-12-07       Impact factor: 14.919

9.  Pathway design using de novo steps through uncharted biochemical spaces.

Authors:  Akhil Kumar; Lin Wang; Chiam Yu Ng; Costas D Maranas
Journal:  Nat Commun       Date:  2018-01-12       Impact factor: 14.919

10.  Prediction of train wheel diameter based on Gaussian process regression optimized using a fast simulated annealing algorithm.

Authors:  Xiaoying Yu; Hongsheng Su; Zeyuan Fan; Yu Dong
Journal:  PLoS One       Date:  2019-12-30       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.