Literature DB >> 31117361

Machine Learning Applied to Predicting Microorganism Growth Temperatures and Enzyme Catalytic Optima.

Gang Li1, Kersten S Rabe2, Jens Nielsen1,3, Martin K M Engqvist1.   

Abstract

Enzymes that catalyze chemical reactions at high temperatures are used for industrial biocatalysis, applications in molecular biology, and as highly evolvable starting points for protein engineering. The optimal growth temperature (OGT) of organisms is commonly used to estimate the stability of enzymes encoded in their genomes, but the number of experimentally determined OGT values are limited, particularly for thermophilic organisms. Here, we report on the development of a machine learning model that can accurately predict OGT for bacteria, archaea, and microbial eukaryotes directly from their proteome-wide 2-mer amino acid composition. The trained model is made freely available for reuse. In a subsequent step we use OGT data in combination with amino acid composition of individual enzymes to develop a second machine learning model-for prediction of enzyme catalytic temperature optima ( Topt). The resulting model generates enzyme Topt estimates that are far superior to using OGT alone. Finally, we predict Topt for 6.5 million enzymes, covering 4447 enzyme classes, and make the resulting data set available to researchers. This work enables simple and rapid identification of enzymes that are potentially functional at extreme temperatures.

Keywords:  enzyme temperature optima; machine learning; optimal growth temperature; sequence-based prediction; thermostable enzymes

Mesh:

Substances:

Year:  2019        PMID: 31117361     DOI: 10.1021/acssynbio.9b00099

Source DB:  PubMed          Journal:  ACS Synth Biol        ISSN: 2161-5063            Impact factor:   5.110


  17 in total

1.  Building a tRNA thermometer to estimate microbial adaptation to temperature.

Authors:  Emre Cimen; Sarah E Jensen; Edward S Buckler
Journal:  Nucleic Acids Res       Date:  2020-12-02       Impact factor: 16.971

2.  Revisiting the Relationships Between Genomic G + C Content, RNA Secondary Structures, and Optimal Growth Temperature.

Authors:  Michelle M Meyer
Journal:  J Mol Evol       Date:  2020-11-20       Impact factor: 2.395

3.  Sulfur cycling and host-virus interactions in Aquificales-dominated biofilms from Yellowstone's hottest ecosystems.

Authors:  Luke J McKay; Olivia D Nigro; Mensur Dlakić; Karen M Luttrell; Douglas B Rusch; Matthew W Fields; William P Inskeep
Journal:  ISME J       Date:  2021-10-14       Impact factor: 10.302

4.  Recovery of Lutacidiplasmatales archaeal order genomes suggests convergent evolution in Thermoplasmatota.

Authors:  Paul O Sheridan; Yiyu Meng; Tom A Williams; Cécile Gubry-Rangin
Journal:  Nat Commun       Date:  2022-07-15       Impact factor: 17.694

5.  Growth rate determines prokaryote-provirus network modulated by temperature and host genetic traits.

Authors:  Zhenghua Liu; Qingyun Yan; Chengying Jiang; Juan Li; Huahua Jian; Lu Fan; Rui Zhang; Xiang Xiao; Delong Meng; Xueduan Liu; Jianjun Wang; Huaqun Yin
Journal:  Microbiome       Date:  2022-06-14       Impact factor: 16.837

Review 6.  Life and death in the soil microbiome: how ecological processes influence biogeochemistry.

Authors:  Noah W Sokol; Eric Slessarev; Gianna L Marschmann; Alexa Nicolas; Steven J Blazewicz; Eoin L Brodie; Mary K Firestone; Megan M Foley; Rachel Hestrin; Bruce A Hungate; Benjamin J Koch; Bram W Stone; Matthew B Sullivan; Olivier Zablocki; Jennifer Pett-Ridge
Journal:  Nat Rev Microbiol       Date:  2022-02-28       Impact factor: 78.297

Review 7.  Machine learning for enzyme engineering, selection and design.

Authors:  Ryan Feehan; Daniel Montezano; Joanna S G Slusky
Journal:  Protein Eng Des Sel       Date:  2021-02-15       Impact factor: 1.952

8.  Bayesian genome scale modelling identifies thermal determinants of yeast metabolism.

Authors:  Gang Li; Yating Hu; Hao Luo; Hao Wang; Aleksej Zelezniak; Boyang Ji; Jens Nielsen
Journal:  Nat Commun       Date:  2021-01-08       Impact factor: 14.919

9.  Performance of Regression Models as a Function of Experiment Noise.

Authors:  Gang Li; Jan Zrimec; Boyang Ji; Jun Geng; Johan Larsbrink; Aleksej Zelezniak; Jens Nielsen; Martin Km Engqvist
Journal:  Bioinform Biol Insights       Date:  2021-06-27

10.  MCIC: Automated Identification of Cellulases From Metagenomic Data and Characterization Based on Temperature and pH Dependence.

Authors:  Mehdi Foroozandeh Shahraki; Shohreh Ariaeenejad; Fereshteh Fallah Atanaki; Behrouz Zolfaghari; Takeshi Koshiba; Kaveh Kavousi; Ghasem Hosseini Salekdeh
Journal:  Front Microbiol       Date:  2020-10-23       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.