Literature DB >> 31221760

Deep learning enables high-quality and high-throughput prediction of enzyme commission numbers.

Jae Yong Ryu1,2,3,4, Hyun Uk Kim5,4,6,7, Sang Yup Lee8,2,3,4,7.   

Abstract

High-quality and high-throughput prediction of enzyme commission (EC) numbers is essential for accurate understanding of enzyme functions, which have many implications in pathologies and industrial biotechnology. Several EC number prediction tools are currently available, but their prediction performance needs to be further improved to precisely and efficiently process an ever-increasing volume of protein sequence data. Here, we report DeepEC, a deep learning-based computational framework that predicts EC numbers for protein sequences with high precision and in a high-throughput manner. DeepEC takes a protein sequence as input and predicts EC numbers as output. DeepEC uses 3 convolutional neural networks (CNNs) as a major engine for the prediction of EC numbers, and also implements homology analysis for EC numbers that cannot be classified by the CNNs. Comparative analyses against 5 representative EC number prediction tools show that DeepEC allows the most precise prediction of EC numbers, and is the fastest and the lightest in terms of the disk space required. Furthermore, DeepEC is the most sensitive in detecting the effects of mutated domains/binding site residues of protein sequences. DeepEC can be used as an independent tool, and also as a third-party software component in combination with other computational platforms that examine metabolic reactions.

Keywords:  DeepEC; EC number prediction; deep learning; enzyme commission number; metabolism

Year:  2019        PMID: 31221760      PMCID: PMC6628820          DOI: 10.1073/pnas.1821905116

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  31 in total

Review 1.  Combinatorial alanine-scanning.

Authors:  K L Morrison; G A Weiss
Journal:  Curr Opin Chem Biol       Date:  2001-06       Impact factor: 8.822

2.  Enzyme-specific profiles for genome annotation: PRIAM.

Authors:  Clotilde Claudel-Renard; Claude Chevalet; Thomas Faraut; Daniel Kahn
Journal:  Nucleic Acids Res       Date:  2003-11-15       Impact factor: 16.971

3.  BRENDA, the enzyme database: updates and major new developments.

Authors:  Ida Schomburg; Antje Chang; Christian Ebeling; Marion Gremse; Christian Heldt; Gregor Huhn; Dietmar Schomburg
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  ExPASy: The proteomics server for in-depth protein knowledge and analysis.

Authors:  Elisabeth Gasteiger; Alexandre Gattiker; Christine Hoogland; Ivan Ivanyi; Ron D Appel; Amos Bairoch
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

Review 5.  Metabolic networks: enzyme function and metabolite structure.

Authors:  Vassily Hatzimanikatis; Chunhui Li; Justin A Ionita; Linda J Broadbelt
Journal:  Curr Opin Struct Biol       Date:  2004-06       Impact factor: 6.809

6.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 7.  Automated protein function prediction--the genomic challenge.

Authors:  Iddo Friedberg
Journal:  Brief Bioinform       Date:  2006-05-23       Impact factor: 11.622

8.  EzyPred: a top-down approach for predicting enzyme functional classes and subclasses.

Authors:  Hong-Bin Shen; Kuo-Chen Chou
Journal:  Biochem Biophys Res Commun       Date:  2007-10-02       Impact factor: 3.575

9.  Genome-wide enzyme annotation with precision control: catalytic families (CatFam) databases.

Authors:  Chenggang Yu; Nela Zavaljevski; Valmik Desai; Jaques Reifman
Journal:  Proteins       Date:  2009-02-01

10.  WoLF PSORT: protein localization predictor.

Authors:  Paul Horton; Keun-Joon Park; Takeshi Obayashi; Naoya Fujita; Hajime Harada; C J Adams-Collier; Kenta Nakai
Journal:  Nucleic Acids Res       Date:  2007-05-21       Impact factor: 16.971

View more
  23 in total

1.  DeepTFactor: A deep learning-based tool for the prediction of transcription factors.

Authors:  Gi Bae Kim; Ye Gao; Bernhard O Palsson; Sang Yup Lee
Journal:  Proc Natl Acad Sci U S A       Date:  2021-01-12       Impact factor: 11.205

Review 2.  A roadmap for multi-omics data integration using deep learning.

Authors:  Mingon Kang; Euiseong Ko; Tesfaye B Mersha
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

3.  Synthetic Biology Meets Machine Learning.

Authors:  Brendan Fu-Long Sieow; Ryan De Sotto; Zhi Ren Darren Seet; In Young Hwang; Matthew Wook Chang
Journal:  Methods Mol Biol       Date:  2023

4.  Reconstruction of a Genome-Scale Metabolic Network for Shewanella oneidensis MR-1 and Analysis of its Metabolic Potential for Bioelectrochemical Systems.

Authors:  Jiahao Luo; Qianqian Yuan; Yufeng Mao; Fan Wei; Juntao Zhao; Wentong Yu; Shutian Kong; Yanmei Guo; Jingyi Cai; Xiaoping Liao; Zhiwen Wang; Hongwu Ma
Journal:  Front Bioeng Biotechnol       Date:  2022-05-12

5.  Comparative genomic analysis of Streptomyces rapamycinicus NRRL 5491 and its mutant overproducing rapamycin.

Authors:  Hee-Geun Jo; Joshua Julio Adidjaja; Do-Kyung Kim; Bu-Soo Park; Namil Lee; Byung-Kwan Cho; Hyun Uk Kim; Min-Kyu Oh
Journal:  Sci Rep       Date:  2022-06-18       Impact factor: 4.996

6.  Utilizing graph machine learning within drug discovery and development.

Authors:  Thomas Gaudelet; Ben Day; Arian R Jamasb; Jyothish Soman; Cristian Regep; Gertrude Liu; Jeremy B R Hayter; Richard Vickers; Charles Roberts; Jian Tang; David Roblin; Tom L Blundell; Michael M Bronstein; Jake P Taylor-King
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

Review 7.  Machine learning for enzyme engineering, selection and design.

Authors:  Ryan Feehan; Daniel Montezano; Joanna S G Slusky
Journal:  Protein Eng Des Sel       Date:  2021-02-15       Impact factor: 1.952

8.  Shotgun proteomics of peach fruit reveals major metabolic pathways associated to ripening.

Authors:  Ricardo Nilo-Poyanco; Carol Moraga; Gianfranco Benedetto; Ariel Orellana; Andrea Miyasaka Almeida
Journal:  BMC Genomics       Date:  2021-01-06       Impact factor: 3.969

9.  Mantis: flexible and consensus-driven genome annotation.

Authors:  Pedro Queirós; Francesco Delogu; Oskar Hickl; Patrick May; Paul Wilmes
Journal:  Gigascience       Date:  2021-06-02       Impact factor: 6.524

10.  Machine learning differentiates enzymatic and non-enzymatic metals in proteins.

Authors:  Ryan Feehan; Meghan W Franklin; Joanna S G Slusky
Journal:  Nat Commun       Date:  2021-06-17       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.