Literature DB >> 35112110

High-Performance Deep Learning Toolbox for Genome-Scale Prediction of Protein Structure and Function.

Mu Gao1, Peik Lund-Andersen2, Alex Morehead3, Sajid Mahmud3, Chen Chen3, Xiao Chen3, Nabin Giri3, Raj S Roy3, Farhan Quadir3, T Chad Effler4, Ryan Prout4, Subil Abraham4, Wael Elwasif4, N Quentin Haas4, Jeffrey Skolnick1, Jianlin Cheng3, Ada Sedova4.   

Abstract

Computational biology is one of many scientific disciplines ripe for innovation and acceleration with the advent of high-performance computing (HPC). In recent years, the field of machine learning has also seen significant benefits from adopting HPC practices. In this work, we present a novel HPC pipeline that incorporates various machine-learning approaches for structure-based functional annotation of proteins on the scale of whole genomes. Our pipeline makes extensive use of deep learning and provides computational insights into best practices for training advanced deep-learning models for high-throughput data such as proteomics data. We showcase methodologies our pipeline currently supports and detail future tasks for our pipeline to envelop, including large-scale sequence comparison using SAdLSA and prediction of protein tertiary structures using AlphaFold2.

Entities:  

Keywords:  computational biology; deep learning; high-performance computing; machine learning; protein sequence alignment; protein structure prediction

Year:  2021        PMID: 35112110      PMCID: PMC8802329          DOI: 10.1109/mlhpc54614.2021.00010

Source DB:  PubMed          Journal:  Workshop Mach Learn HPC Environ        ISSN: 2768-4237


  50 in total

1.  Benchmarking PSI-BLAST in genome annotation.

Authors:  A Müller; R M MacCallum; M J Sternberg
Journal:  J Mol Biol       Date:  1999-11-12       Impact factor: 5.469

2.  Scoring function for automated assessment of protein structure template quality.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Proteins       Date:  2004-12-01

3.  Protein homology detection by HMM-HMM comparison.

Authors:  Johannes Söding
Journal:  Bioinformatics       Date:  2004-11-05       Impact factor: 6.937

4.  Cryo-electron microscopy wins chemistry Nobel.

Authors:  Daniel Cressey; Ewen Callaway
Journal:  Nature       Date:  2017-10-04       Impact factor: 49.962

5.  DeepDom: Predicting protein domain boundary from sequence alone using stacked bidirectional LSTM.

Authors:  Yuexu Jiang; Duolin Wang; Dong Xu
Journal:  Pac Symp Biocomput       Date:  2019

6.  Assessment of protein model structure accuracy estimation in CASP14: Old and new challenges.

Authors:  Sohee Kwon; Jonghun Won; Andriy Kryshtafovych; Chaok Seok
Journal:  Proteins       Date:  2021-08-05

7.  Big Data: Astronomical or Genomical?

Authors:  Zachary D Stephens; Skylar Y Lee; Faraz Faghri; Roy H Campbell; Chengxiang Zhai; Miles J Efron; Ravishankar Iyer; Michael C Schatz; Saurabh Sinha; Gene E Robinson
Journal:  PLoS Biol       Date:  2015-07-07       Impact factor: 8.029

8.  CATH: comprehensive structural and functional annotations for genome sequences.

Authors:  Ian Sillitoe; Tony E Lewis; Alison Cuff; Sayoni Das; Paul Ashford; Natalie L Dawson; Nicholas Furnham; Roman A Laskowski; David Lee; Jonathan G Lees; Sonja Lehtinen; Romain A Studer; Janet Thornton; Christine A Orengo
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 19.160

9.  DeepDist: real-value inter-residue distance prediction with deep residual convolutional network.

Authors:  Tianqi Wu; Zhiye Guo; Jie Hou; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2021-01-25       Impact factor: 3.169

10.  HH-suite3 for fast remote homology detection and deep protein annotation.

Authors:  Martin Steinegger; Markus Meier; Milot Mirdita; Harald Vöhringer; Stephan J Haunsberger; Johannes Söding
Journal:  BMC Bioinformatics       Date:  2019-09-14       Impact factor: 3.169

View more
  2 in total

1.  AF2Complex predicts direct physical interactions in multimeric proteins with deep learning.

Authors:  Mu Gao; Davi Nakajima An; Jerry M Parks; Jeffrey Skolnick
Journal:  Nat Commun       Date:  2022-04-01       Impact factor: 14.919

2.  Multi-head attention-based U-Nets for predicting protein domain boundaries using 1D sequence features and 2D distance maps.

Authors:  Sajid Mahmud; Zhiye Guo; Farhan Quadir; Jian Liu; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2022-07-19       Impact factor: 3.307

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.