Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 High-Performance Deep Learning Toolbox for Genome-Scale Prediction of Protein Structure and Function.

Literature DB >> 35112110

High-Performance Deep Learning Toolbox for Genome-Scale Prediction of Protein Structure and Function.

Mu Gao¹, Peik Lund-Andersen², Alex Morehead³, Sajid Mahmud³, Chen Chen³, Xiao Chen³, Nabin Giri³, Raj S Roy³, Farhan Quadir³, T Chad Effler⁴, Ryan Prout⁴, Subil Abraham⁴, Wael Elwasif⁴, N Quentin Haas⁴, Jeffrey Skolnick¹, Jianlin Cheng³, Ada Sedova⁴.

Abstract

Computational biology is one of many scientific disciplines ripe for innovation and acceleration with the advent of high-performance computing (HPC). In recent years, the field of machine learning has also seen significant benefits from adopting HPC practices. In this work, we present a novel HPC pipeline that incorporates various machine-learning approaches for structure-based functional annotation of proteins on the scale of whole genomes. Our pipeline makes extensive use of deep learning and provides computational insights into best practices for training advanced deep-learning models for high-throughput data such as proteomics data. We showcase methodologies our pipeline currently supports and detail future tasks for our pipeline to envelop, including large-scale sequence comparison using SAdLSA and prediction of protein tertiary structures using AlphaFold2.

Entities: Chemical

Keywords: computational biology; deep learning; high-performance computing; machine learning; protein sequence alignment; protein structure prediction

Year: 2021 PMID： 35112110 PMCID： PMC8802329 DOI： 10.1109/mlhpc54614.2021.00010

Source DB: PubMed Journal: Workshop Mach Learn HPC Environ ISSN： 2768-4237

50 in total

1. Benchmarking PSI-BLAST in genome annotation.

Authors: A Müller; R M MacCallum; M J Sternberg
Journal: J Mol Biol Date: 1999-11-12 Impact factor: 5.469

2. Scoring function for automated assessment of protein structure template quality.

Authors: Yang Zhang; Jeffrey Skolnick
Journal: Proteins Date: 2004-12-01

3. Protein homology detection by HMM-HMM comparison.

Authors: Johannes Söding
Journal: Bioinformatics Date: 2004-11-05 Impact factor: 6.937

4. Cryo-electron microscopy wins chemistry Nobel.

Authors: Daniel Cressey; Ewen Callaway
Journal: Nature Date: 2017-10-04 Impact factor: 49.962

5. DeepDom: Predicting protein domain boundary from sequence alone using stacked bidirectional LSTM.

Authors: Yuexu Jiang; Duolin Wang; Dong Xu
Journal: Pac Symp Biocomput Date: 2019

6. Assessment of protein model structure accuracy estimation in CASP14: Old and new challenges.

Authors: Sohee Kwon; Jonghun Won; Andriy Kryshtafovych; Chaok Seok
Journal: Proteins Date: 2021-08-05

7. Big Data: Astronomical or Genomical?

Authors: Zachary D Stephens; Skylar Y Lee; Faraz Faghri; Roy H Campbell; Chengxiang Zhai; Miles J Efron; Ravishankar Iyer; Michael C Schatz; Saurabh Sinha; Gene E Robinson
Journal: PLoS Biol Date: 2015-07-07 Impact factor: 8.029

8. CATH: comprehensive structural and functional annotations for genome sequences.

Authors: Ian Sillitoe; Tony E Lewis; Alison Cuff; Sayoni Das; Paul Ashford; Natalie L Dawson; Nicholas Furnham; Roman A Laskowski; David Lee; Jonathan G Lees; Sonja Lehtinen; Romain A Studer; Janet Thornton; Christine A Orengo
Journal: Nucleic Acids Res Date: 2014-10-27 Impact factor: 19.160

2. Multi-head attention-based U-Nets for predicting protein domain boundaries using 1D sequence features and 2D distance maps.

Authors: Sajid Mahmud; Zhiye Guo; Farhan Quadir; Jian Liu; Jianlin Cheng
Journal: BMC Bioinformatics Date: 2022-07-19 Impact factor: 3.307

2 in total