Literature DB >> 35477726

LM-GVP: an extensible sequence and structure informed deep learning framework for protein property prediction.

Zichen Wang1, Steven A Combs2, Ryan Brand1, Miguel Romero Calvo1, Panpan Xu1, George Price1, Nataliya Golovach2, Emmanuel O Salawu1, Colby J Wise1, Sri Priya Ponnapalli3, Peter M Clark4.   

Abstract

Proteins perform many essential functions in biological systems and can be successfully developed as bio-therapeutics. It is invaluable to be able to predict their properties based on a proposed sequence and structure. In this study, we developed a novel generalizable deep learning framework, LM-GVP, composed of a protein Language Model (LM) and Graph Neural Network (GNN) to leverage information from both 1D amino acid sequences and 3D structures of proteins. Our approach outperformed the state-of-the-art protein LMs on a variety of property prediction tasks including fluorescence, protease stability, and protein functions from Gene Ontology (GO). We also illustrated insights into how a GNN prediction head can inform the fine-tuning of protein LMs to better leverage structural information. We envision that our deep learning framework will be generalizable to many protein property prediction problems to greatly accelerate protein engineering and drug development.
© 2022. The Author(s).

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35477726      PMCID: PMC9046255          DOI: 10.1038/s41598-022-10775-y

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.996


  25 in total

Review 1.  Protein dynamics and enzyme catalysis: insights from simulations.

Authors:  John D McGeagh; Kara E Ranaghan; Adrian J Mulholland
Journal:  Biochim Biophys Acta       Date:  2010-12-15

Review 2.  The dilemma of conformational dynamics in enzyme catalysis: perspectives from theory and experiment.

Authors:  Urmi Doshi; Donald Hamelberg
Journal:  Adv Exp Med Biol       Date:  2014       Impact factor: 2.622

3.  Improved protein structure prediction using predicted interresidue orientations.

Authors:  Jianyi Yang; Ivan Anishchenko; Hahnbeom Park; Zhenling Peng; Sergey Ovchinnikov; David Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2020-01-02       Impact factor: 11.205

4.  Improved protein structure prediction using potentials from deep learning.

Authors:  Andrew W Senior; Richard Evans; John Jumper; James Kirkpatrick; Laurent Sifre; Tim Green; Chongli Qin; Augustin Žídek; Alexander W R Nelson; Alex Bridgland; Hugo Penedones; Stig Petersen; Karen Simonyan; Steve Crossan; Pushmeet Kohli; David T Jones; David Silver; Koray Kavukcuoglu; Demis Hassabis
Journal:  Nature       Date:  2020-01-15       Impact factor: 49.962

5.  Biopython: freely available Python tools for computational molecular biology and bioinformatics.

Authors:  Peter J A Cock; Tiago Antao; Jeffrey T Chang; Brad A Chapman; Cymon J Cox; Andrew Dalke; Iddo Friedberg; Thomas Hamelryck; Frank Kauff; Bartek Wilczynski; Michiel J L de Hoon
Journal:  Bioinformatics       Date:  2009-03-20       Impact factor: 6.937

Review 6.  Allosteric modulators of GPCRs: a novel approach for the treatment of CNS disorders.

Authors:  P Jeffrey Conn; Arthur Christopoulos; Craig W Lindsley
Journal:  Nat Rev Drug Discov       Date:  2009-01       Impact factor: 84.694

7.  Modeling aspects of the language of life through transfer-learning protein sequences.

Authors:  Michael Heinzinger; Ahmed Elnaggar; Yu Wang; Christian Dallago; Dmitrii Nechaev; Florian Matthes; Burkhard Rost
Journal:  BMC Bioinformatics       Date:  2019-12-17       Impact factor: 3.169

8.  Accurate prediction of protein structures and interactions using a three-track neural network.

Authors:  Minkyung Baek; Frank DiMaio; Ivan Anishchenko; Justas Dauparas; Sergey Ovchinnikov; Gyu Rie Lee; Jue Wang; Qian Cong; Lisa N Kinch; R Dustin Schaeffer; Claudia Millán; Hahnbeom Park; Carson Adams; Caleb R Glassman; Andy DeGiovanni; Jose H Pereira; Andria V Rodrigues; Alberdina A van Dijk; Ana C Ebrecht; Diederik J Opperman; Theo Sagmeister; Christoph Buhlheller; Tea Pavkov-Keller; Manoj K Rathinaswamy; Udit Dalwadi; Calvin K Yip; John E Burke; K Christopher Garcia; Nick V Grishin; Paul D Adams; Randy J Read; David Baker
Journal:  Science       Date:  2021-07-15       Impact factor: 47.728

9.  The Gene Ontology resource: enriching a GOld mine.

Authors: 
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

10.  BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions.

Authors:  Jianyi Yang; Ambrish Roy; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2012-10-18       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.