Literature DB >> 31800243

To Improve Protein Sequence Profile Prediction through Image Captioning on Pairwise Residue Distance Map.

Sheng Chen1, Zhe Sun1, Lihua Lin1, Zifeng Liu2, Xun Liu2, Yutian Chong2, Yutong Lu1, Huiying Zhao3, Yuedong Yang1,4.   

Abstract

Protein sequence profile prediction aims to generate multiple sequences from structural information to advance the protein design. Protein sequence profile can be computationally predicted by energy-based or fragment-based methods. By integrating these methods with neural networks, our previous method, SPIN2, has achieved a sequence recovery rate of 34%. However, SPIN2 employed only one-dimensional (1D) structural properties that are not sufficient to represent three-dimensional (3D) structures. In this study, we represented 3D structures by 2D maps of pairwise residue distances and developed a new method (SPROF) to predict protein sequence profiles based on an image captioning learning frame. To our best knowledge, this is the first method to employ a 2D distance map for predicting protein properties. SPROF achieved 39.8% in sequence recovery of residues on the independent test set, representing a 5.2% improvement over SPIN2. We also found the sequence recovery increased with the number of their neighbored residues in 3D structural space, indicating that our method can effectively learn long-range information from the 2D distance map. Thus, such network architecture using a 2D distance map is expected to be useful for other 3D structure-based applications, such as binding site prediction, protein function prediction, and protein interaction prediction. The online server and the source code is available at http://biomed.nscc-gz.cn and https://github.com/biomed-AI/SPROF , respectively.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31800243     DOI: 10.1021/acs.jcim.9b00438

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  6 in total

1.  Protein-RNA interaction prediction with deep learning: structure matters.

Authors:  Junkang Wei; Siyuan Chen; Licheng Zong; Xin Gao; Yu Li
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

Review 2.  Structure-based protein design with deep learning.

Authors:  Sergey Ovchinnikov; Po-Ssu Huang
Journal:  Curr Opin Chem Biol       Date:  2021-09-20       Impact factor: 8.822

3.  Fold2Seq: A Joint Sequence(1D)-Fold(3D) Embedding-based Generative Model for Protein Design.

Authors:  Yue Cao; Payel Das; Vijil Chenthamarakshan; Pin-Yu Chen; Igor Melnyk; Yang Shen
Journal:  Proc Mach Learn Res       Date:  2021-07

Review 4.  Protein Design with Deep Learning.

Authors:  Marianne Defresne; Sophie Barbe; Thomas Schiex
Journal:  Int J Mol Sci       Date:  2021-10-29       Impact factor: 5.923

5.  Protein sequence design with a learned potential.

Authors:  Namrata Anand; Raphael Eguchi; Irimpan I Mathews; Carla P Perez; Alexander Derry; Russ B Altman; Po-Ssu Huang
Journal:  Nat Commun       Date:  2022-02-08       Impact factor: 14.919

Review 6.  Protein-protein interaction prediction with deep learning: A comprehensive review.

Authors:  Farzan Soleymani; Eric Paquet; Herna Viktor; Wojtek Michalowski; Davide Spinello
Journal:  Comput Struct Biotechnol J       Date:  2022-09-19       Impact factor: 6.155

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.