Literature DB >> 35073170

ProALIGN: Directly Learning Alignments for Protein Structure Prediction via Exploiting Context-Specific Alignment Motifs.

Lupeng Kong1,2,3, Fusong Ju1,2, Wei-Mou Zheng4, Jianwei Zhu5, Shiwei Sun1,2, Jinbo Xu3, Dongbo Bu1,2.   

Abstract

Template-based modeling (TBM), including homology modeling and protein threading, is one of the most reliable techniques for protein structure prediction. It predicts protein structure by building an alignment between the query sequence under prediction and the templates with solved structures. However, it is still very challenging to build the optimal sequence-template alignment, especially when only distantly related templates are available. Here we report a novel deep learning approach ProALIGN that can predict much more accurate sequence-template alignment. Like protein sequences consisting of sequence motifs, protein alignments are also composed of frequently occurring alignment motifs with characteristic patterns. Alignment motifs are context-specific as their characteristic patterns are tightly related to sequence contexts of the aligned regions. Inspired by this observation, we represent a protein alignment as a binary matrix (in which 1 denotes an aligned residue pair) and then use a deep convolutional neural network to predict the optimal alignment from the query protein and its template. The trained neural network implicitly but effectively encodes an alignment scoring function, which reduces inaccuracies in the handcrafted scoring functions widely used by the current threading approaches. For a query protein and a template, we apply the neural network to directly infer likelihoods of all possible residue pairs in their entirety, which could effectively consider the correlations among multiple residues. We further construct the alignment with maximum likelihood, and finally build a structure model according to the alignment. Tested on three independent data sets with a total of 6688 protein alignment targets and 80 CASP13 TBM targets, our method achieved much better alignments and 3D structure models than the existing methods, including HHpred, CNFpred, CEthreader, and DeepThreader. These results clearly demonstrate the effectiveness of exploiting the context-specific alignment motifs by deep learning for protein threading.

Entities:  

Keywords:  deep learning and protein threading; protein alignment; protein structure prediction

Mesh:

Substances:

Year:  2022        PMID: 35073170      PMCID: PMC8892980          DOI: 10.1089/cmb.2021.0430

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  24 in total

1.  SCOP: a structural classification of proteins database.

Authors:  L Lo Conte; B Ailey; T J Hubbard; S E Brenner; A G Murzin; C Chothia
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  UniProt: the Universal Protein knowledgebase.

Authors:  Rolf Apweiler; Amos Bairoch; Cathy H Wu; Winona C Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J Martin; Darren A Natale; Claire O'Donovan; Nicole Redaschi; Lai-Su L Yeh
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Sequence context-specific profiles for homology searching.

Authors:  A Biegert; J Söding
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-20       Impact factor: 11.205

4.  Comparative protein structure modeling using Modeller.

Authors:  Ben Webb; Andrej Sali; Narayanan Eswar; Marc A Marti-Renom; M S Madhusudhan; David Eramian; Min-Yi Shen; Ursula Pieper
Journal:  Curr Protoc Bioinformatics       Date:  2006-10

5.  Boosting Protein Threading Accuracy.

Authors:  Jian Peng; Jinbo Xu
Journal:  Res Comput Mol Biol       Date:  2009

6.  CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction.

Authors:  Fusong Ju; Jianwei Zhu; Bin Shao; Lupeng Kong; Tie-Yan Liu; Wei-Mou Zheng; Dongbo Bu
Journal:  Nat Commun       Date:  2021-05-05       Impact factor: 14.919

7.  A conditional neural fields model for protein threading.

Authors:  Jianzhu Ma; Jian Peng; Sheng Wang; Jinbo Xu
Journal:  Bioinformatics       Date:  2012-06-15       Impact factor: 6.937

8.  Protein structure alignment beyond spatial proximity.

Authors:  Sheng Wang; Jianzhu Ma; Jian Peng; Jinbo Xu
Journal:  Sci Rep       Date:  2013       Impact factor: 4.379

9.  RaptorX-Property: a web server for protein structure property prediction.

Authors:  Sheng Wang; Wei Li; Shiwang Liu; Jinbo Xu
Journal:  Nucleic Acids Res       Date:  2016-04-25       Impact factor: 16.971

10.  Detecting distant-homology protein structures by aligning deep neural-network based contact maps.

Authors:  Wei Zheng; Qiqige Wuyun; Yang Li; S M Mortuza; Chengxin Zhang; Robin Pearce; Jishou Ruan; Yang Zhang
Journal:  PLoS Comput Biol       Date:  2019-10-17       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.