Literature DB >> 32227201

FUpred: detecting protein domains through deep-learning-based contact map prediction.

Wei Zheng1, Xiaogen Zhou1, Qiqige Wuyun2, Robin Pearce1, Yang Li1,3, Yang Zhang1,4.   

Abstract

MOTIVATION: Protein domains are subunits that can fold and function independently. Correct domain boundary assignment is thus a critical step toward accurate protein structure and function analyses. There is, however, no efficient algorithm available for accurate domain prediction from sequence. The problem is particularly challenging for proteins with discontinuous domains, which consist of domain segments that are separated along the sequence.
RESULTS: We developed a new algorithm, FUpred, which predicts protein domain boundaries utilizing contact maps created by deep residual neural networks coupled with coevolutionary precision matrices. The core idea of the algorithm is to retrieve domain boundary locations by maximizing the number of intra-domain contacts, while minimizing the number of inter-domain contacts from the contact maps. FUpred was tested on a large-scale dataset consisting of 2549 proteins and generated correct single- and multi-domain classifications with a Matthew's correlation coefficient of 0.799, which was 19.1% (or 5.3%) higher than the best machine learning (or threading)-based method. For proteins with discontinuous domains, the domain boundary detection and normalized domain overlapping scores of FUpred were 0.788 and 0.521, respectively, which were 17.3% and 23.8% higher than the best control method. The results demonstrate a new avenue to accurately detect domain composition from sequence alone, especially for discontinuous, multi-domain proteins.
AVAILABILITY AND IMPLEMENTATION: https://zhanglab.ccmb.med.umich.edu/FUpred. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2020        PMID: 32227201      PMCID: PMC7320627          DOI: 10.1093/bioinformatics/btaa217

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

1.  PDP: protein domain parser.

Authors:  Nickolai Alexandrov; Ilya Shindyalov
Journal:  Bioinformatics       Date:  2003-02-12       Impact factor: 6.937

2.  Improving the performance of DomainParser for structural domain partition using neural network.

Authors:  Jun-tao Guo; Dong Xu; Dongsup Kim; Ying Xu
Journal:  Nucleic Acids Res       Date:  2003-02-01       Impact factor: 16.971

3.  Protein homology detection by HMM-HMM comparison.

Authors:  Johannes Söding
Journal:  Bioinformatics       Date:  2004-11-05       Impact factor: 6.937

4.  Evaluation of domain prediction in CASP6.

Authors:  Chin-Hsien Tai; Woei-Jyh Lee; James J Vincent; Byungkook Lee
Journal:  Proteins       Date:  2005

5.  DDOMAIN: Dividing structures into domains using a normalized domain-domain interaction profile.

Authors:  Hongyi Zhou; Bin Xue; Yaoqi Zhou
Journal:  Protein Sci       Date:  2007-05       Impact factor: 6.725

6.  ConDo: protein domain boundary prediction using coevolutionary information.

Authors:  Seung Hwan Hong; Keehyoung Joo; Jooyoung Lee
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

7.  SCOPe: Manual Curation and Artifact Removal in the Structural Classification of Proteins - extended Database.

Authors:  John-Marc Chandonia; Naomi K Fox; Steven E Brenner
Journal:  J Mol Biol       Date:  2016-11-30       Impact factor: 5.469

8.  DoBo: Protein domain boundary prediction by integrating evolutionary signals and machine learning.

Authors:  Jesse Eickholt; Xin Deng; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2011-02-01       Impact factor: 3.169

9.  DOMAC: an accurate, hybrid protein domain prediction server.

Authors:  Jianlin Cheng
Journal:  Nucleic Acids Res       Date:  2007-06-06       Impact factor: 16.971

10.  Predicting active site residue annotations in the Pfam database.

Authors:  Jaina Mistry; Alex Bateman; Robert D Finn
Journal:  BMC Bioinformatics       Date:  2007-08-09       Impact factor: 3.169

View more
  15 in total

1.  Progressive assembly of multi-domain protein structures from cryo-EM density maps.

Authors:  Xiaogen Zhou; Yang Li; Chengxin Zhang; Wei Zheng; Guijun Zhang; Yang Zhang
Journal:  Nat Comput Sci       Date:  2022-04-28

2.  Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field.

Authors:  Jalil Villalobos-Alva; Luis Ochoa-Toledo; Mario Javier Villalobos-Alva; Atocha Aliseda; Fernando Pérez-Escamirosa; Nelly F Altamirano-Bustamante; Francine Ochoa-Fernández; Ricardo Zamora-Solís; Sebastián Villalobos-Alva; Cristina Revilla-Monsalve; Nicolás Kemper-Valverde; Myriam M Altamirano-Bustamante
Journal:  Front Bioeng Biotechnol       Date:  2022-07-07

3.  Characterization of Treponema denticola Major Surface Protein (Msp) by Deletion Analysis and Advanced Molecular Modeling.

Authors:  M Paula Goetting-Minesky; Valentina Godovikova; Wei Zheng; J Christopher Fenno
Journal:  J Bacteriol       Date:  2022-08-01       Impact factor: 3.476

Review 4.  I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction.

Authors:  Xiaogen Zhou; Wei Zheng; Yang Li; Robin Pearce; Chengxin Zhang; Eric W Bell; Guijun Zhang; Yang Zhang
Journal:  Nat Protoc       Date:  2022-08-05       Impact factor: 17.021

5.  Insights from the molecular docking analysis of colistin with the PmrA protein model from Acinetobacter baumannii.

Authors:  Shalini Ganeshan; Mohammad Reza Shakibaie; Rajaguru Rajagopal
Journal:  Bioinformation       Date:  2022-01-31

6.  Multi-head attention-based U-Nets for predicting protein domain boundaries using 1D sequence features and 2D distance maps.

Authors:  Sajid Mahmud; Zhiye Guo; Farhan Quadir; Jian Liu; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2022-07-19       Impact factor: 3.307

7.  Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14.

Authors:  Wei Zheng; Yang Li; Chengxin Zhang; Xiaogen Zhou; Robin Pearce; Eric W Bell; Xiaoqiang Huang; Yang Zhang
Journal:  Proteins       Date:  2021-08-07

8.  LOMETS3: integrating deep learning and profile alignment for advanced protein template recognition and function annotation.

Authors:  Wei Zheng; Qiqige Wuyun; Xiaogen Zhou; Yang Li; Peter L Freddolino; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2022-04-14       Impact factor: 19.160

9.  APOD: accurate sequence-based predictor of disordered flexible linkers.

Authors:  Zhenling Peng; Qian Xing; Lukasz Kurgan
Journal:  Bioinformatics       Date:  2020-12-30       Impact factor: 6.937

Review 10.  Deep Learning-Based Advances in Protein Structure Prediction.

Authors:  Subash C Pakhrin; Bikash Shrestha; Badri Adhikari; Dukka B Kc
Journal:  Int J Mol Sci       Date:  2021-05-24       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.