Literature DB >> 31504193

Identifying molecular recognition features in intrinsically disordered regions of proteins by transfer learning.

Jack Hanson1, Thomas Litfin2, Kuldip Paliwal1, Yaoqi Zhou2.   

Abstract

MOTIVATION: Protein intrinsic disorder describes the tendency of sequence residues to not fold into a rigid three-dimensional shape by themselves. However, some of these disordered regions can transition from disorder to order when interacting with another molecule in segments known as molecular recognition features (MoRFs). Previous analysis has shown that these MoRF regions are indirectly encoded within the prediction of residue disorder as low-confidence predictions [i.e. in a semi-disordered state P(D)≈0.5]. Thus, what has been learned for disorder prediction may be transferable to MoRF prediction. Transferring the internal characterization of protein disorder for the prediction of MoRF residues would allow us to take advantage of the large training set available for disorder prediction, enabling the training of larger analytical models than is currently feasible on the small number of currently available annotated MoRF proteins. In this paper, we propose a new method for MoRF prediction by transfer learning from the SPOT-Disorder2 ensemble models built for disorder prediction.
RESULTS: We confirm that directly training on the MoRF set with a randomly initialized model yields substantially poorer performance on independent test sets than by using the transfer-learning-based method SPOT-MoRF, for both deep and simple networks. Its comparison to current state-of-the-art techniques reveals its superior performance in identifying MoRF binding regions in proteins across two independent testing sets, including our new dataset of >800 protein chains. These test chains share <30% sequence similarity to all training and validation proteins used in SPOT-Disorder2 and SPOT-MoRF, and provide a much-needed large-scale update on the performance of current MoRF predictors. The method is expected to be useful in locating functional disordered regions in proteins.
AVAILABILITY AND IMPLEMENTATION: SPOT-MoRF and its data are available as a web server and as a standalone program at: http://sparks-lab.org/jack/server/SPOT-MoRF/index.php. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 31504193     DOI: 10.1093/bioinformatics/btz691

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Predicting Protein Conformational Disorder and Disordered Binding Sites.

Authors:  Ketty C Tamburrini; Giulia Pesce; Juliet Nilsson; Frank Gondelaud; Andrey V Kajava; Jean-Guy Berrin; Sonia Longhi
Journal:  Methods Mol Biol       Date:  2022

2.  Application of DNA-Binding Protein Prediction Based on Graph Convolutional Network and Contact Map.

Authors:  Weizhong Lu; Nan Zhou; Yijie Ding; Hongjie Wu; Yu Zhang; Qiming Fu; Haiou Li
Journal:  Biomed Res Int       Date:  2022-01-17       Impact factor: 3.411

3.  APOD: accurate sequence-based predictor of disordered flexible linkers.

Authors:  Zhenling Peng; Qian Xing; Lukasz Kurgan
Journal:  Bioinformatics       Date:  2020-12-30       Impact factor: 6.937

Review 4.  Intrinsically disordered proteins play diverse roles in cell signaling.

Authors:  Sarah E Bondos; A Keith Dunker; Vladimir N Uversky
Journal:  Cell Commun Signal       Date:  2022-02-17       Impact factor: 5.712

Review 5.  MoRF-FUNCpred: Molecular Recognition Feature Function Prediction Based on Multi-Label Learning and Ensemble Learning.

Authors:  Haozheng Li; Yihe Pang; Bin Liu; Liang Yu
Journal:  Front Pharmacol       Date:  2022-03-08       Impact factor: 5.810

Review 6.  Deep learning in prediction of intrinsic disorder in proteins.

Authors:  Bi Zhao; Lukasz Kurgan
Journal:  Comput Struct Biotechnol J       Date:  2022-03-08       Impact factor: 7.271

7.  Challenges in describing the conformation and dynamics of proteins with ambiguous behavior.

Authors:  Joel Roca-Martinez; Tamas Lazar; Jose Gavalda-Garcia; David Bickel; Rita Pancsa; Bhawna Dixit; Konstantina Tzavella; Pathmanaban Ramasamy; Maite Sanchez-Fornaris; Isel Grau; Wim F Vranken
Journal:  Front Mol Biosci       Date:  2022-08-03
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.