Literature DB >> 27587688

AUCpreD: proteome-level protein disorder prediction by AUC-maximized deep convolutional neural fields.

Sheng Wang1, Jianzhu Ma2, Jinbo Xu2.   

Abstract

MOTIVATION: Protein intrinsically disordered regions (IDRs) play an important role in many biological processes. Two key properties of IDRs are (i) the occurrence is proteome-wide and (ii) the ratio of disordered residues is about 6%, which makes it challenging to accurately predict IDRs. Most IDR prediction methods use sequence profile to improve accuracy, which prevents its application to proteome-wide prediction since it is time-consuming to generate sequence profiles. On the other hand, the methods without using sequence profile fare much worse than using sequence profile.
METHOD: This article formulates IDR prediction as a sequence labeling problem and employs a new machine learning method called Deep Convolutional Neural Fields (DeepCNF) to solve it. DeepCNF is an integration of deep convolutional neural networks (DCNN) and conditional random fields (CRF); it can model not only complex sequence-structure relationship in a hierarchical manner, but also correlation among adjacent residues. To deal with highly imbalanced order/disorder ratio, instead of training DeepCNF by widely used maximum-likelihood, we develop a novel approach to train it by maximizing area under the ROC curve (AUC), which is an unbiased measure for class-imbalanced data.
RESULTS: Our experimental results show that our IDR prediction method AUCpreD outperforms existing popular disorder predictors. More importantly, AUCpreD works very well even without sequence profile, comparing favorably to or even outperforming many methods using sequence profile. Therefore, our method works for proteome-wide disorder prediction while yielding similar or better accuracy than the others.
AVAILABILITY AND IMPLEMENTATION: http://raptorx2.uchicago.edu/StructurePropertyPred/predict/ CONTACT: wangsheng@uchicago.edu, jinboxu@gmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27587688      PMCID: PMC5013916          DOI: 10.1093/bioinformatics/btw446

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  49 in total

1.  Optical rotation and viscosity of native and denatured proteins. X. Further studies on optical rotatory dispersion.

Authors:  B JIRGENSONS
Journal:  Arch Biochem Biophys       Date:  1958-03       Impact factor: 4.013

2.  Solving the protein sequence metric problem.

Authors:  William R Atchley; Jieping Zhao; Andrew D Fernandes; Tanja Drüke
Journal:  Proc Natl Acad Sci U S A       Date:  2005-04-25       Impact factor: 11.205

3.  Statistical potential-based amino acid similarity matrices for aligning distantly related protein sequences.

Authors:  Yen Hock Tan; He Huang; Daisuke Kihara
Journal:  Proteins       Date:  2006-08-15

4.  Intrinsic disorder prediction from the analysis of multiple protein fold recognition models.

Authors:  Liam J McGuffin
Journal:  Bioinformatics       Date:  2008-06-25       Impact factor: 6.937

Review 5.  Predicting intrinsic disorder in proteins: an overview.

Authors:  Bo He; Kejun Wang; Yunlong Liu; Bin Xue; Vladimir N Uversky; A Keith Dunker
Journal:  Cell Res       Date:  2009-08       Impact factor: 25.617

6.  MetaDisorder: a meta-server for the prediction of intrinsic disorder in proteins.

Authors:  Lukasz P Kozlowski; Janusz M Bujnicki
Journal:  BMC Bioinformatics       Date:  2012-05-24       Impact factor: 3.169

7.  DISOPRED3: precise disordered region predictions with annotated protein-binding activity.

Authors:  David T Jones; Domenico Cozzetto
Journal:  Bioinformatics       Date:  2014-11-12       Impact factor: 6.937

8.  The Protein Model Portal--a comprehensive resource for protein structure and model information.

Authors:  Juergen Haas; Steven Roth; Konstantin Arnold; Florian Kiefer; Tobias Schmidt; Lorenza Bordoli; Torsten Schwede
Journal:  Database (Oxford)       Date:  2013-04-26       Impact factor: 3.451

9.  Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.

Authors:  Sheng Wang; Jian Peng; Jianzhu Ma; Jinbo Xu
Journal:  Sci Rep       Date:  2016-01-11       Impact factor: 4.379

10.  RaptorX-Property: a web server for protein structure property prediction.

Authors:  Sheng Wang; Wei Li; Shiwang Liu; Jinbo Xu
Journal:  Nucleic Acids Res       Date:  2016-04-25       Impact factor: 16.971

View more
  21 in total

1.  A Membrane Burial Potential with H-Bonds and Applications to Curved Membranes and Fast Simulations.

Authors:  Zongan Wang; John M Jumper; Sheng Wang; Karl F Freed; Tobin R Sosnick
Journal:  Biophys J       Date:  2018-10-23       Impact factor: 4.033

2.  Prediction of secondary structure population and intrinsic disorder of proteins using multitask deep learning.

Authors:  Xu Ying; Andre Leier; Tatiana T Marquez-Lago; Jue Xie; Antonio Jose Jimeno Yepes; James C Whisstock; Campbell Wilson; Jiangning Song
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

3.  Structural insights into phosphopantetheinyl hydrolase PptH from Mycobacterium tuberculosis.

Authors:  John Mosior; Ronnie Bourland; Shivatheja Soma; Carl Nathan; James Sacchettini
Journal:  Protein Sci       Date:  2020-01-20       Impact factor: 6.725

4.  Predicting Protein Conformational Disorder and Disordered Binding Sites.

Authors:  Ketty C Tamburrini; Giulia Pesce; Juliet Nilsson; Frank Gondelaud; Andrey V Kajava; Jean-Guy Berrin; Sonia Longhi
Journal:  Methods Mol Biol       Date:  2022

Review 5.  Protein Function Analysis through Machine Learning.

Authors:  Chris Avery; John Patterson; Tyler Grear; Theodore Frater; Donald J Jacobs
Journal:  Biomolecules       Date:  2022-09-06

Review 6.  Deep learning methods in protein structure prediction.

Authors:  Mirko Torrisi; Gianluca Pollastri; Quan Le
Journal:  Comput Struct Biotechnol J       Date:  2020-01-22       Impact factor: 7.271

7.  The Role of APOSTART in Switching between Sexuality and Apomixis in Poa pratensis.

Authors:  Gianpiero Marconi; Domenico Aiello; Bryan Kindiger; Loriano Storchi; Alessandro Marrone; Lara Reale; Niccolò Terzaroli; Emidio Albertini
Journal:  Genes (Basel)       Date:  2020-08-14       Impact factor: 4.096

8.  Quality and bias of protein disorder predictors.

Authors:  Jakob T Nielsen; Frans A A Mulder
Journal:  Sci Rep       Date:  2019-03-26       Impact factor: 4.379

9.  DeepBound: accurate identification of transcript boundaries via deep convolutional neural fields.

Authors:  Mingfu Shao; Jianzhu Ma; Sheng Wang
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

10.  Decision-Tree Based Meta-Strategy Improved Accuracy of Disorder Prediction and Identified Novel Disordered Residues Inside Binding Motifs.

Authors:  Bi Zhao; Bin Xue
Journal:  Int J Mol Sci       Date:  2018-10-07       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.