Literature DB >> 28369334

NeBcon: protein contact map prediction using neural network training coupled with naïve Bayes classifiers.

Baoji He1,2,3, S M Mortuza3, Yanting Wang1,2, Hong-Bin Shen3,4, Yang Zhang3,5.   

Abstract

MOTIVATION: Recent CASP experiments have witnessed exciting progress on folding large-size non-humongous proteins with the assistance of co-evolution based contact predictions. The success is however anecdotal due to the requirement of the contact prediction methods for the high volume of sequence homologs that are not available to most of the non-humongous protein targets. Development of efficient methods that can generate balanced and reliable contact maps for different type of protein targets is essential to enhance the success rate of the ab initio protein structure prediction.
RESULTS: We developed a new pipeline, NeBcon, which uses the naïve Bayes classifier (NBC) theorem to combine eight state of the art contact methods that are built from co-evolution and machine learning approaches. The posterior probabilities of the NBC model are then trained with intrinsic structural features through neural network learning for the final contact map prediction. NeBcon was tested on 98 non-redundant proteins, which improves the accuracy of the best co-evolution based meta-server predictor by 22%; the magnitude of the improvement increases to 45% for the hard targets that lack sequence and structural homologs in the databases. Detailed data analysis showed that the major contribution to the improvement is due to the optimized NBC combination of the complementary information from both co-evolution and machine learning predictions. The neural network training also helps to improve the coupling of the NBC posterior probability and the intrinsic structural features, which were found particularly important for the proteins that do not have sufficient number of homologous sequences to derive reliable co-evolution profiles. AVAILIABLITY AND IMPLEMENTATION: On-line server and standalone package of the program are available at http://zhanglab.ccmb.med.umich.edu/NeBcon/ . CONTACT: zhng@umich.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2017        PMID: 28369334      PMCID: PMC5860114          DOI: 10.1093/bioinformatics/btx164

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  37 in total

1.  TOUCHSTONE II: a new approach to ab initio protein structure prediction.

Authors:  Yang Zhang; Andrzej Kolinski; Jeffrey Skolnick
Journal:  Biophys J       Date:  2003-08       Impact factor: 4.033

2.  Three-stage prediction of protein beta-sheets by neural networks, alignments and graph algorithms.

Authors:  Jianlin Cheng; Pierre Baldi
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

3.  Assessment of domain boundary predictions and the prediction of intramolecular contacts in CASP8.

Authors:  Iakes Ezkurdia; Osvaldo Graña; José M G Izarzugaza; Michael L Tress
Journal:  Proteins       Date:  2009

4.  Recovery of protein structure from contact maps.

Authors:  M Vendruscolo; E Kussell; E Domany
Journal:  Fold Des       Date:  1997

5.  MONSSTER: a method for folding globular proteins with a small number of distance restraints.

Authors:  J Skolnick; A Kolinski; A R Ortiz
Journal:  J Mol Biol       Date:  1997-01-17       Impact factor: 5.469

6.  New encouraging developments in contact prediction: Assessment of the CASP11 results.

Authors:  Bohdan Monastyrskyy; Daniel D'Andrea; Krzysztof Fidelis; Anna Tramontano; Andriy Kryshtafovych
Journal:  Proteins       Date:  2015-11-17

7.  MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.

Authors:  David T Jones; Tanya Singh; Tomasz Kosciolek; Stuart Tetchner
Journal:  Bioinformatics       Date:  2014-11-26       Impact factor: 6.937

8.  CCMpred--fast and precise prediction of protein residue-residue contacts from correlated mutations.

Authors:  Stefan Seemayer; Markus Gruber; Johannes Söding
Journal:  Bioinformatics       Date:  2014-07-26       Impact factor: 6.937

9.  Evaluation of free modeling targets in CASP11 and ROLL.

Authors:  Lisa N Kinch; Wenlin Li; Bohdan Monastyrskyy; Andriy Kryshtafovych; Nick V Grishin
Journal:  Proteins       Date:  2016-01-20

10.  LOMETS: a local meta-threading-server for protein structure prediction.

Authors:  Sitao Wu; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2007-05-03       Impact factor: 16.971

View more
  27 in total

1.  Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.

Authors:  Susann Vorberg; Stefan Seemayer; Johannes Söding
Journal:  PLoS Comput Biol       Date:  2018-11-05       Impact factor: 4.475

2.  Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12.

Authors:  Chengxin Zhang; S M Mortuza; Baoji He; Yanting Wang; Yang Zhang
Journal:  Proteins       Date:  2017-11-14

3.  Deep-learning contact-map guided protein structure prediction in CASP13.

Authors:  Wei Zheng; Yang Li; Chengxin Zhang; Robin Pearce; S M Mortuza; Yang Zhang
Journal:  Proteins       Date:  2019-08-14

4.  I-TASSER gateway: A protein structure and function prediction server powered by XSEDE.

Authors:  Wei Zheng; Chengxin Zhang; Eric W Bell; Yang Zhang
Journal:  Future Gener Comput Syst       Date:  2019-04-09       Impact factor: 7.187

5.  ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks.

Authors:  Yang Li; Jun Hu; Chengxin Zhang; Dong-Jun Yu; Yang Zhang
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

6.  DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.

Authors:  Chengxin Zhang; Wei Zheng; S M Mortuza; Yang Li; Yang Zhang
Journal:  Bioinformatics       Date:  2020-04-01       Impact factor: 6.937

7.  Ensembling multiple raw coevolutionary features with deep residual neural networks for contact-map prediction in CASP13.

Authors:  Yang Li; Chengxin Zhang; Eric W Bell; Dong-Jun Yu; Yang Zhang
Journal:  Proteins       Date:  2019-08-22

8.  Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.

Authors:  Yang Li; Chengxin Zhang; Eric W Bell; Wei Zheng; Xiaogen Zhou; Dong-Jun Yu; Yang Zhang
Journal:  PLoS Comput Biol       Date:  2021-03-26       Impact factor: 4.475

9.  Protein contact prediction using metagenome sequence data and residual neural networks.

Authors:  Qi Wu; Zhenling Peng; Ivan Anishchenko; Qian Cong; David Baker; Jianyi Yang
Journal:  Bioinformatics       Date:  2020-01-01       Impact factor: 6.937

10.  Protein inter-residue contact and distance prediction by coupling complementary coevolution features with deep residual networks in CASP14.

Authors:  Yang Li; Chengxin Zhang; Wei Zheng; Xiaogen Zhou; Eric W Bell; Dong-Jun Yu; Yang Zhang
Journal:  Proteins       Date:  2021-08-19
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.