Literature DB >> 19772383

BCL::contact-low confidence fold recognition hits boost protein contact prediction and de novo structure determination.

Mert Karakaş1, Nils Woetzel, Jens Meiler.   

Abstract

Knowledge of all residue-residue contacts within a protein allows determination of the protein fold. Accurate prediction of even a subset of long-range contacts (contacts between amino acids far apart in sequence) can be instrumental for determining tertiary structure. Here we present BCL::Contact, a novel contact prediction method that utilizes artificial neural networks (ANNs) and specializes in the prediction of medium to long-range contacts. BCL::Contact comes in two modes: sequence-based and structure-based. The sequence-based mode uses only sequence information and has individual ANNs specialized for helix-helix, helix-strand, strand-helix, strand-strand, and sheet-sheet contacts. The structure-based mode combines results from 32-fold recognition methods with sequence information to a consensus prediction. The two methods were presented in the 6(th) and 7(th) Critical Assessment of Techniques for Protein Structure Prediction (CASP) experiments. The present work focuses on elucidating the impact of fold recognition results onto contact prediction via a direct comparison of both methods on a joined benchmark set of proteins. The sequence-based mode predicted contacts with 42% accuracy (7% false positive rate), while the structure-based mode achieved 45% accuracy (2% false positive rate). Predictions by both modes of BCL::Contact were supplied as input to the protein tertiary structure prediction program Rosetta for a benchmark of 17 proteins with no close sequence homologs in the protein data bank (PDB). Rosetta created higher accuracy models, signified by an improvement of 1.3 A on average root mean square deviation (RMSD), when driven by the predicted contacts. Further, filtering Rosetta models by agreement with the predicted contacts enriches for native-like fold topologies.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 19772383      PMCID: PMC3148831          DOI: 10.1089/cmb.2009.0030

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  64 in total

1.  Ab initio fold prediction of small helical proteins using distance geometry and knowledge-based scoring functions.

Authors:  E S Huang; R Samudrala; J W Ponder
Journal:  J Mol Biol       Date:  1999-07-02       Impact factor: 5.469

2.  GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-04-09       Impact factor: 5.469

3.  Coevolving protein residues: maximum likelihood identification and relationship to structure.

Authors:  D D Pollock; W R Taylor; N Goldman
Journal:  J Mol Biol       Date:  1999-03-19       Impact factor: 5.469

4.  A neural network based predictor of residue contacts in proteins.

Authors:  P Fariselli; R Casadio
Journal:  Protein Eng       Date:  1999-01

5.  LiveBench-8: the large-scale, continuous assessment of automated protein structure prediction.

Authors:  Leszek Rychlewski; Daniel Fischer
Journal:  Protein Sci       Date:  2005-01       Impact factor: 6.725

6.  Protein distance constraints predicted by neural networks and probability density functions.

Authors:  O Lund; K Frimand; J Gorodkin; H Bohr; J Bohr; J Hansen; S Brunak
Journal:  Protein Eng       Date:  1997-11

7.  Recognition of analogous and homologous protein folds--assessment of prediction success and associated alignment accuracy using empirical substitution matrices.

Authors:  R B Russell; M A Saqi; P A Bates; R A Sayle; M J Sternberg
Journal:  Protein Eng       Date:  1998-01

8.  Improving contact predictions by the combination of correlated mutations and other sources of sequence information.

Authors:  O Olmea; A Valencia
Journal:  Fold Des       Date:  1997

Review 9.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

10.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions.

Authors:  K T Simons; C Kooperberg; E Huang; D Baker
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

View more
  4 in total

1.  Lead discovery and in silico 3D structure modeling of tumorigenic FAM72A (p17).

Authors:  Subrata Pramanik; Arne Kutzner; Klaus Heese
Journal:  Tumour Biol       Date:  2014-09-20

2.  Combining physicochemical and evolutionary information for protein contact prediction.

Authors:  Michael Schneider; Oliver Brock
Journal:  PLoS One       Date:  2014-10-22       Impact factor: 3.240

3.  Membrane protein contact and structure prediction using co-evolution in conjunction with machine learning.

Authors:  Pedro L Teixeira; Jeff L Mendenhall; Sten Heinze; Brian Weiner; Marcin J Skwark; Jens Meiler
Journal:  PLoS One       Date:  2017-05-24       Impact factor: 3.240

4.  EPSILON-CP: using deep learning to combine information from multiple sources for protein contact prediction.

Authors:  Kolja Stahl; Michael Schneider; Oliver Brock
Journal:  BMC Bioinformatics       Date:  2017-06-17       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.