Literature DB >> 19289446

Using multi-data hidden Markov models trained on local neighborhoods of protein structure to predict residue-residue contacts.

Patrik Björkholm1, Pawel Daniluk, Andriy Kryshtafovych, Krzysztof Fidelis, Robin Andersson, Torgeir R Hvidsten.   

Abstract

MOTIVATION: Correct prediction of residue-residue contacts in proteins that lack good templates with known structure would take ab initio protein structure prediction a large step forward. The lack of correct contacts, and in particular long-range contacts, is considered the main reason why these methods often fail.
RESULTS: We propose a novel hidden Markov model (HMM)-based method for predicting residue-residue contacts from protein sequences using as training data homologous sequences, predicted secondary structure and a library of local neighborhoods (local descriptors of protein structure). The library consists of recurring structural entities incorporating short-, medium- and long-range interactions and is general enough to reassemble the cores of nearly all proteins in the PDB. The method is tested on an external test set of 606 domains with no significant sequence similarity to the training set as well as 151 domains with SCOP folds not present in the training set. Considering the top 0.2 x L predictions (L = sequence length), our HMMs obtained an accuracy of 22.8% for long-range interactions in new fold targets, and an average accuracy of 28.6% for long-, medium- and short-range contacts. This is a significant performance increase over currently available methods when comparing against results published in the literature. AVAILABILITY: http://predictioncenter.org/Services/FragHMMent/.

Mesh:

Substances:

Year:  2009        PMID: 19289446      PMCID: PMC2677742          DOI: 10.1093/bioinformatics/btp149

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  30 in total

1.  The ASTRAL compendium for protein structure and sequence analysis.

Authors:  S E Brenner; P Koehl; M Levitt
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The PSIPRED protein structure prediction server.

Authors:  L J McGuffin; K Bryson; D T Jones
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

3.  Of men and machines.

Authors:  Anna Tramontano
Journal:  Nat Struct Biol       Date:  2003-02

4.  TOUCHSTONE II: a new approach to ab initio protein structure prediction.

Authors:  Yang Zhang; Andrzej Kolinski; Jeffrey Skolnick
Journal:  Biophys J       Date:  2003-08       Impact factor: 4.033

5.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

6.  Protein contact prediction using patterns of correlation.

Authors:  Nicholas Hamilton; Kevin Burrage; Mark A Ragan; Thomas Huber
Journal:  Proteins       Date:  2004-09-01

7.  Development and large scale benchmark testing of the PROSPECTOR_3 threading algorithm.

Authors:  Jeffrey Skolnick; Daisuke Kihara; Yang Zhang
Journal:  Proteins       Date:  2004-08-15

8.  Using substitution probabilities to improve position-specific scoring matrices.

Authors:  J G Henikoff; S Henikoff
Journal:  Comput Appl Biosci       Date:  1996-04

9.  Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations?

Authors:  I N Shindyalov; N A Kolchanov; C Sander
Journal:  Protein Eng       Date:  1994-03

10.  SCOP database in 2004: refinements integrate structure and sequence family data.

Authors:  Antonina Andreeva; Dave Howorth; Steven E Brenner; Tim J P Hubbard; Cyrus Chothia; Alexey G Murzin
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

View more
  20 in total

1.  Efficient algorithms for training the parameters of hidden Markov models using stochastic expectation maximization (EM) training and Viterbi training.

Authors:  Tin Y Lam; Irmtraud M Meyer
Journal:  Algorithms Mol Biol       Date:  2010-12-09       Impact factor: 1.405

2.  Predicting protein residue-residue contacts using deep networks and boosting.

Authors:  Jesse Eickholt; Jianlin Cheng
Journal:  Bioinformatics       Date:  2012-10-09       Impact factor: 6.937

3.  Deep architectures for protein contact map prediction.

Authors:  Pietro Di Lena; Ken Nagata; Pierre Baldi
Journal:  Bioinformatics       Date:  2012-07-30       Impact factor: 6.937

4.  Evaluation of residue-residue contact predictions in CASP9.

Authors:  Bohdan Monastyrskyy; Krzysztof Fidelis; Anna Tramontano; Andriy Kryshtafovych
Journal:  Proteins       Date:  2011-09-17

5.  Recognizing protein substructure similarity using segmental threading.

Authors:  Sitao Wu; Yang Zhang
Journal:  Structure       Date:  2010-07-14       Impact factor: 5.006

6.  Evaluation of residue-residue contact prediction in CASP10.

Authors:  Bohdan Monastyrskyy; Daniel D'Andrea; Krzysztof Fidelis; Anna Tramontano; Andriy Kryshtafovych
Journal:  Proteins       Date:  2013-08-31

7.  Protein Residue Contacts and Prediction Methods.

Authors:  Badri Adhikari; Jianlin Cheng
Journal:  Methods Mol Biol       Date:  2016

Review 8.  Finding the needle in the haystack: towards solving the protein-folding problem computationally.

Authors:  Bian Li; Michaela Fooksa; Sten Heinze; Jens Meiler
Journal:  Crit Rev Biochem Mol Biol       Date:  2017-10-04       Impact factor: 8.250

9.  Local descriptors of protein structure: a systematic analysis of the sequence-structure relationship in proteins using short- and long-range interactions.

Authors:  Torgeir R Hvidsten; Andriy Kryshtafovych; Krzysztof Fidelis
Journal:  Proteins       Date:  2009-06

Review 10.  Recent applications of deep learning and machine intelligence on in silico drug discovery: methods, tools and databases.

Authors:  Ahmet Sureyya Rifaioglu; Heval Atas; Maria Jesus Martin; Rengul Cetin-Atalay; Volkan Atalay; Tunca Doğan
Journal:  Brief Bioinform       Date:  2019-09-27       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.