Literature DB >> 9096237

Protein topology recognition from secondary structure sequences: application of the hidden Markov models to the alpha class proteins.

V Di Francesco1, J Garnier, P J Munson.   

Abstract

The three-dimensional fold of a protein is described by the organization of its secondary structure elements in 3D space, i.e. its "topology". We find that the protein topology can be recognized from the ID sequence of secondary structure states of the residues alone. Automated recognition is facilitated by use of hidden Markov models (HMMs) to represent topology families of proteins. Such models can be trained on the experimentally observed secondary structure sequences of family members using well established algorithms. Here, we model various topology groups in the alpha class of proteins and identify, from a large database, those proteins having the topology described by each model. The correct topology family for protein secondary structure sequences could be recognized 12 out of 14 times. When the observed secondary structure sequences are replaced with predicted sequences recognition is still achievable 8 out of 14 times. The success rate for observed sequences indicates that our approach will become increasingly useful as the accuracy of secondary prediction algorithms is improved. Our study indicates that the HMMs are useful for protein topology recognition even when no detectable primary amino acid sequence similarity is present. To illustrate the potential utility of our method, protein topology recognition is attempted on leptin, the obese gene product, and the human interleukin-6 sequence, for which fold predictions have been previously published.

Entities:  

Mesh:

Substances:

Year:  1997        PMID: 9096237     DOI: 10.1006/jmbi.1996.0874

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  7 in total

1.  Environment-dependent residue contact energies for proteins.

Authors:  C Zhang; S H Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2000-03-14       Impact factor: 11.205

2.  Enhanced protein fold recognition using secondary structure information from NMR.

Authors:  D J Ayers; P R Gooley; A Widmer-Cooper; A E Torda
Journal:  Protein Sci       Date:  1999-05       Impact factor: 6.725

3.  A homology identification method that combines protein sequence and structure information.

Authors:  L Yu; J V White; T F Smith
Journal:  Protein Sci       Date:  1998-12       Impact factor: 6.725

4.  Seeking an ancient enzyme in Methanococcus jannaschii using ORF, a program based on predicted secondary structure comparisons.

Authors:  R Aurora; G D Rose
Journal:  Proc Natl Acad Sci U S A       Date:  1998-03-17       Impact factor: 11.205

5.  Hidden Markov Models and their Applications in Biological Sequence Analysis.

Authors:  Byung-Jun Yoon
Journal:  Curr Genomics       Date:  2009-09       Impact factor: 2.236

6.  Fold classification based on secondary structure--how much is gained by including loop topology?

Authors:  Jieun Jeong; Piotr Berman; Teresa Przytycka
Journal:  BMC Struct Biol       Date:  2006-03-08

7.  Using 3D Hidden Markov Models that explicitly represent spatial coordinates to model and compare protein structures.

Authors:  Vadim Alexandrov; Mark Gerstein
Journal:  BMC Bioinformatics       Date:  2004-01-09       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.