Literature DB >> 9865944

A homology identification method that combines protein sequence and structure information.

L Yu1, J V White, T F Smith.   

Abstract

A new method is presented for identifying distantly related homologous proteins that are unrecognizable by conventional sequence comparison methods. The method combines information about functionally conserved sequence patterns with information about structure context. This information is encoded in stochastic discrete state-space models (DSMs) that comprise a new family of hidden Markov models. The new models are called sequence-pattern-embedded DSMs (pDSMs). This method can identify distantly related protein family members with a high sensitivity and specificity. The method is illustrated with trypsin-like serine proteases and globins. The strategy for building pDSMs is presented. The method has been validated using carefully constructed positive and negative control sets. In addition to the ability to recognize remote homologs, pDSM sequence analysis predicts secondary structures with higher sensitivity, specificity, and Q3 accuracy than DSM analysis, which omits information about conserved sequence patterns. The identification of trypsin-like serine proteases in new genomes is discussed.

Mesh:

Substances:

Year:  1998        PMID: 9865944      PMCID: PMC2143896          DOI: 10.1002/pro.5560071203

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  45 in total

1.  PROSITE: a dictionary of sites and patterns in proteins.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  1991-04-25       Impact factor: 16.971

2.  Multiple domain protein diagnostic patterns.

Authors:  R M Adams; S Das; T F Smith
Journal:  Protein Sci       Date:  1996-07       Impact factor: 6.725

Review 3.  Hidden Markov models.

Authors:  S R Eddy
Journal:  Curr Opin Struct Biol       Date:  1996-06       Impact factor: 6.809

Review 4.  Surprising similarities in structure comparison.

Authors:  J F Gibrat; T Madej; S H Bryant
Journal:  Curr Opin Struct Biol       Date:  1996-06       Impact factor: 6.809

Review 5.  Structural features of a superfamily of zinc-endopeptidases: the metzincins.

Authors:  W Stöcker; W Bode
Journal:  Curr Opin Struct Biol       Date:  1995-06       Impact factor: 6.809

6.  DNA polymerase beta belongs to an ancient nucleotidyltransferase superfamily.

Authors:  L Holm; C Sander
Journal:  Trends Biochem Sci       Date:  1995-09       Impact factor: 13.807

7.  Global optimum protein threading with gapped alignment and empirical pair score functions.

Authors:  R H Lathrop; T F Smith
Journal:  J Mol Biol       Date:  1996-02-02       Impact factor: 5.469

8.  The ENZYME data bank.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  1994-09       Impact factor: 16.971

9.  GenBank.

Authors:  D A Benson; M Boguski; D J Lipman; J Ostell
Journal:  Nucleic Acids Res       Date:  1994-09       Impact factor: 16.971

10.  Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii.

Authors:  C J Bult; O White; G J Olsen; L Zhou; R D Fleischmann; G G Sutton; J A Blake; L M FitzGerald; R A Clayton; J D Gocayne; A R Kerlavage; B A Dougherty; J F Tomb; M D Adams; C I Reich; R Overbeek; E F Kirkness; K G Weinstock; J M Merrick; A Glodek; J L Scott; N S Geoghagen; J C Venter
Journal:  Science       Date:  1996-08-23       Impact factor: 47.728

View more
  6 in total

1.  Thirty-plus functional families from a single motif.

Authors:  L Yu; C Gaitatzes; E Neer; T F Smith
Journal:  Protein Sci       Date:  2000-12       Impact factor: 6.725

2.  Fungi and animals may share a common ancestor to nuclear receptors.

Authors:  Chris Phelps; Valentina Gburcik; Elena Suslova; Peter Dudek; Fedor Forafonov; Nathalie Bot; Morag MacLean; Richard J Fagan; Didier Picard
Journal:  Proc Natl Acad Sci U S A       Date:  2006-04-24       Impact factor: 11.205

3.  Functional divergence of Kaposi's sarcoma-associated herpesvirus and related gamma-2 herpesvirus thymidine kinases: novel cytoplasmic phosphoproteins that alter cellular morphology and disrupt adhesion.

Authors:  Michael B Gill; Jo-Ellen Murphy; Joyce D Fingeroth
Journal:  J Virol       Date:  2005-12       Impact factor: 5.103

4.  Comparative model building of interleukin-7 using interleukin-4 as a template: a structural hypothesis that displays atypical surface chemistry in helix D important for receptor activation.

Authors:  L Cosenza; A Rosenbach; J V White; J R Murphy; T Smith
Journal:  Protein Sci       Date:  2000-05       Impact factor: 6.725

5.  Protein family comparison using statistical models and predicted structural information.

Authors:  Richard Chung; Golan Yona
Journal:  BMC Bioinformatics       Date:  2004-11-25       Impact factor: 3.169

6.  Identification of an ideal-like fingerprint for a protein fold using overlapped conserved residues based approach.

Authors:  Amit Goyal; Sriram Sokalingam; Kyu-Suk Hwang; Sun-Gu Lee
Journal:  Sci Rep       Date:  2014-07-10       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.