Literature DB >> 8897599

Construction and analysis of a profile library characterizing groups of structurally known proteins.

A Ogiwara1, I Uchiyama, T Takagi, M Kanehisa.   

Abstract

A new sequence motif library StrProf was constructed characterizing the groups of related proteins in the PDB three-dimensional structure database. For a representative member of each protein family, which was identified by cross-referencing the PDB with the PIR superfamily classification, a group of related sequences was collected by the BLAST search against the nonredundant protein sequence database. For every group, the motifs were identified automatically according to the criteria of conservation and uniqueness of pentapeptide patterns and with a dual dynamic programming algorithm. In the StrProf library, motifs are represented by profile matrices rather than consensus patterns to allow more flexible search capabilities. Another dynamic programming algorithm was then developed to search this motif library. When the computationally derived StrProf was compared with PROSITE, which is a manually derived motif library in the best consensus pattern representation, the numbers of identified patterns were comparable. StrProf missed about one third of the PROSITE motifs, but there were also new motifs lacking in PROSITE. The new library was incorporated in SMART (Sequence Motif Analysis and Retrieval Tool), a computer tool designed to help search and annotate biologically important sites in an unknown protein sequence. The client program is available free of charge through the Internet.

Mesh:

Substances:

Year:  1996        PMID: 8897599      PMCID: PMC2143267          DOI: 10.1002/pro.5560051005

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  13 in total

1.  Construction of a dictionary of sequence motifs that characterize groups of related proteins.

Authors:  A Ogiwara; I Uchiyama; Y Seto; M Kanehisa
Journal:  Protein Eng       Date:  1992-09

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  PROSITE: a dictionary of sites and patterns in proteins.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  1992-05-11       Impact factor: 16.971

4.  Expectation maximization algorithm for identifying protein-binding sites with variable lengths from unaligned DNA fragments.

Authors:  L R Cardon; G D Stormo
Journal:  J Mol Biol       Date:  1992-01-05       Impact factor: 5.469

5.  Profile analysis: detection of distantly related proteins.

Authors:  M Gribskov; A D McLachlan; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1987-07       Impact factor: 11.205

6.  Information content of binding sites on nucleotide sequences.

Authors:  T D Schneider; G D Stormo; L Gold; A Ehrenfeucht
Journal:  J Mol Biol       Date:  1986-04-05       Impact factor: 5.469

7.  Rigorous pattern-recognition methods for DNA sequences. Analysis of promoter sequences from Escherichia coli.

Authors:  D J Galas; M Eggert; M S Waterman
Journal:  J Mol Biol       Date:  1985-11-05       Impact factor: 5.469

8.  Identifying protein-binding sites from unaligned DNA fragments.

Authors:  G D Stormo; G W Hartzell
Journal:  Proc Natl Acad Sci U S A       Date:  1989-02       Impact factor: 11.205

9.  Structure and activity of recombinant human interferon-gamma analogs.

Authors:  Y R Hsu; B Ferguson; M Narachi; R M Richards; Y Stabinsky; N K Alton; N Stebbing; T Arakawa
Journal:  J Interferon Res       Date:  1986-12

10.  Protein structural similarities predicted by a sequence-structure compatibility method.

Authors:  Y Matsuo; K Nishikawa
Journal:  Protein Sci       Date:  1994-11       Impact factor: 6.725

View more
  12 in total

1.  Repression of TFII-I-dependent transcription by nuclear exclusion.

Authors:  M I Tussié-Luna; D Bayarsaihan; F H Ruddle; A L Roy
Journal:  Proc Natl Acad Sci U S A       Date:  2001-07-03       Impact factor: 11.205

2.  Cloning and characterization of a novel membrane-associated antigenic protein of Helicobacter pylori.

Authors:  M Yoshida; Y Wakatsuki; Y Kobayashi; T Itoh; K Murakami; A Mizoguchi; T Usui; T Chiba; T Kita
Journal:  Infect Immun       Date:  1999-01       Impact factor: 3.441

3.  Protein sequence similarity searches using patterns as seeds.

Authors:  Z Zhang; A A Schäffer; W Miller; T L Madden; D J Lipman; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  1998-09-01       Impact factor: 16.971

4.  Identification and analysis of genes involved in anaerobic toluene metabolism by strain T1: putative role of a glycine free radical.

Authors:  P W Coschigano; T S Wehrman; L Y Young
Journal:  Appl Environ Microbiol       Date:  1998-05       Impact factor: 4.792

5.  Characterization of a Chlamydia psittaci DNA binding protein (EUO) synthesized during the early and middle phases of the developmental cycle.

Authors:  L Zhang; A L Douglas; T P Hatch
Journal:  Infect Immun       Date:  1998-03       Impact factor: 3.441

6.  A genome-wide survey of RS domain proteins.

Authors:  L Boucher; C A Ouzounis; A J Enright; B J Blencowe
Journal:  RNA       Date:  2001-12       Impact factor: 4.942

7.  Identification of two novel hrp-associated genes in the hrp gene cluster of Xanthomonas oryzae pv. oryzae.

Authors:  W Zhu; M M MaGbanua; F F White
Journal:  J Bacteriol       Date:  2000-04       Impact factor: 3.490

8.  Characterization of the glycoprotein B gene from ruminant alphaherpesviruses.

Authors:  Carlos Ros; Sándor Belák
Journal:  Virus Genes       Date:  2002-03       Impact factor: 2.332

9.  Transcriptional analysis of the tutE tutFDGH gene cluster from Thauera aromatica strain T1.

Authors:  P W Coschigano
Journal:  Appl Environ Microbiol       Date:  2000-03       Impact factor: 4.792

10.  Feature amplified voting algorithm for functional analysis of protein superfamily.

Authors:  Che-Lun Hung; Chihan Lee; Chun-Yuan Lin; Chih-Hung Chang; Yeh-Ching Chung; Chuan Yi Tang
Journal:  BMC Genomics       Date:  2010-12-01       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.