Literature DB >> 12084910

TRILOGY: Discovery of sequence-structure patterns across diverse proteins.

Philip Bradley1, Peter S Kim, Bonnie Berger.   

Abstract

We describe a new computer program, trilogy, for the automated discovery of sequence-structure patterns in proteins. trilogy implements a pattern discovery algorithm that begins with an exhaustive analysis of flexible three-residue patterns; a subset of these patterns are selected as seeds for an extension process in which longer patterns are identified. A key feature of the method is explicit treatment of both the sequence and structure components of these motifs: each trilogy pattern is a pair consisting of a sequence pattern and a structure pattern. Matches to both these component patterns are identified independently, allowing the program to assign a significance score to each sequence-structure pattern that assesses the degree of correlation between the corresponding sequence and structure motifs. trilogy identifies several thousand high-scoring patterns that occur across protein families. These include both previously identified and potentially novel motifs. We expect that these sequence-structure patterns will be useful in predicting protein structure from sequence, annotating newly determined protein structures, and identifying novel motifs of potential functional or structural significance. Further details on 7,768 significant patterns identified by trilogy can be found at http://theory.lcs.mit.edu/trilogy.

Mesh:

Substances:

Year:  2002        PMID: 12084910      PMCID: PMC124288          DOI: 10.1073/pnas.112221999

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  26 in total

1.  Database of homology-derived protein structures and the structural meaning of sequence alignment.

Authors:  C Sander; R Schneider
Journal:  Proteins       Date:  1991

2.  The helix-hairpin-helix DNA-binding motif: a structural basis for non-sequence-specific recognition of DNA.

Authors:  A J Doherty; L C Serpell; C P Ponting
Journal:  Nucleic Acids Res       Date:  1996-07-01       Impact factor: 16.971

3.  Recurring local sequence motifs in proteins.

Authors:  K F Han; D Baker
Journal:  J Mol Biol       Date:  1995-08-04       Impact factor: 5.469

4.  Pfam: multiple sequence alignments and HMM-profiles of protein domains.

Authors:  E L Sonnhammer; S R Eddy; E Birney; A Bateman; R Durbin
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

5.  Amino acid preferences for specific locations at the ends of alpha helices.

Authors:  J S Richardson; D C Richardson
Journal:  Science       Date:  1988-06-17       Impact factor: 47.728

6.  Helix signals in proteins.

Authors:  L G Presta; G D Rose
Journal:  Science       Date:  1988-06-17       Impact factor: 47.728

7.  RASMOL: biomolecular graphics for all.

Authors:  R A Sayle; E J Milner-White
Journal:  Trends Biochem Sci       Date:  1995-09       Impact factor: 13.807

8.  A revised set of potentials for beta-turn formation in proteins.

Authors:  E G Hutchinson; J M Thornton
Journal:  Protein Sci       Date:  1994-12       Impact factor: 6.725

9.  SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Authors:  A G Murzin; S E Brenner; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1995-04-07       Impact factor: 5.469

10.  Helix stop signals in proteins and peptides: the capping box.

Authors:  E T Harper; G D Rose
Journal:  Biochemistry       Date:  1993-08-03       Impact factor: 3.162

View more
  6 in total

1.  Mining frequent patterns in protein structures: a study of protease families.

Authors:  Shann-Ching Chen; Ivet Bahar
Journal:  Bioinformatics       Date:  2004-08-04       Impact factor: 6.937

2.  Characterizing the regularity of tetrahedral packing motifs in protein tertiary structure.

Authors:  Ryan Day; Kristin P Lennox; David B Dahl; Marina Vannucci; Jerry W Tsai
Journal:  Bioinformatics       Date:  2010-11-02       Impact factor: 6.937

3.  Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development.

Authors:  Deepak Bandyopadhyay; Jun Huan; Jan Prins; Jack Snoeyink; Wei Wang; Alexander Tropsha
Journal:  J Comput Aided Mol Des       Date:  2009-06-20       Impact factor: 3.686

4.  Recurrent structural motifs in non-homologous protein structures.

Authors:  Maria U Johansson; Vincent Zoete; Nicolas Guex
Journal:  Int J Mol Sci       Date:  2013-04-10       Impact factor: 5.923

5.  Towards comprehensive structural motif mining for better fold annotation in the "twilight zone" of sequence dissimilarity.

Authors:  Yi Jia; Jun Huan; Vincent Buhr; Jintao Zhang; Leonidas N Carayannopoulos
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

6.  Identification of an ideal-like fingerprint for a protein fold using overlapped conserved residues based approach.

Authors:  Amit Goyal; Sriram Sokalingam; Kyu-Suk Hwang; Sun-Gu Lee
Journal:  Sci Rep       Date:  2014-07-10       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.