Literature DB >> 2319605

Flexible protein sequence patterns. A sensitive method to detect weak structural similarities.

G J Barton1, M J Sternberg.   

Abstract

The concept of a flexible protein sequence pattern is defined. In contrast to conventional pattern matching, template or sequence alignment methods, flexible patterns allow residue patterns typical of a complete protein fold to be developed in terms of residue positions (elements), separated by gaps of defined range. An efficient dynamic programming algorithm is presented to enable the best alignment(s) of a pattern with a sequence to be identified. The flexible pattern method is evaluated in detail by reference to the globin protein family, and by comparison to alignment techniques that exploit single sequence, multiple sequence and secondary structural information. A flexible pattern derived from seven globins aligned on structural criteria successfully discriminates all 345 globins from non-globins in the Protein Identification Resource database. Furthermore, a pattern that uses helical regions from just human alpha-haemoglobin identified 337 globins compared to 318 for the best non-pattern global alignment method. Patterns derived from successively fewer, yet more highly conserved positions in a structural alignment of seven globins show that as few as 38 residue positions (25 buried hydrophobic, 4 exposed and 9 others) may be used to uniquely identify the globin fold. The study suggests that flexible patterns gain discriminating power both by discarding regions known to vary within the protein family, and by defining gaps within specific ranges. Flexible patterns therefore provide a convenient and powerful bridge between regular expression pattern matching techniques and more conventional local and global sequence comparison algorithms.

Entities:  

Mesh:

Substances:

Year:  1990        PMID: 2319605     DOI: 10.1016/0022-2836(90)90133-7

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  16 in total

1.  The morph server: a standardized system for analyzing and visualizing macromolecular motions in a database framework.

Authors:  W G Krebs; M Gerstein
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

2.  Homology modeling and molecular dynamics simulation studies of an inward rectifier potassium channel.

Authors:  C E Capener; I H Shrivastava; K M Ranatunga; L R Forrest; G R Smith; M S Sansom
Journal:  Biophys J       Date:  2000-06       Impact factor: 4.033

3.  CORA--topological fingerprints for protein structural families.

Authors:  C A Orengo
Journal:  Protein Sci       Date:  1999-04       Impact factor: 6.725

Review 4.  Advances in homology protein structure modeling.

Authors:  Zhexin Xiang
Journal:  Curr Protein Pept Sci       Date:  2006-06       Impact factor: 3.272

5.  RIP-140 interacts with multiple nuclear receptors by means of two distinct sites.

Authors:  F L'Horset; S Dauvois; D M Heery; V Cavaillès; M G Parker
Journal:  Mol Cell Biol       Date:  1996-11       Impact factor: 4.272

6.  The gene G13 in the class III region of the human MHC encodes a potential DNA-binding protein.

Authors:  A Khanna; R D Campbell
Journal:  Biochem J       Date:  1996-10-01       Impact factor: 3.857

7.  Residue-residue contact substitution probabilities derived from aligned three-dimensional structures and the identification of common folds.

Authors:  M A Rodionov; M S Johnson
Journal:  Protein Sci       Date:  1994-12       Impact factor: 6.725

8.  Protein sequence randomness and sequence/structure correlations.

Authors:  R S Rahman; S Rackovsky
Journal:  Biophys J       Date:  1995-04       Impact factor: 4.033

9.  Protein structural similarities predicted by a sequence-structure compatibility method.

Authors:  Y Matsuo; K Nishikawa
Journal:  Protein Sci       Date:  1994-11       Impact factor: 6.725

10.  An investigation of the role of Glu-842, Glu-844 and His-846 in the function of the cytoplasmic domain of the epidermal growth factor receptor.

Authors:  J F Timms; M E Noble; M Gregoriou
Journal:  Biochem J       Date:  1995-05-15       Impact factor: 3.857

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.