Literature DB >> 7787038

Protein sequence randomness and sequence/structure correlations.

R S Rahman1, S Rackovsky.   

Abstract

We investigated protein sequence/structure correlation by constructing a space of protein sequences, based on methods developed previously for constructing a space of protein structures. The space is constructed by using a representation of the amino acids as vectors of 10 property factors that encode almost all of their physical properties. Each sequence is represented by a distribution of overlapping sequence fragments. A distance between any two sequences can be calculated. By attaching a weight to each factor, intersequence distances can be varied. We optimize the correlation between corresponding distances in the sequence and structure spaces. The optimal correlation between the sequence and structure spaces is significantly better than that which results from correlating randomly generated sequences, having the overall composition of the data base, with the structure space. However, sets of randomly generated sequences, each of which approximates the composition of the real sequence it replaces, produce correlations with the structure space that are as good as that observed for the actual protein sequences. A connection is proposed with previous studies of the protein folding code. It is shown that the most important property factors for the correlation of the sequence and structure spaces are related to helix/bend preference, side chain bulk, and beta-structure preference.

Mesh:

Substances:

Year:  1995        PMID: 7787038      PMCID: PMC1282047          DOI: 10.1016/S0006-3495(95)80325-5

Source DB:  PubMed          Journal:  Biophys J        ISSN: 0006-3495            Impact factor:   4.033


  38 in total

1.  Quantitative organization of the known protein x-ray structures. I. Methods and short-length-scale results.

Authors:  S Rackovsky
Journal:  Proteins       Date:  1990

2.  Implications of thermodynamics of protein folding for evolution of primary sequences.

Authors:  E I Shakhnovich; A M Gutin
Journal:  Nature       Date:  1990-08-23       Impact factor: 49.962

3.  Theory for protein mutability and biogenesis.

Authors:  K F Lau; K A Dill
Journal:  Proc Natl Acad Sci U S A       Date:  1990-01       Impact factor: 11.205

4.  Protein database searches for multiple alignments.

Authors:  S F Altschul; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1990-07       Impact factor: 11.205

5.  Flexible protein sequence patterns. A sensitive method to detect weak structural similarities.

Authors:  G J Barton; M J Sternberg
Journal:  J Mol Biol       Date:  1990-03-20       Impact factor: 5.469

6.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

Authors:  S Karlin; S F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  1990-03       Impact factor: 11.205

7.  Linguistic measure of taxonomic and functional relatedness of nucleotide sequences.

Authors:  S Pietrokovski; J Hirshon; E N Trifonov
Journal:  J Biomol Struct Dyn       Date:  1990-06

8.  Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a computer-generated model system.

Authors:  B E Blaisdell
Journal:  J Mol Evol       Date:  1989-12       Impact factor: 2.395

9.  Nonlinear methods for discrimination and their application to classification of protein structures.

Authors:  P Klein; R L Somorjai
Journal:  J Theor Biol       Date:  1988-02-21       Impact factor: 2.691

10.  On the nature of the protein folding code.

Authors:  S Rackovsky
Journal:  Proc Natl Acad Sci U S A       Date:  1993-01-15       Impact factor: 11.205

View more
  1 in total

1.  "Hidden" sequence periodicities and protein architecture.

Authors:  S Rackovsky
Journal:  Proc Natl Acad Sci U S A       Date:  1998-07-21       Impact factor: 11.205

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.