Literature DB >> 10975579

Use of residue pairs in protein sequence-sequence and sequence-structure alignments.

J Jung1, B Lee.   

Abstract

Two new sets of scoring matrices are introduced: H2 for the protein sequence comparison and T2 for the protein sequence-structure correlation. Each element of H2 or T2 measures the frequency with which a pair of amino acid types in one protein, k-residues apart in the sequence, is aligned with another pair of residues, of given amino acid types (for H2) or in given structural states (for T2), in other structurally homologous proteins. There are four types, corresponding to the k-values of 1 to 4, for both H2 and T2. These matrices were set up using a large number of structurally homologous protein pairs, with little sequence homology between the pair, that were recently generated using the structure comparison program SHEBA. The two scoring matrices were incorporated into the main body of the sequence alignment program SSEARCH in the FASTA package and tested in a fold recognition setting in which a set of 107 test sequences were aligned to each of a panel of 3,539 domains that represent all known protein structures. Six procedures were tested; the straight Smith-Waterman (SW) and FASTA procedures, which used the Blosum62 single residue type substitution matrix; BLAST and PSI-BLAST procedures, which also used the Blosum62 matrix; PASH, which used Blosum62 and H2 matrices; and PASSC, which used Blosum62, H2, and T2 matrices. All procedures gave similar results when the probe and target sequences had greater than 30% sequence identity. However, when the sequence identity was below 30%, a similar structure could be found for more sequences using PASSC than using any other procedure. PASH and PSI-BLAST gave the next best results.

Mesh:

Year:  2000        PMID: 10975579      PMCID: PMC2144723          DOI: 10.1110/ps.9.8.1576

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  42 in total

1.  GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-04-09       Impact factor: 5.469

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  A method to identify protein sequences that fold into a known three-dimensional structure.

Authors:  J U Bowie; R Lüthy; D Eisenberg
Journal:  Science       Date:  1991-07-12       Impact factor: 47.728

4.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

Authors:  S Karlin; S F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  1990-03       Impact factor: 11.205

5.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

6.  Method for clustering proteins by use of all possible pairs of amino acids as structural descriptors.

Authors:  S Nakayama; S Shigezumi; M Yoshida
Journal:  J Chem Inf Comput Sci       Date:  1988-05

7.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

8.  Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors:  W Kabsch; C Sander
Journal:  Biopolymers       Date:  1983-12       Impact factor: 2.505

9.  A new family of powerful multivariate statistical sequence analysis techniques.

Authors:  M van Heel
Journal:  J Mol Biol       Date:  1991-08-20       Impact factor: 5.469

10.  Amino acid substitution matrices from an information theoretic perspective.

Authors:  S F Altschul
Journal:  J Mol Biol       Date:  1991-06-05       Impact factor: 5.469

View more
  5 in total

1.  Sequence context-specific profiles for homology searching.

Authors:  A Biegert; J Söding
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-20       Impact factor: 11.205

2.  New amino acid substitution matrix brings sequence alignments into agreement with structure matches.

Authors:  Kejue Jia; Robert L Jernigan
Journal:  Proteins       Date:  2021-02-02

3.  Protein sequence and structure alignments within one framework.

Authors:  Gundolf Schenk; Thomas Margraf; Andrew E Torda
Journal:  Algorithms Mol Biol       Date:  2008-04-01       Impact factor: 1.405

4.  Revisiting amino acid substitution matrices for identifying distantly related proteins.

Authors:  Kazunori Yamada; Kentaro Tomii
Journal:  Bioinformatics       Date:  2013-11-26       Impact factor: 6.937

5.  Profile Comparer Extended: phylogeny of lytic polysaccharide monooxygenase families using profile hidden Markov model alignments.

Authors:  Gerben P Voshol; Peter J Punt; Erik Vijgenboom
Journal:  F1000Res       Date:  2019-10-31
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.