Literature DB >> 10975570

Improving the quality of twilight-zone alignments.

L Jaroszewski1, L Rychlewski, A Godzik .   

Abstract

Several recent publications illustrated advantages of using sequence profiles in recognizing distant homologies between proteins. At the same time, the practical usefulness of distant homology recognition depends not only on the sensitivity of the algorithm, but also on the quality of the alignment between a prediction target and the template from the database of known proteins. Here, we study this question for several supersensitive protein algorithms that were previously compared in their recognition sensitivity (Rychlewski et al., 2000). A database of protein pairs with similar structures, but low sequence similarity is used to rate the alignments obtained with several different methods, which included sequence-sequence, sequence-profile, and profile-profile alignment methods. We show that incorporation of evolutionary information encoded in sequence profiles into alignment calculation methods significantly increases the alignment accuracy, bringing them closer to the alignments obtained from structure comparison. In general, alignment quality is correlated with recognition and alignment score significance. For every alignment method, alignments with statistically significant scores correlate with both correct structural templates and good quality alignments. At the same time, average alignment lengths differ in various methods, making the comparison between them difficult. For instance, the alignments obtained by FFAS, the profile-profile alignment algorithm developed in our group are always longer that the alignments obtained with the PSI-BLAST algorithms. To address this problem, we develop methods to truncate or extend alignments to cover a specified percentage of protein lengths. In most cases, the elongation of the alignment by profile-profile methods is reasonable, adding fragments of similar structure. The examples of erroneous alignment are examined and it is shown that they can be identified based on the model quality.

Mesh:

Substances:

Year:  2000        PMID: 10975570      PMCID: PMC2144727          DOI: 10.1110/ps.9.8.1487

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  41 in total

1.  Threading with explicit models for evolutionary conservation of structure and sequence.

Authors:  A Panchenko; A Marchler-Bauer; S H Bryant
Journal:  Proteins       Date:  1999

2.  The Pfam protein families database.

Authors:  A Bateman; E Birney; R Durbin; S R Eddy; K L Howe; E L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Comparison of sequence profiles. Strategies for structural predictions using sequence information.

Authors:  L Rychlewski; L Jaroszewski; W Li; A Godzik
Journal:  Protein Sci       Date:  2000-02       Impact factor: 6.725

4.  Topology fingerprint approach to the inverse protein folding problem.

Authors:  A Godzik; A Kolinski; J Skolnick
Journal:  J Mol Biol       Date:  1992-09-05       Impact factor: 5.469

5.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

6.  Suboptimal sequence alignment in molecular biology. Alignment with error analysis.

Authors:  M Zuker
Journal:  J Mol Biol       Date:  1991-09-20       Impact factor: 5.469

7.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

8.  Regularities in interaction patterns of globular proteins.

Authors:  A Godzik; J Skolnick; A Kolinski
Journal:  Protein Eng       Date:  1993-11

9.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

10.  Structure of donor side components in photosystem II predicted by computer modelling.

Authors:  B Svensson; I Vass; E Cedergren; S Styring
Journal:  EMBO J       Date:  1990-07       Impact factor: 11.598

View more
  42 in total

1.  More for less in structural genomics.

Authors:  A Heger; L Holm
Journal:  J Struct Funct Genomics       Date:  2003

2.  In search for more accurate alignments in the twilight zone.

Authors:  Lukasz Jaroszewski; Weizhong Li; Adam Godzik
Journal:  Protein Sci       Date:  2002-07       Impact factor: 6.725

3.  Sequence conserved for subcellular localization.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

4.  Homology modeling provides insights into the binding mode of the PAAD/DAPIN/pyrin domain, a fourth member of the CARD/DD/DED domain family.

Authors:  Tong Liu; Ana Rojas; Yuzhen Ye; Adam Godzik
Journal:  Protein Sci       Date:  2003-09       Impact factor: 6.725

5.  Wurst: a protein threading server with a structural scoring function, sequence profiles and optimized substitution matrices.

Authors:  Andrew E Torda; James B Procter; Thomas Huber
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

6.  Protein structure prediction and analysis using the Robetta server.

Authors:  David E Kim; Dylan Chivian; David Baker
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

7.  Alignment of protein sequences by their profiles.

Authors:  Marc A Marti-Renom; M S Madhusudhan; Andrej Sali
Journal:  Protein Sci       Date:  2004-04       Impact factor: 6.725

Review 8.  Structural correlates of selectivity and inactivation in potassium channels.

Authors:  Jason G McCoy; Crina M Nimigean
Journal:  Biochim Biophys Acta       Date:  2011-09-16

Review 9.  Caspases. Regulating death since the origin of life.

Authors:  Maite Sanmartín; Lukasz Jaroszewski; Natasha V Raikhel; Enrique Rojo
Journal:  Plant Physiol       Date:  2005-03       Impact factor: 8.340

10.  PvdP is a tyrosinase that drives maturation of the pyoverdine chromophore in Pseudomonas aeruginosa.

Authors:  Pol Nadal-Jimenez; Gudrun Koch; Carlos R Reis; Remco Muntendam; Hans Raj; C Margot Jeronimus-Stratingh; Robbert H Cool; Wim J Quax
Journal:  J Bacteriol       Date:  2014-05-09       Impact factor: 3.490

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.