Literature DB >> 12070323

In search for more accurate alignments in the twilight zone.

Lukasz Jaroszewski1, Weizhong Li, Adam Godzik.   

Abstract

A major bottleneck in comparative modeling is the alignment quality; this is especially true for proteins whose distant relationships could be reliably recognized only by recent advances in fold recognition. The best algorithms excel in recognizing distant homologs but often produce incorrect alignments for over 50% of protein pairs in large fold-prediction benchmarks. The alignments obtained by sequence-sequence or sequence-structure matching algorithms differ significantly from the structural alignments. To study this problem, we developed a simplified method to explicitly enumerate all possible alignments for a pair of proteins. This allowed us to estimate the number of significantly different alignments for a given scoring method that score better than the structural alignment. Using several examples of distantly related proteins, we show that for standard sequence-sequence alignment methods, the number of significantly different alignments is usually large, often about 10(10) alternatives. This distance decreases when the alignment method is improved, but the number is still too large for the brute force enumeration approach. More effective strategies were needed, so we evaluated and compared two well-known approaches for searching the space of suboptimal alignments. We combined their best features and produced a hybrid method, which yielded alignments that surpassed the original alignments for about 50% of protein pairs with minimal computational effort.

Mesh:

Year:  2002        PMID: 12070323      PMCID: PMC2373660          DOI: 10.1110/ps.4820102

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  39 in total

1.  CASP3 comparative modeling evaluation.

Authors:  T Alwyn Jones; G J Kleywegt
Journal:  Proteins       Date:  1999

2.  The Protein Data Bank and the challenge of structural genomics.

Authors:  H M Berman; T N Bhat; P E Bourne; Z Feng; G Gilliland; H Weissig; J Westbrook
Journal:  Nat Struct Biol       Date:  2000-11

Review 3.  Fold predictions for bacterial genomes.

Authors:  K Pawlowski; L Rychlewski; B Zhang; A Godzik
Journal:  J Struct Biol       Date:  2001 May-Jun       Impact factor: 2.867

4.  Analysis and assessment of comparative modeling predictions in CASP4.

Authors:  A Tramontano; R Leplae; V Morea
Journal:  Proteins       Date:  2001

5.  Assessment of the CASP4 fold recognition category.

Authors:  M J Sippl; P Lackner; F S Domingues; A Prlić; R Malik; A Andreeva; M Wiederstein
Journal:  Proteins       Date:  2001

Review 6.  Determination of reliable regions in protein sequence alignments.

Authors:  M Vingron; P Argos
Journal:  Protein Eng       Date:  1990-07

7.  Protein model structure evaluation using the solvation free energy of folding.

Authors:  L Chiche; L M Gregoret; F E Cohen; P A Kollman
Journal:  Proc Natl Acad Sci U S A       Date:  1990-04       Impact factor: 11.205

8.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

9.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

10.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

View more
  18 in total

1.  Can correct protein models be identified?

Authors:  Björn Wallner; Arne Elofsson
Journal:  Protein Sci       Date:  2003-05       Impact factor: 6.725

2.  Homology modeling provides insights into the binding mode of the PAAD/DAPIN/pyrin domain, a fourth member of the CARD/DD/DED domain family.

Authors:  Tong Liu; Ana Rojas; Yuzhen Ye; Adam Godzik
Journal:  Protein Sci       Date:  2003-09       Impact factor: 6.725

3.  Solution structure of a putative ribosome binding protein from Mycoplasma pneumoniae and comparison to a distant homolog.

Authors:  Seth M Rubin; Jeffrey G Pelton; Hisao Yokota; Rosalind Kim; David E Wemmer
Journal:  J Struct Funct Genomics       Date:  2003

4.  A novel approach to structural alignment using realistic structural and environmental information.

Authors:  Yu Chen; Gordon M Crippen
Journal:  Protein Sci       Date:  2005-10-31       Impact factor: 6.725

5.  ProbCons: Probabilistic consistency-based multiple sequence alignment.

Authors:  Chuong B Do; Mahathi S P Mahabhashyam; Michael Brudno; Serafim Batzoglou
Journal:  Genome Res       Date:  2005-02       Impact factor: 9.043

6.  All are not equal: a benchmark of different homology modeling programs.

Authors:  Björn Wallner; Arne Elofsson
Journal:  Protein Sci       Date:  2005-05       Impact factor: 6.725

7.  Spatiotemporal control of spindle midzone formation by PRC1 in human cells.

Authors:  Changjun Zhu; Eric Lau; Robert Schwarzenbacher; Ella Bossy-Wetzel; Wei Jiang
Journal:  Proc Natl Acad Sci U S A       Date:  2006-04-07       Impact factor: 11.205

8.  How well can the accuracy of comparative protein structure models be predicted?

Authors:  David Eramian; Narayanan Eswar; Min-Yi Shen; Andrej Sali
Journal:  Protein Sci       Date:  2008-10-01       Impact factor: 6.725

9.  High-throughput protein production for X-ray crystallography and use of size exclusion chromatography to validate or refute computational biological unit predictions.

Authors:  Daniel McMullan; Jaume M Canaves; Kevin Quijano; Polat Abdubek; Edward Nigoghossian; Justin Haugen; Heath E Klock; Juli Vincent; Joanna Hale; Jessica Paulsen; Scott A Lesley
Journal:  J Struct Funct Genomics       Date:  2005

10.  Improving pairwise sequence alignment accuracy using near-optimal protein sequence alignments.

Authors:  Michael L Sierk; Michael E Smoot; Ellen J Bass; William R Pearson
Journal:  BMC Bioinformatics       Date:  2010-03-22       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.