Literature DB >> 15304646

Approximate protein structural alignment in polynomial time.

Rachel Kolodny1, Nathan Linial.   

Abstract

Alignment of protein structures is a fundamental task in computational molecular biology. Good structural alignments can help detect distant evolutionary relationships that are hard or impossible to discern from protein sequences alone. Here, we study the structural alignment problem as a family of optimization problems and develop an approximate polynomial-time algorithm to solve them. For a commonly used scoring function, the algorithm runs in O(n(10)/epsilon(6)) time, for globular protein of length n, and it detects alignments that score within an additive error of epsilon from all optima. Thus, we prove that this task is computationally feasible, although the method that we introduce is too slow to be a useful everyday tool. We argue that such approximate solutions are, in fact, of greater interest than exact ones because of the noisy nature of experimentally determined protein coordinates. The measurement of similarity between a pair of protein structures used by our algorithm involves the Euclidean distance between the structures (appropriately rigidly transformed). We show that an alternative approach, which relies on internal distance matrices, must incorporate sophisticated geometric ingredients if it is to guarantee optimality and run in polynomial time. We use these observations to visualize the scoring function for several real instances of the problem. Our investigations yield insights on the computational complexity of protein alignment under various scoring functions. These insights can be used in the design of scoring functions for which the optimum can be approximated efficiently and perhaps in the development of efficient algorithms for the multiple structural alignment problem.

Mesh:

Substances:

Year:  2004        PMID: 15304646      PMCID: PMC514457          DOI: 10.1073/pnas.0404383101

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  28 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Computational methods for the structural alignment of molecules.

Authors:  C Lemmen; T Lengauer
Journal:  J Comput Aided Mol Des       Date:  2000-03       Impact factor: 3.686

Review 3.  Molecular dynamics simulations in biology.

Authors:  M Karplus; G A Petsko
Journal:  Nature       Date:  1990-10-18       Impact factor: 49.962

4.  Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques.

Authors:  R Nussinov; H J Wolfson
Journal:  Proc Natl Acad Sci U S A       Date:  1991-12-01       Impact factor: 11.205

5.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path.

Authors:  I N Shindyalov; P E Bourne
Journal:  Protein Eng       Date:  1998-09

6.  Regression analysis of multiple protein structures.

Authors:  T D Wu; S C Schmidler; T Hastie; D L Brutlag
Journal:  J Comput Biol       Date:  1998       Impact factor: 1.479

7.  The structural alignment between two proteins: is there a unique answer?

Authors:  A Godzik
Journal:  Protein Sci       Date:  1996-07       Impact factor: 6.725

8.  Threading a database of protein cores.

Authors:  T Madej; J F Gibrat; S H Bryant
Journal:  Proteins       Date:  1995-11

9.  Families and the structural relatedness among globular proteins.

Authors:  D P Yee; K A Dill
Journal:  Protein Sci       Date:  1993-06       Impact factor: 6.725

10.  The FSSP database of structurally aligned protein fold families.

Authors:  L Holm; C Sander
Journal:  Nucleic Acids Res       Date:  1994-09       Impact factor: 16.971

View more
  26 in total

1.  Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures.

Authors:  Rachel Kolodny; Patrice Koehl; Michael Levitt
Journal:  J Mol Biol       Date:  2005-01-16       Impact factor: 5.469

2.  Finding nearly optimal GDT scores.

Authors:  Shuai Cheng Li; Dongbo Bu; Jinbo Xu; Ming Li
Journal:  J Comput Biol       Date:  2011-05       Impact factor: 1.479

3.  iPARTS: an improved tool of pairwise alignment of RNA tertiary structures.

Authors:  Chih-Wei Wang; Kun-Tze Chen; Chin Lung Lu
Journal:  Nucleic Acids Res       Date:  2010-05-27       Impact factor: 16.971

4.  FlexSnap: flexible non-sequential protein structure alignment.

Authors:  Saeed Salem; Mohammed J Zaki; Chris Bystroff
Journal:  Algorithms Mol Biol       Date:  2010-01-04       Impact factor: 1.405

5.  Distance matrix-based approach to protein structure prediction.

Authors:  Andrzej Kloczkowski; Robert L Jernigan; Zhijun Wu; Guang Song; Lei Yang; Andrzej Kolinski; Piotr Pokarowski
Journal:  J Struct Funct Genomics       Date:  2009-02-18

6.  Structural alphabets for protein structure classification: a comparison study.

Authors:  Quan Le; Gianluca Pollastri; Patrice Koehl
Journal:  J Mol Biol       Date:  2008-12-25       Impact factor: 5.469

7.  On the difference in quality between current heuristic and optimal solutions to the protein structure alignment problem.

Authors:  Mauricio Arriagada; Aleksandar Poleksic
Journal:  Biomed Res Int       Date:  2012-12-23       Impact factor: 3.411

8.  Efficient protein alignment algorithm for protein search.

Authors:  Zaixin Lu; Zhiyu Zhao; Bin Fu
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

9.  FASTR3D: a fast and accurate search tool for similar RNA 3D structures.

Authors:  Chin-En Lai; Ming-Yuan Tsai; Yun-Chen Liu; Chih-Wei Wang; Kun-Tze Chen; Chin Lung Lu
Journal:  Nucleic Acids Res       Date:  2009-05-12       Impact factor: 16.971

10.  The difficulty of protein structure alignment under the RMSD.

Authors:  Shuai Cheng Li
Journal:  Algorithms Mol Biol       Date:  2013-01-04       Impact factor: 1.405

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.