Literature DB >> 16260755

A novel approach to structural alignment using realistic structural and environmental information.

Yu Chen1, Gordon M Crippen.   

Abstract

In the era of structural genomics, it is necessary to generate accurate structural alignments in order to build good templates for homology modeling. Although a great number of structural alignment algorithms have been developed, most of them ignore intermolecular interactions during the alignment procedure. Therefore, structures in different oligomeric states are barely distinguishable, and it is very challenging to find correct alignment in coil regions. Here we present a novel approach to structural alignment using a clique finding algorithm and environmental information (SAUCE). In this approach, we build the alignment based on not only structural coordinate information but also realistic environmental information extracted from biological unit files provided by the Protein Data Bank (PDB). At first, we eliminate all environmentally unfavorable pairings of residues. Then we identify alignments in core regions via a maximal clique finding algorithm. Two extreme value distribution (EVD) form statistics have been developed to evaluate core region alignments. With an optional extension step, global alignment can be derived based on environment-based dynamic programming linking. We show that our method is able to differentiate three-dimensional structures in different oligomeric states, and is able to find flexible alignments between multidomain structures without predetermined hinge regions. The overall performance is also evaluated on a large scale by comparisons to current structural classification databases as well as to other alignment methods.

Mesh:

Substances:

Year:  2005        PMID: 16260755      PMCID: PMC2253243          DOI: 10.1110/ps.051428205

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  29 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Structural genomics programs at the US National Institute of General Medical Sciences.

Authors:  J C Norvell; A Z Machalek
Journal:  Nat Struct Biol       Date:  2000-11

3.  FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties.

Authors:  J Shi; T L Blundell; K Mizuguchi
Journal:  J Mol Biol       Date:  2001-06-29       Impact factor: 5.469

4.  An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.

Authors:  A S Yang; B Honig
Journal:  J Mol Biol       Date:  2000-08-18       Impact factor: 5.469

5.  Analysis of protein sequence/structure similarity relationships.

Authors:  Hin Hark Gan; Rebecca A Perlow; Sharmili Roy; Joy Ko; Min Wu; Jing Huang; Shixiang Yan; Angelo Nicoletta; Jonathan Vafai; Ding Sun; Lihua Wang; Joyce E Noah; Samuela Pasquali; Tamar Schlick
Journal:  Biophys J       Date:  2002-11       Impact factor: 4.033

6.  In search for more accurate alignments in the twilight zone.

Authors:  Lukasz Jaroszewski; Weizhong Li; Adam Godzik
Journal:  Protein Sci       Date:  2002-07       Impact factor: 6.725

7.  Use of receiver operating characteristic (ROC) analysis to evaluate sequence matching.

Authors:  M Gribskov; N L Robinson
Journal:  Comput Chem       Date:  1996-03

8.  Comparison of protein structures using 3D profile alignment.

Authors:  M Suyama; Y Matsuo; K Nishikawa
Journal:  J Mol Evol       Date:  1997       Impact factor: 2.395

9.  A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence.

Authors:  D W Rice; D Eisenberg
Journal:  J Mol Biol       Date:  1997-04-11       Impact factor: 5.469

10.  Size-independent comparison of protein three-dimensional structures.

Authors:  V N Maiorov; G M Crippen
Journal:  Proteins       Date:  1995-07
View more
  4 in total

1.  The SALAMI protein structure search server.

Authors:  Thomas Margraf; Gundolf Schenk; Andrew E Torda
Journal:  Nucleic Acids Res       Date:  2009-05-22       Impact factor: 16.971

2.  Evolutionary inaccuracy of pairwise structural alignments.

Authors:  M I Sadowski; W R Taylor
Journal:  Bioinformatics       Date:  2012-03-06       Impact factor: 6.937

3.  Comparative analysis of protein structure alignments.

Authors:  Gabriele Mayr; Francisco S Domingues; Peter Lackner
Journal:  BMC Struct Biol       Date:  2007-07-26

4.  Protein sequence and structure alignments within one framework.

Authors:  Gundolf Schenk; Thomas Margraf; Andrew E Torda
Journal:  Algorithms Mol Biol       Date:  2008-04-01       Impact factor: 1.405

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.