Literature DB >> 22733542

Overcoming sequence misalignments with weighted structural superposition.

Nickolay A Khazanov1, Kelly L Damm-Ganamet, Daniel X Quang, Heather A Carlson.   

Abstract

An appropriate structural superposition identifies similarities and differences between homologous proteins that are not evident from sequence alignments alone. We have coupled our Gaussian-weighted RMSD (wRMSD) tool with a sequence aligner and seed extension (SE) algorithm to create a robust technique for overlaying structures and aligning sequences of homologous proteins (HwRMSD). HwRMSD overcomes errors in the initial sequence alignment that would normally propagate into a standard RMSD overlay. SE can generate a corrected sequence alignment from the improved structural superposition obtained by wRMSD. HwRMSD's robust performance and its superiority over standard RMSD are demonstrated over a range of homologous proteins. Its better overlay results in corrected sequence alignments with good agreement to HOMSTRAD. Finally, HwRMSD is compared to established structural alignment methods: FATCAT, secondary-structure matching, combinatorial extension, and Dalilite. Most methods are comparable at placing residue pairs within 2 Å, but HwRMSD places many more residue pairs within 1 Å, providing a clear advantage. Such high accuracy is essential in drug design, where small distances can have a large impact on computational predictions. This level of accuracy is also needed to correct sequence alignments in an automated fashion, especially for omics-scale analysis. HwRMSD can align homologs with low-sequence identity and large conformational differences, cases where both sequence-based and structural-based methods may fail. The HwRMSD pipeline overcomes the dependency of structural overlays on initial sequence pairing and removes the need to determine the best sequence-alignment method, substitution matrix, and gap parameters for each unique pair of homologs.
Copyright © 2012 Wiley Periodicals, Inc.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22733542      PMCID: PMC3464347          DOI: 10.1002/prot.24134

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  45 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Twilight zone of protein sequence alignments.

Authors:  B Rost
Journal:  Protein Eng       Date:  1999-02

Review 3.  Protein families and their evolution-a structural perspective.

Authors:  Christine A Orengo; Janet M Thornton
Journal:  Annu Rev Biochem       Date:  2005       Impact factor: 23.643

4.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path.

Authors:  I N Shindyalov; P E Bourne
Journal:  Protein Eng       Date:  1998-09

5.  Conformational variability in the refined structure of the chaperonin GroEL at 2.8 A resolution.

Authors:  K Braig; P D Adams; A T Brünger
Journal:  Nat Struct Biol       Date:  1995-12

6.  HOMSTRAD: a database of protein structure alignments for homologous families.

Authors:  K Mizuguchi; C M Deane; T L Blundell; J P Overington
Journal:  Protein Sci       Date:  1998-11       Impact factor: 6.725

7.  Crystal structure of the alpha appendage of AP-2 reveals a recruitment platform for clathrin-coat assembly.

Authors:  L M Traub; M A Downs; J L Westrich; D H Fremont
Journal:  Proc Natl Acad Sci U S A       Date:  1999-08-03       Impact factor: 11.205

8.  The structure and function of the beta 2-adaptin appendage domain.

Authors:  D J Owen; Y Vallis; B M Pearse; H T McMahon; P R Evans
Journal:  EMBO J       Date:  2000-08-15       Impact factor: 11.598

9.  The crystal structure of phenol hydroxylase in complex with FAD and phenol provides evidence for a concerted conformational change in the enzyme and its cofactor during catalysis.

Authors:  C Enroth; H Neujahr; G Schneider; Y Lindqvist
Journal:  Structure       Date:  1998-05-15       Impact factor: 5.006

10.  MMDB: annotating protein sequences with Entrez's 3D-structure database.

Authors:  Yanli Wang; Kenneth J Addess; Jie Chen; Lewis Y Geer; Jane He; Siqian He; Shennan Lu; Thomas Madej; Aron Marchler-Bauer; Paul A Thiessen; Naigong Zhang; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2006-11-29       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.