Literature DB >> 7602593

An assessment of amino acid exchange matrices in aligning protein sequences: the twilight zone revisited.

G Vogt1, T Etzold, P Argos.   

Abstract

The sensitivity of most protein sequence alignment methods depends strongly on the quality of the comparison matrices used. These matrices, which assign weights or similarity scores to every possible amino acid substitution pair, are utilized to differentiate amongst the various possible alignments of two or more sequences. There are many ways to generate these exchange weights and new matrices are constantly published. There has been no overall assessment of these various matrices when applied in different alignment techniques and over many protein folds and families, both close and distant and with the use of several gap penalty values. In this work, a set of amino acid sequences matched by superposition of known protein tertiary topologies is used to test the alignment accuracy of the different method/matrix/penalty combinations. The comparisons show relatively similar results for the top scoring matrices, a preference for the global alignment method of Needleman and Wunsch, and the importance of matrix modification and optimized gap penalties. The relationship between the percentage identity in a resulting alignment and the level of correctness to be expected are given for the top-performing matrix, resulting in a better definition of the so-called "twilight zone". Estimates are made for the probability that two sequences, aligned at a certain level of residue percentage identity, are in fact unrelated.

Mesh:

Substances:

Year:  1995        PMID: 7602593     DOI: 10.1006/jmbi.1995.0340

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  46 in total

1.  The identification of conserved interactions within the SH3 domain by alignment of sequences and structures.

Authors:  S M Larson; A R Davidson
Journal:  Protein Sci       Date:  2000-11       Impact factor: 6.725

2.  DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches.

Authors:  J D Thompson; F Plewniak; J Thierry; O Poch
Journal:  Nucleic Acids Res       Date:  2000-08-01       Impact factor: 16.971

3.  Improving the quality of twilight-zone alignments.

Authors:  L Jaroszewski; L Rychlewski; A Godzik
Journal:  Protein Sci       Date:  2000-08       Impact factor: 6.725

4.  In search for more accurate alignments in the twilight zone.

Authors:  Lukasz Jaroszewski; Weizhong Li; Adam Godzik
Journal:  Protein Sci       Date:  2002-07       Impact factor: 6.725

5.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

6.  Sequence conserved for subcellular localization.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

7.  Contact-based sequence alignment.

Authors:  Jens Kleinjung; John Romein; Kuang Lin; Jaap Heringa
Journal:  Nucleic Acids Res       Date:  2004-04-30       Impact factor: 16.971

8.  Aligning protein sequence and analysing substitution pattern using a class-specific matrix.

Authors:  Hai Song Xu; Wen Ke Ren; Xiao Hui Liu; Xiao Qin Li
Journal:  J Biosci       Date:  2010-06       Impact factor: 1.826

9.  Evolutionarily conserved regions and hydrophobic contacts at the superfamily level: The case of the fold-type I, pyridoxal-5'-phosphate-dependent enzymes.

Authors:  Alessandro Paiardini; Francesco Bossa; Stefano Pascarella
Journal:  Protein Sci       Date:  2004-11       Impact factor: 6.725

10.  The structure of serine hydroxymethyltransferase as modeled by homology and validated by site-directed mutagenesis.

Authors:  S Pascarella; S Angelaccio; R Contestabile; S Delle Fratte; M Di Salvo; F Bossa
Journal:  Protein Sci       Date:  1998-09       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.