Literature DB >> 11835508

A study on protein sequence alignment quality.

Arne Elofsson1.   

Abstract

One of the most central methods in bioinformatics is the alignment of two protein or DNA sequences. However, so far large-scale benchmarks examining the quality of these alignments are scarce. On the other hand, recently several large-scale studies of the capacity of different methods to identify related sequences has led to new insights about the performance of fold recognition methods. To increase our understanding about fold recognition methods, we present a large-scale benchmark of alignment quality. We compare alignments from several different alignment methods, including sequence alignments, hidden Markov models, PSI-BLAST, CLUSTALW, and threading methods. For most methods, the alignment quality increases significantly at about 20% sequence identity. The difference in alignment quality between different methods is quite small, and the main difference can be seen at the exact positioning of the sharp rise in alignment quality, that is, around 15-20% sequence identity. The alignments are improved by using structural information. In general, the best alignments are obtained by methods that use predicted secondary structure information and sequence profiles obtained from PSI-BLAST. One interesting observation is that for different pairs many different methods create the best alignments. This finding implies that if a method that could select the best alignment method for each pair existed, a significant improvement of the alignment quality could be gained. Copyright 2002 Wiley-Liss, Inc.

Mesh:

Year:  2002        PMID: 11835508     DOI: 10.1002/prot.10043

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  18 in total

1.  Comparative protein structure modeling by iterative alignment, model building and model assessment.

Authors:  Bino John; Andrej Sali
Journal:  Nucleic Acids Res       Date:  2003-07-15       Impact factor: 16.971

2.  Wurst: a protein threading server with a structural scoring function, sequence profiles and optimized substitution matrices.

Authors:  Andrew E Torda; James B Procter; Thomas Huber
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

3.  Assessing strategies for improved superfamily recognition.

Authors:  Ian Sillitoe; Mark Dibley; James Bray; Sarah Addou; Christine Orengo
Journal:  Protein Sci       Date:  2005-06-03       Impact factor: 6.725

4.  On the accuracy of homology modeling and sequence alignment methods applied to membrane proteins.

Authors:  Lucy R Forrest; Christopher L Tang; Barry Honig
Journal:  Biophys J       Date:  2006-04-28       Impact factor: 4.033

5.  LigProf: a simple tool for in silico prediction of ligand-binding sites.

Authors:  Grzegorz Koczyk; Lucjan S Wyrwicz; Leszek Rychlewski
Journal:  J Mol Model       Date:  2007-01-03       Impact factor: 1.810

6.  Three globin lineages belonging to two structural classes in genomes from the three kingdoms of life.

Authors:  Serge N Vinogradov; David Hoogewijs; Xavier Bailly; Raúl Arredondo-Peter; Michel Guertin; Julian Gough; Sylvia Dewilde; Luc Moens; Jacques R Vanfleteren
Journal:  Proc Natl Acad Sci U S A       Date:  2005-08-01       Impact factor: 11.205

7.  Overcoming sequence misalignments with weighted structural superposition.

Authors:  Nickolay A Khazanov; Kelly L Damm-Ganamet; Daniel X Quang; Heather A Carlson
Journal:  Proteins       Date:  2012-07-28

8.  A comparison of different functions for predicted protein model quality assessment.

Authors:  Juan Li; Huisheng Fang
Journal:  J Comput Aided Mol Des       Date:  2016-08-03       Impact factor: 3.686

9.  Identifying functionally informative evolutionary sequence profiles.

Authors:  Nelson Gil; Andras Fiser
Journal:  Bioinformatics       Date:  2018-04-15       Impact factor: 6.937

10.  Virtual interactomics of proteins from biochemical standpoint.

Authors:  Jaroslav Kubrycht; Karel Sigler; Pavel Souček
Journal:  Mol Biol Int       Date:  2012-08-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.