Literature DB >> 14962936

A comparison of scoring functions for protein sequence profile alignment.

Robert C Edgar1, Kimmen Sjölander.   

Abstract

MOTIVATION: In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence-sequence methods (e.g. BLAST) and profile-sequence methods (e.g. PSI-BLAST). Profile-profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile-profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile-profile scoring functions by comparing alignments of 488 pairs of sequences with identity < or =30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence-sequence and sequence-profile methods, including BLAST and PSI-BLAST.
RESULTS: We find that profile-profile alignment gives an average improvement over our test set of typically 2-3% over profile-sequence alignment and approximately 40% over sequence-sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments. AVAILABILITY: Source code, reference alignments and more detailed results are freely available at http://phylogenomics.berkeley.edu/profilealignment/

Mesh:

Year:  2004        PMID: 14962936     DOI: 10.1093/bioinformatics/bth090

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  42 in total

1.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

2.  An assessment of substitution scores for protein profile-profile comparison.

Authors:  Xugang Ye; Guoli Wang; Stephen F Altschul
Journal:  Bioinformatics       Date:  2011-10-13       Impact factor: 6.937

Review 3.  The limits of protein sequence comparison?

Authors:  William R Pearson; Michael L Sierk
Journal:  Curr Opin Struct Biol       Date:  2005-06       Impact factor: 6.809

4.  BCL::Align-sequence alignment and fold recognition with a custom scoring function online.

Authors:  Elizabeth Dong; Jarrod Smith; Sten Heinze; Nathan Alexander; Jens Meiler
Journal:  Gene       Date:  2008-06-07       Impact factor: 3.688

5.  Benchmarking of TASSER_2.0: an improved protein structure prediction algorithm with more accurate predicted contact restraints.

Authors:  Seung Yup Lee; Jeffrey Skolnick
Journal:  Biophys J       Date:  2008-05-16       Impact factor: 4.033

6.  Comparative protein structure modeling using Modeller.

Authors:  Ben Webb; Andrej Sali; Narayanan Eswar; Marc A Marti-Renom; M S Madhusudhan; David Eramian; Min-Yi Shen; Ursula Pieper
Journal:  Curr Protoc Bioinformatics       Date:  2006-10

7.  Systematic assessment of accuracy of comparative model of proteins belonging to different structural fold classes.

Authors:  Suvobrata Chakravarty; Dario Ghersi; Roberto Sanchez
Journal:  J Mol Model       Date:  2011-02-08       Impact factor: 1.810

8.  INTREPID: a web server for prediction of functionally important residues by evolutionary analysis.

Authors:  Sriram Sankararaman; Bryan Kolaczkowski; Kimmen Sjölander
Journal:  Nucleic Acids Res       Date:  2009-05-13       Impact factor: 16.971

9.  CORAL: aligning conserved core regions across domain families.

Authors:  Jessica H Fong; Aron Marchler-Bauer
Journal:  Bioinformatics       Date:  2009-05-26       Impact factor: 6.937

10.  A modular kernel approach for integrative analysis of protein domain boundaries.

Authors:  Paul D Yoo; Bing Bing Zhou; Albert Y Zomaya
Journal:  BMC Genomics       Date:  2009-12-03       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.