Literature DB >> 15152092

Scoring profile-to-profile sequence alignments.

Guoli Wang1, Roland L Dunbrack.   

Abstract

Sequence alignment profiles have been shown to be very powerful in creating accurate sequence alignments. Profiles are often used to search a sequence database with a local alignment algorithm. More accurate and longer alignments have been obtained with profile-to-profile comparison. There are several steps that must be performed in creating profile-profile alignments, and each involves choices in parameters and algorithms. These steps include (1) what sequences to include in a multiple alignment used to build each profile, (2) how to weight similar sequences in the multiple alignment and how to determine amino acid frequencies from the weighted alignment, (3) how to score a column from one profile aligned to a column of the other profile, (4) how to score gaps in the profile-profile alignment, and (5) how to include structural information. Large-scale benchmarks consisting of pairs of homologous proteins with structurally determined sequence alignments are necessary for evaluating the efficacy of each scoring scheme. With such a benchmark, we have investigated the properties of profile-profile alignments and found that (1) with optimized gap penalties, most column-column scoring functions behave similarly to one another in alignment accuracy; (2) some functions, however, have much higher search sensitivity and specificity; (3) position-specific weighting schemes in determining amino acid counts in columns of multiple sequence alignments are better than sequence-specific schemes; (4) removing positions in the profile with gaps in the query sequence results in better alignments; and (5) adding predicted and known secondary structure information improves alignments.

Mesh:

Year:  2004        PMID: 15152092      PMCID: PMC2279992          DOI: 10.1110/ps.03601504

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  32 in total

1.  Knowledge-based protein secondary structure assignment.

Authors:  D Frishman; P Argos
Journal:  Proteins       Date:  1995-12

2.  Assessing the performance of fold recognition methods by means of a comprehensive benchmark.

Authors:  D Fischer; A Elofsson; D Rice; D Eisenberg
Journal:  Pac Symp Biocomput       Date:  1996

3.  Searching databases of conserved sequence regions by aligning protein multiple-alignments.

Authors:  S Pietrokovski
Journal:  Nucleic Acids Res       Date:  1996-10-01       Impact factor: 16.971

4.  Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology.

Authors:  K Sjölander; K Karplus; M Brown; R Hughey; A Krogh; I S Mian; D Haussler
Journal:  Comput Appl Biosci       Date:  1996-08

Review 5.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

6.  SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Authors:  A G Murzin; S E Brenner; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1995-04-07       Impact factor: 5.469

7.  Protein structure comparison by alignment of distance matrices.

Authors:  L Holm; C Sander
Journal:  J Mol Biol       Date:  1993-09-05       Impact factor: 5.469

8.  Performance evaluation of amino acid substitution matrices.

Authors:  S Henikoff; J G Henikoff
Journal:  Proteins       Date:  1993-09

9.  Position-based sequence weights.

Authors:  S Henikoff; J G Henikoff
Journal:  J Mol Biol       Date:  1994-11-04       Impact factor: 5.469

10.  Fold and function predictions for Mycoplasma genitalium proteins.

Authors:  L Rychlewski; B Zhang; A Godzik
Journal:  Fold Des       Date:  1998
View more
  40 in total

1.  An assessment of substitution scores for protein profile-profile comparison.

Authors:  Xugang Ye; Guoli Wang; Stephen F Altschul
Journal:  Bioinformatics       Date:  2011-10-13       Impact factor: 6.937

2.  Relative packing groups in template-based structure prediction: cooperative effects of true positive constraints.

Authors:  Ryan Day; Xiaotao Qu; Rosemarie Swanson; Zach Bohannan; Robert Bliss; Jerry Tsai
Journal:  J Comput Biol       Date:  2011-01       Impact factor: 1.479

3.  Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments.

Authors:  Hongyi Zhou; Yaoqi Zhou
Journal:  Proteins       Date:  2005-02-01

Review 4.  The limits of protein sequence comparison?

Authors:  William R Pearson; Michael L Sierk
Journal:  Curr Opin Struct Biol       Date:  2005-06       Impact factor: 6.809

5.  Identification of correct regions in protein models using structural, alignment, and consensus information.

Authors:  Björn Wallner; Arne Elofsson
Journal:  Protein Sci       Date:  2006-03-07       Impact factor: 6.725

6.  Discrimination between distant homologs and structural analogs: lessons from manually constructed, reliable data sets.

Authors:  Hua Cheng; Bong-Hyun Kim; Nick V Grishin
Journal:  J Mol Biol       Date:  2008-01-05       Impact factor: 5.469

7.  Using multiple templates to improve quality of homology models in automated homology modeling.

Authors:  Per Larsson; Björn Wallner; Erik Lindahl; Arne Elofsson
Journal:  Protein Sci       Date:  2008-04-25       Impact factor: 6.725

8.  Comparative protein structure modeling using Modeller.

Authors:  Ben Webb; Andrej Sali; Narayanan Eswar; Marc A Marti-Renom; M S Madhusudhan; David Eramian; Min-Yi Shen; Ursula Pieper
Journal:  Curr Protoc Bioinformatics       Date:  2006-10

9.  SATCHMO-JS: a webserver for simultaneous protein multiple sequence alignment and phylogenetic tree construction.

Authors:  Raffi Hagopian; John R Davidson; Ruchira S Datta; Bushra Samad; Glen R Jarvis; Kimmen Sjölander
Journal:  Nucleic Acids Res       Date:  2010-04-29       Impact factor: 16.971

10.  A model for protein sequence evolution based on selective pressure for protein stability: application to hemoglobins.

Authors:  Lorraine Marsh
Journal:  Evol Bioinform Online       Date:  2009-08-27       Impact factor: 1.625

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.