Literature DB >> 8819751

A non-local gap-penalty for profile alignment.

W R Taylor1.   

Abstract

The length of an alignment of biological sequences is typically longer than the mean length of its component sequences. (This arises from the insertion of gaps in the alignment.) When such an alignment is used as a profile for the alignment of further sequences (or profiles), it will have a bias toward additional sequences that match the length of the profile, rather than the mean length of sequences in the profile, as the alignment of these will entail fewer (or smaller) insertions (so avoiding gap-penalties). An algorithm is described to correct this bias that entails monitoring the correspondence, for every pair of positions, of the mean separations in both profiles as they are aligned. The correction was incorporated into a standard dynamic programming algorithm through a modification of the gap-penalty, but, unlike other approaches, this modification is not local and takes into consideration the overall alignment of the sequences. This implies that the algorithm cannot guarantee to find the optimal alignment, but tests suggest that close approximations are obtained. The method was tested on protein families by measuring the area in the parameter space of the phase containing the correct multiple alignment. No improvement (increase in phase area) was found with a family that required few gaps to be aligned correctly. However, for highly gapped alignments, a 50% increase in area was obtained with one family and the correct alignment was found for another that could not be aligned with the unbiased method.

Mesh:

Substances:

Year:  1996        PMID: 8819751     DOI: 10.1007/bf02458279

Source DB:  PubMed          Journal:  Bull Math Biol        ISSN: 0092-8240            Impact factor:   1.758


  18 in total

Review 1.  SH3--an abundant protein domain in search of a function.

Authors:  A Musacchio; T Gibson; V P Lehto; M Saraste
Journal:  FEBS Lett       Date:  1992-07-27       Impact factor: 4.124

Review 2.  A template based method of pattern matching in protein sequences.

Authors:  W R Taylor
Journal:  Prog Biophys Mol Biol       Date:  1989       Impact factor: 3.667

Review 3.  Origins and evolutionary relationships of retroviruses.

Authors:  R F Doolittle; D F Feng; M S Johnson; M A McClure
Journal:  Q Rev Biol       Date:  1989-03       Impact factor: 4.875

4.  Evaluation and improvements in the automatic alignment of protein sequences.

Authors:  G J Barton; M J Sternberg
Journal:  Protein Eng       Date:  1987 Feb-Mar

5.  Alignment of the amino acid sequences of distantly related proteins using variable gap penalties.

Authors:  A M Lesk; M Levitt; C Chothia
Journal:  Protein Eng       Date:  1986 Oct-Nov

6.  Profile analysis: detection of distantly related proteins.

Authors:  M Gribskov; A D McLachlan; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1987-07       Impact factor: 11.205

7.  CLUSTAL: a package for performing multiple sequence alignment on a microcomputer.

Authors:  D G Higgins; P M Sharp
Journal:  Gene       Date:  1988-12-15       Impact factor: 3.688

8.  Progressive sequence alignment as a prerequisite to correct phylogenetic trees.

Authors:  D F Feng; R F Doolittle
Journal:  J Mol Evol       Date:  1987       Impact factor: 2.395

9.  The protein threading problem with sequence amino acid interaction preferences is NP-complete.

Authors:  R H Lathrop
Journal:  Protein Eng       Date:  1994-09

10.  Comparative analysis of multiple protein-sequence alignment methods.

Authors:  M A McClure; T K Vasi; W M Fitch
Journal:  Mol Biol Evol       Date:  1994-07       Impact factor: 16.240

View more
  1 in total

1.  Reduction, alignment and visualisation of large diverse sequence families.

Authors:  William R Taylor
Journal:  BMC Bioinformatics       Date:  2016-08-02       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.